2023年5月14日 09:08:37go评论97阅读模式

英文:

How to explode Python Pandas Dataframe and merge strings from other dataframe?

问题

Dataframe1中有大量的数据行和列。其中一列是Text。Text列中的某些行包含字符串，其中一些字符串包含了{ExplodeEList2}。

如何将Dataframe1中的这些特定行展开并将每个字符串中的{ExplodeEList2}替换为分开的数据框EList2['Name']中的每个名称？谢谢！我已经整天在尝试解决这个问题了。

Dataframe1：

Text
不相关的数据
随机示例文本 {ExplodeElist2} 和更多随机示例文本。
其他不相关的数据

EList2：

Name
Jack
Jon
Sally

我应该如何在Dataframe1中生成以下结果：

Text
不相关的数据
随机示例文本 Jack 和更多随机示例文本。
随机示例文本 Jon 和更多随机示例文本。
随机示例文本 Sally 和更多随机示例文本。
其他不相关的数据

英文:

Dataframe1 has a lot of rows and columns of data. One column is Text. Certain rows in Text column have strings and some strings include within the strings this {ExplodeEList2}

How to explode (expand) those specific rows of Dataframe1 and replace {ExplodeEList2} in each string with each name contained in the separate dataframe EList2['Name']? Thank you! I've been banging my head against my keyboard all day trying to solve this.

Dataframe1:

Text
Unrelated data
Random sample text {ExplodeElist2} and more random sample text.
Other unrelated data

EList2:

Name
Jack
Jon
Sally

How do I generate this in Dataframe1:

Text
Unrelated data
Random sample text Jack and more random sample text.
Random sample text Jon and more random sample text.
Random sample text Sally and more random sample text.
Other unrelated data

答案1

得分: 1

你可以使用 apply 来处理 DataFrame1 中包含字符串 ExplodeElist2 的所有 Text 值，将该字符串替换为一组替代值。然后，你可以使用 explode 来展开该列表：

mask = DataFrame1['Text'].str.contains('{ExplodeElist2}')
DataFrame1.loc[mask, 'Text'] = DataFrame1.loc[mask, 'Text'].apply(lambda s:展开收缩
])
DataFrame1 = DataFrame1.explode('Text').reset_index(drop=True)

输出（针对你的示例数据）：

                                                Text
0                                     无关数据
1  随机示例文本 Jack 和更多随机示例文本...
2  随机示例文本 Jon 和更多随机示例文本 ...
3  随机示例文本 Sally 和更多随机示例文本...
4                               其他无关数据

英文:

You can use apply to process all the Text values in DataFrame1 which contain the string ExplodeElist2, replacing the string with a list of replaced values. You can then explode that list:

mask = DataFrame1[&#39;Text&#39;].str.contains(&#39;{ExplodeElist2}&#39;)
DataFrame1.loc[mask, &#39;Text&#39;] = DataFrame1.loc[mask, &#39;Text&#39;].apply(lambda s:展开收缩
])
DataFrame1 = DataFrame1.explode(&#39;Text&#39;).reset_index(drop=True)

Output (for your sample data):

                                                Text
0                                     Unrelated data
1  Random sample text Jack and more random sample...
2  Random sample text Jon and more random sample ...
3  Random sample text Sally and more random sampl...
4                               Other unrelated data

通过集体智慧和协作来改善编程学习和解决问题的方式。致力于成为全球开发者共同参与的知识库，让每个人都能够通过互相帮助和分享经验来进步。

如何将Python Pandas数据框拆分并合并来自其他数据框的字符串？

问题

答案1

替换Python中子列表内的数值

无法使用shareplum将列表项“添加”到SharePoint列表。

How to find 2 integers that can form the numerical values of a list and structure the answers in another list?

“Spyder Python | No module named pip” 翻译为 “Spyder Python | 未找到 pip 模块”。

如何在Playwright视觉比较中屏蔽多个定位器？

在C++中，可以使用可变模板参数来检索类型的内部类型。

selenium.common.exceptions.StaleElementReferenceException: Message: stale element reference: stale element not found

Creating and opening a URL to log in to Website via Basic Auth with Robot Framework/Selenium (Python)

AG Grid 在上下文菜单中以大文本形式打开

What's the correct way to type hint an empty list as a literal in python?

如何在Highcharts Gantt中更改本地化的星期名称

如何在同一个流中使用多个过滤器和映射函数？

如何使用Map/Set来将代码优化到O(n)？

.NET MAUI Android在GitHub Actions上构建失败，错误代码为1。