如何在pandas数据框中删除字符串的部分？

huangapple

117266
文章

0
评论

2020年1月6日 21:40:08go评论118阅读模式

英文:

How to remove section of string in pandas dataframe?

问题

以下是翻译好的部分：

要做的是在数据框列中删除字符串中'of'之前的所有文本。例如：

ColA          ColB 
 1       '12 miles ESE of Jackson,MS'
 2       '8 miles NE of New York, NY'
 3       '223 miles SW of Atlanta, GA'

我想要的是这样的结果：

ColA           ColB 
 1           'Jackson,MS'
 2           'New York,NY'
 3           'Atlanta,GA'

谢谢！

英文:

What I am looking to do is remove all text before the work 'of' in a string in a dataframe column. For example:

ColA          ColB 
 1       &#39;12 miles ESE of Jackson,MS&#39;
 2       &#39;8 miles NE of New York, NY&#39;
 3       &#39;223 miles SW of Atlanta, GA&#39;

What I am looking to get is this:

ColA           ColB 
 1           &#39;Jackson,MS&#39;
 2           &#39;New York,NY&#39;
 3           &#39;Atlanta,GA&#39;

Thank you!

答案1

得分: 3

你可以执行：

df['ColB'] = df['ColB'].str.split('of').str[1]

英文:

You can do:

df[&#39;ColB&#39;] = df[&#39;ColB&#39;].str.split(&#39;of&#39;).str[1]

答案2

得分: 2

"'" + df['ColB'].str.extract("of\s(.*$)") 输出：

0 'Jackson,MS'
1 'New York, NY'
2 'Atlanta, GA'

英文:

Try, using regex and .str.extract:

&quot;&#39;&quot; + df[&#39;ColB&#39;].str.extract(&quot;of\s(.*$)&quot;)

Output:

                0
0    &#39;Jackson,MS&#39;
1  &#39;New York, NY&#39;
2   &#39;Atlanta, GA&#39;

答案3

得分: 1

使用 .replace 方法：

df.ColB = df.ColB.replace(r'.*of (.*?)', '\\1', regex=True)

然后你的 ColB 列将会是：

    ColB
0  Jackson,MS
1  New York, NY
2  Atlanta, GA

英文:

Use .replace:

df.ColB = df.ColB.replace(r&#39;.*of (.*)&#39;, &#39;\&#39;, regex=True)

then your ColB will be

	ColB
0 	Jackson,MS
1 	New York, NY
2 	Atlanta, GA

通过集体智慧和协作来改善编程学习和解决问题的方式。致力于成为全球开发者共同参与的知识库，让每个人都能够通过互相帮助和分享经验来进步。

本文由 huangapple 发表于 2020年1月6日 21:40:08
转载请务必保留本文链接：https://go.coder-hub.com/59613160.html

dataframe
pandas
python

“decorator design” 和 “template design” 在Python中有什么区别？

go 99 06/15

Django：如何根据月份过滤 Django 对象？

go 91 04/06

Python Polars：如何在应用循环中添加进度条

go 156 02/24

Customtkinter: 为什么这个 event.widget 丢失了正确的网格信息？

go 89 02/06

如何在pandas数据框中删除字符串的部分？

问题

答案1

答案2

答案3

“decorator design” 和 “template design” 在Python中有什么区别？

Django：如何根据月份过滤 Django 对象？

Python Polars：如何在应用循环中添加进度条

Customtkinter: 为什么这个 event.widget 丢失了正确的网格信息？

如何在Playwright视觉比较中屏蔽多个定位器？

在C++中，可以使用可变模板参数来检索类型的内部类型。

selenium.common.exceptions.StaleElementReferenceException: Message: stale element reference: stale element not found

Creating and opening a URL to log in to Website via Basic Auth with Robot Framework/Selenium (Python)

AG Grid 在上下文菜单中以大文本形式打开

What's the correct way to type hint an empty list as a literal in python?

如何在Highcharts Gantt中更改本地化的星期名称

如何在同一个流中使用多个过滤器和映射函数？

如何使用Map/Set来将代码优化到O(n)？

.NET MAUI Android在GitHub Actions上构建失败，错误代码为1。