从字符串列中提取两列

huangapple

117266
文章

0
评论

2023年3月7日 06:52:32go评论110阅读模式

英文:

Extract two columns from a column of strings

问题

我有一个数据框，其中包含以下格式的字符串

Rondon&#243;polis (c/ 5,2%) 3500 7000 2789 4258

我需要创建两列并保持这种方式。我一直在尝试使用正则表达式，但仍然无法

A	B
Rondonópolis (c/ 5,2%)	3500 7000 2789 4258

英文:

I have a dataframe, which contains strings in this format

Rondon&#243;polis (c/ 5,2%) 3500 7000 2789 4258

I need to create two columns and stay that way. I've been trying to use regex but I still can't

A	B
Rondonópolis (c/ 5,2%)	3500 7000 2789 4258

答案1

得分: 1

使用str.extract来提取两个组：一个是数字（四个四位数），另一个是这些数字之前的所有内容。

df = pd.DataFrame({'my_column': ["Rondonópolis (c/ 5,2%) 3500 7000 2789 4258", "Ponta Grossa 2100 3121 4578 3234"]})
df[['A', 'B']] = df['my_column'].str.extract(r"(.+) (\d{4} \d{4} \d{4} \d{4})")

英文:

Use str.extract to extract two groups: one of the numbers (four 4-digit numbers) and the other everything preceding those numbers.

df = pd.DataFrame({&#39;my_column&#39;: [&quot;Rondon&#243;polis (c/ 5,2%) 3500 7000 2789 4258&quot;, &quot;Ponta Grossa 2100 3121 4578 3234&quot;]})
df[[&#39;A&#39;, &#39;B&#39;]] = df[&#39;my_column&#39;].str.extract(r&quot;(.+) (\d{4} \d{4} \d{4} \d{4})&quot;)

通过集体智慧和协作来改善编程学习和解决问题的方式。致力于成为全球开发者共同参与的知识库，让每个人都能够通过互相帮助和分享经验来进步。

本文由 huangapple 发表于 2023年3月7日 06:52:32
转载请务必保留本文链接：https://go.coder-hub.com/75656560.html

dataframe
pandas
python
regex
string

Simple Python code doesn’t work with Brython

go 96 03/07

参考 polars.DataFrame.height 在 with_columns 中。

go 118 06/12

Pandas DataFrame在特定行之前检查条件

go 105 01/06

语法错误：在 “from keras.utils import to_categorical” 中。

go 107 04/17

从字符串列中提取两列

问题

答案1

Simple Python code doesn’t work with Brython

参考 polars.DataFrame.height 在 with_columns 中。

Pandas DataFrame在特定行之前检查条件

语法错误：在 “from keras.utils import to_categorical” 中。

如何在Playwright视觉比较中屏蔽多个定位器？

在C++中，可以使用可变模板参数来检索类型的内部类型。

selenium.common.exceptions.StaleElementReferenceException: Message: stale element reference: stale element not found

Creating and opening a URL to log in to Website via Basic Auth with Robot Framework/Selenium (Python)

AG Grid 在上下文菜单中以大文本形式打开

What's the correct way to type hint an empty list as a literal in python?

如何在Highcharts Gantt中更改本地化的星期名称

如何在同一个流中使用多个过滤器和映射函数？

如何使用Map/Set来将代码优化到O(n)？

.NET MAUI Android在GitHub Actions上构建失败，错误代码为1。