2023年6月8日 06:17:38go评论95阅读模式

英文:

split column with values inside and outside brackets python

问题

"I need to split (in python code) my column "code" into 2 columns:"
- "我需要在Python代码中拆分我的列"code"为2个列："
""outside" with value outside the brackets"
- "“outside”列的值为括号外的部分"
""inside" with value inside the brackets"
- "“inside”列的值为括号内的部分"
"I'd create a "prepared" column by adding a "+" separator after each letter before the number."
- "我会创建一个"prepared"列，通过在每个字母前添加"+"分隔符来实现。"
"| id | code | outside| inside| prepared"
- "| 编号 | 代码 | 外部| 内部| 准备好"
"| 1 | -(83C24H) | - | 83C24H | 83C + 24H"
- "| 1 | -(83C24H) | - | 83C24H | 83C + 24H"
"| 2 | 30(30C14H) | 30 | 30C14H | 30C + 14H"
- "| 2 | 30(30C14H) | 30 | 30C14H | 30C + 14H"
"| 3 | 25 | 25 | 0 | 0"
- "| 3 | 25 | 25 | 0 | 0"
"Thank u!"
- "谢谢！"

英文:

I need to split (in python code) my column "code" into 2 columns:

"outside" with value outside the brackets
"inside" with value inside the brackets

I'd create a "prepared" column by adding a "+" separator after each letter before the number.

id	code	outside	inside	prepared
1	-(83C24H)	-	83C24H	83C + 24H
2	30(30C14H)	30	30C14H	30C + 14H
3	25	25	0	0

Thank u!

答案1

得分: 1

尝试：

df['outside'] = df['code'].str.replace(r'\([^)]*\)', '', regex=True)
df['inside'] = df['code'].str.extract(r'\(([^)]+)')
print(df)

打印：

   id        code outside  inside
0   1   -(83C24H)       -  83C24H
1   2  30(30C14H)      30  30C14H

编辑：使用更新后的数据框：

mask = df['code'].str.contains(r'\(.*\)', regex=True)
df['inside'] = df.loc[mask, 'code'].str.extract(r'\(([^)]+)')
df['outside'] = df.loc[mask, 'code'].str.replace(r'\([^)]*\)', '', regex=True)
df['inside'] = df['inside'].fillna(df['code'])
df['outside'] = df['outside'].fillna('0')
print(df)

打印：

   id        code  inside outside
0   1   -(83C24H)  83C24H       -
1   2  30(30C14H)  30C14H      30
2   3          25      25       0

英文:

Try:

df[&#39;outside&#39;] = df[&#39;code&#39;].str.replace(r&#39;\([^)]*\)&#39;, &#39;&#39;, regex=True)
df[&#39;inside&#39;] = df[&#39;code&#39;].str.extract(r&#39;\(([^)]+)&#39;)
print(df)

Prints:

   id        code outside  inside
0   1   -(83C24H)       -  83C24H
1   2  30(30C14H)      30  30C14H

EDIT: With updated dataframe:

mask = df[&#39;code&#39;].str.contains(r&#39;\(.*\)&#39;, regex=True)
df[&#39;inside&#39;] = df.loc[mask, &#39;code&#39;].str.extract(r&#39;\(([^)]+)&#39;)
df[&#39;outside&#39;] = df.loc[mask, &#39;code&#39;].str.replace(r&#39;\([^)]*\)&#39;, &#39;&#39;, regex=True)
df[&#39;inside&#39;] = df[&#39;inside&#39;].fillna(df[&#39;code&#39;])
df[&#39;outside&#39;] = df[&#39;outside&#39;].fillna(&#39;0&#39;)
print(df)

Prints:

   id        code  inside outside
0   1   -(83C24H)  83C24H       -
1   2  30(30C14H)  30C14H      30
2   3          25      25       0

通过集体智慧和协作来改善编程学习和解决问题的方式。致力于成为全球开发者共同参与的知识库，让每个人都能够通过互相帮助和分享经验来进步。

分割包含括号内外值的列 Python

问题

答案1

创建子表格，基于数据框的列数值。

检测CSV文件中的多个标题

如何使用XLSXWRITER默认显示分页。

重复使用 BigQuery 查询作业作为基础查询，以供进一步操作使用。

如何在Playwright视觉比较中屏蔽多个定位器？

在C++中，可以使用可变模板参数来检索类型的内部类型。

selenium.common.exceptions.StaleElementReferenceException: Message: stale element reference: stale element not found

Creating and opening a URL to log in to Website via Basic Auth with Robot Framework/Selenium (Python)

AG Grid 在上下文菜单中以大文本形式打开

What's the correct way to type hint an empty list as a literal in python?

如何在Highcharts Gantt中更改本地化的星期名称

如何在同一个流中使用多个过滤器和映射函数？

如何使用Map/Set来将代码优化到O(n)？

.NET MAUI Android在GitHub Actions上构建失败，错误代码为1。