使用`.str.split(expand=True)`为什么会丢失信息？

huangapple

117266
文章

0
评论

2023年1月9日 03:21:50go评论99阅读模式

英文:

Why am I losing information with .str.split(expand=True)?

问题

我正在尝试扩展一个由字符串组成的数据框的列，类似于这样：

ATTGG
CATGC
GTGCC

将其转换为一个新数据框中的多列。

我使用的命令是：

newdf = pd.DataFrame(df['col'].str.split("", expand=True))

在打印时，我发现第一列和第一行实际上是索引：

0 1 2 3 4 5
1 C A T G C
2 G T G C C

而且我的第一行被截断了，可能是因为索引的存在。

为什么我的第一行被截断了？我可以怎么做来修复这个问题？

英文:

I'm trying to expand a column of a dataframe which is made up of strings, something like this:

ATTGG
CATGC
GTGCC

into several columns in a new dataframe.

The command I used is

newdf = pd.DataFrame(df[&#39;col&#39;].str.split(&quot;&quot;, expand = True)

When printing, I found that the first column and the first row are actually the index:

0 1 2 3 4 5
1 C A T G C
2 G T G C C

and that my first row is cut off, presumably because of the presence of the index.

Why is my first row cut off? What can I do to fix this?

答案1

得分: 1

将字符串转换为列表后再创建数据框：

newdf = pd.DataFrame.from_records(df['col'].map(list))
print(newdf)
# 输出
   0  1  2  3  4
0  A  T  T  G  G
1  C  A  T  G  C
2  G  T  G  C  C

英文:

Convert your string to list before creating the dataframe:

newdf = pd.DataFrame.from_records(df[&#39;col&#39;].map(list))
print(newdf)
# Output
   0  1  2  3  4
0  A  T  T  G  G
1  C  A  T  G  C
2  G  T  G  C  C
</details>

通过集体智慧和协作来改善编程学习和解决问题的方式。致力于成为全球开发者共同参与的知识库，让每个人都能够通过互相帮助和分享经验来进步。

本文由 huangapple 发表于 2023年1月9日 03:21:50
转载请务必保留本文链接：https://go.coder-hub.com/75050639.html

dataframe
pandas
python
split
string

ParserError: 数据标记化错误。C错误：在第27行期望1个字段，但看到367个。

go 98 04/04

Flask SQLAlchemy，密码不存储在数据库中

go 96 03/31

Sympy返回log而不是ln。

go 125 01/03

为什么生成的tkinter按钮-1事件无法识别？

go 105 04/13

使用`.str.split(expand=True)`为什么会丢失信息？

问题

答案1

ParserError: 数据标记化错误。C错误：在第27行期望1个字段，但看到367个。

Flask SQLAlchemy，密码不存储在数据库中

Sympy返回log而不是ln。

为什么生成的tkinter按钮-1事件无法识别？

如何在Playwright视觉比较中屏蔽多个定位器？

在C++中，可以使用可变模板参数来检索类型的内部类型。

selenium.common.exceptions.StaleElementReferenceException: Message: stale element reference: stale element not found

Creating and opening a URL to log in to Website via Basic Auth with Robot Framework/Selenium (Python)

AG Grid 在上下文菜单中以大文本形式打开

What's the correct way to type hint an empty list as a literal in python?

如何在Highcharts Gantt中更改本地化的星期名称

如何在同一个流中使用多个过滤器和映射函数？

如何使用Map/Set来将代码优化到O(n)？

.NET MAUI Android在GitHub Actions上构建失败，错误代码为1。