May 30, 2023 01:10:15

How to get first name and last name when last name is multiple names in pandas

Question

I have a DataFrame and need to split each name into a first name and a last name. This is as far as I have gotten:

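import pandas as pd
names = pd.Series(['Victor De La Cruz', 'Ashley Smith', 'Angel Miguel Hernandez', 'Hank Hill'])
df = pd.DataFrame()
df['first_name'] = names.str.split().str[0]
df['last_name'] = names.str.split().str[1:]  # everything after the first word (a list)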

Output:

  first_name            last_name
0     Victor       [De, La, Cruz]
1     Ashley              [Smith]
2      Angel  [Miguel, Hernandez]
3       Hank               [Hill]

I tried using df['last_name'].replace('[', '') to strip out each unwanted character, but it didn't work.

Desired output:

  first_name         last_name
0     Victor        De La Cruz
1     Ashley             Smith
2      Angel  Miguel Hernandez
3       Hank              Hill

Any suggestions would be helpful. Thank you!


Answer 1

Score: 1

Just join the lists back into strings:

df['last_name'] = df['last_name'].str.join(' ')

After split(), your Series holds list objects, not strings, which is why .replace() doesn't make sense here. The brackets are only how pandas displays a list; they are not characters stored in the data.
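A minimal end-to-end sketch of this approach, assuming the names start out in a Series called names (a hypothetical name; the question does not show how the data is actually stored):

```python
import pandas as pd

# Hypothetical input Series, used only for illustration.
names = pd.Series([
    'Victor De La Cruz',
    'Ashley Smith',
    'Angel Miguel Hernandez',
    'Hank Hill',
])

parts = names.str.split()            # each element is a list of words
df = pd.DataFrame({
    'first_name': parts.str[0],      # first word of each name
    'last_name': parts.str[1:],      # remaining words, still a list
})

# Join each list back into a single space-separated string.
df['last_name'] = df['last_name'].str.join(' ')
print(df)
```

Splitting once and reusing parts avoids running the split twice, but the result should be the same as the two separate calls in the question.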

Answer 2

Score: 1

I'd suggest using the n keyword argument to limit the split to the first space only. You can also pass expand=True so the two pieces come back as DataFrame columns instead of lists:

import pandas as pd
s = pd.Series([
    'Victor De La Cruz',
    'Ashley Smith',
    'Angel Miguel Hernandez',
    'Hank Hill'
])
df = s.str.split(n=1, expand=True)
df.columns = ["first_name", "last_name"]

  first_name         last_name
0     Victor        De La Cruz
1     Ashley             Smith
2      Angel  Miguel Hernandez
3       Hank              Hill
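If the names already live in a DataFrame column, the same call can probably fill both new columns in one assignment. A small sketch, assuming a column called name (hypothetical; the question does not show the original column):

```python
import pandas as pd

# 'name' is an assumed column name used only for illustration.
df = pd.DataFrame({'name': [
    'Victor De La Cruz',
    'Ashley Smith',
    'Angel Miguel Hernandez',
    'Hank Hill',
]})

# n=1 splits at the first run of whitespace only; expand=True returns
# a two-column DataFrame, which is assigned to both new columns at once.
df[['first_name', 'last_name']] = df['name'].str.split(n=1, expand=True)
print(df[['first_name', 'last_name']])
```

Note that a name containing no space at all would end up with a missing value in last_name, so that case may need separate handling.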

