2023年2月6日 19:14:31go评论103阅读模式

英文:

Create numpy array from panda daataframe inside a For loop

问题

以下是翻译好的代码部分：

让我们假设我有以下的数据框：
data = {"Names": ["Ray", "John", "Mole", "Smith", "Jay", "Marc", "Tom", "Rick"],
        "Sports": ["Soccer", "Judo", "Tenis", "Judo", "Tenis","Soccer","Judo","Tenis"]}
我想要使用一个循环，对于每个独特的运动项目，我可以获取一个包含参与该运动项目的人的numpy数组。在伪代码中，可以解释为：
for unique sport in sports:
    nArray = 包含练习该运动的人的numpy数组
    ---------
    对nArray执行某些操作
    -------

请注意，代码中的伪代码部分没有被翻译，只有注释和字符串被翻译。

英文:

Lets say that i have the following dataframe:

data = {&quot;Names&quot;: [&quot;Ray&quot;, &quot;John&quot;, &quot;Mole&quot;, &quot;Smith&quot;, &quot;Jay&quot;, &quot;Marc&quot;, &quot;Tom&quot;, &quot;Rick&quot;],
        &quot;Sports&quot;: [&quot;Soccer&quot;, &quot;Judo&quot;, &quot;Tenis&quot;, &quot;Judo&quot;, &quot;Tenis&quot;,&quot;Soccer&quot;,&quot;Judo&quot;,&quot;Tenis&quot;]}

I want to have a for loop like that for each unique Sport i am able to retrieve a numpy array containing the Names of people playing that sport. In pseudo code that can be explainded as

for unique sport in sports:
    nArray= numpy array of names of people practicing sport
    ---------
    Do something with nArray
    -------

答案1

得分: 0

使用 GroupBy.apply 与 np.array：

df = pd.DataFrame(data)
s = df.groupby('Sports')['Names'].apply(np.array)
print (s)
Sports
Judo      [John, Smith, Tom]
Soccer           [Ray, Marc]
Tenis      [Mole, Jay, Rick]
Name: Names, dtype: object
for sport, name in s.items():
    print (name)
    ['John' 'Smith' 'Tom']
    ['Ray' 'Marc']
    ['Mole' 'Jay' 'Rick']

英文:

Use GroupBy.apply with np.array:

df = pd.DataFrame(data)
s = df.groupby(&#39;Sports&#39;)[&#39;Names&#39;].apply(np.array)
print (s)
Sports
Judo      [John, Smith, Tom]
Soccer           [Ray, Marc]
Tenis      [Mole, Jay, Rick]
Name: Names, dtype: object
for sport, name in s.items():
    print (name)
    [&#39;John&#39; &#39;Smith&#39; &#39;Tom&#39;]
    [&#39;Ray&#39; &#39;Marc&#39;]
    [&#39;Mole&#39; &#39;Jay&#39; &#39;Rick&#39;]

答案2

得分: 0

一种方法是

df = pd.DataFrame(data)
for sport in df.Sports.unique():
    list_of_names = list(df[df.Sports == sport].Names)
    data = np.array(list_of_names)

英文:

one way to go

df = pd.DataFrame(data)
for sport in df.Sports.unique():
    list_of_names = list(df[df.Sports == sport].Names)
    data = np.array(list_of_names)

答案3

得分: 0

import numpy as np
import pandas as pd
data = {"Names": ["Ray", "John", "Mole", "Smith", "Jay", "Marc", "Tom", "Rick"],
        "Sports": ["Soccer", "Judo", "Tenis", "Judo", "Tenis", "Soccer", "Judo", "Tenis"]}
df = pd.DataFrame(data)
unique_sports = df['Sports'].unique()
for sport in unique_sports:
    uniqueNames = np.array(df[df['Sports'] == sport]['Names'])
print(uniqueNames)

Result:

['Mole' 'Jay' 'Rick']

英文:

You can do by pandas library for get list array of sport persons name.

import numpy as np
import pandas as pd
data = {&quot;Names&quot;: [&quot;Ray&quot;, &quot;John&quot;, &quot;Mole&quot;, &quot;Smith&quot;, &quot;Jay&quot;, &quot;Marc&quot;, &quot;Tom&quot;, &quot;Rick&quot;],
        &quot;Sports&quot;: [&quot;Soccer&quot;, &quot;Judo&quot;, &quot;Tenis&quot;, &quot;Judo&quot;, &quot;Tenis&quot;,&quot;Soccer&quot;,&quot;Judo&quot;,&quot;Tenis&quot;]}
df = pd.DataFrame(data)
unique_sports = df[&#39;Sports&#39;].unique()
for sport in unique_sports:
    uniqueNames = np.array(df[df[&#39;Sports&#39;] == sport][&#39;Names&#39;])
print(uniqueNames)

Result :

[&#39;Mole&#39; &#39;Jay&#39; &#39;Rick&#39;]

通过集体智慧和协作来改善编程学习和解决问题的方式。致力于成为全球开发者共同参与的知识库，让每个人都能够通过互相帮助和分享经验来进步。

从Pandas数据框内部的for循环中创建NumPy数组。

问题

答案1

答案2

答案3

使用Python-SymPy在给定条件下分析计算函数的积分。

Pattern to work around the static value of a python default argument

Discord Python Bot 只响应私信消息。

如何将Matlab随机生成器状态恢复到numpy？

如何在Playwright视觉比较中屏蔽多个定位器？

在C++中，可以使用可变模板参数来检索类型的内部类型。

selenium.common.exceptions.StaleElementReferenceException: Message: stale element reference: stale element not found

Creating and opening a URL to log in to Website via Basic Auth with Robot Framework/Selenium (Python)

AG Grid 在上下文菜单中以大文本形式打开

What's the correct way to type hint an empty list as a literal in python?

如何在Highcharts Gantt中更改本地化的星期名称

如何在同一个流中使用多个过滤器和映射函数？

如何使用Map/Set来将代码优化到O(n)？

.NET MAUI Android在GitHub Actions上构建失败，错误代码为1。

如何在Playwright视觉比较中屏蔽多个定位器？

在C++中，可以使用可变模板参数来检索类型的内部类型。

selenium.common.exceptions.StaleElementReferenceException: Message: stale element reference: stale element not found

Creating and opening a URL to log in to Website via Basic Auth with Robot Framework/Selenium (Python)

AG Grid 在上下文菜单中以大文本形式打开

What's the correct way to type hint an empty list as a literal in python?

如何在Highcharts Gantt中更改本地化的星期名称

如何在同一个流中使用多个过滤器和映射函数？

如何使用Map/Set来将代码优化到O(n)？

.NET MAUI Android在GitHub Actions上构建失败，错误代码为1。

发表评论