2023年1月9日 01:33:52go评论104阅读模式

英文:

Write Values found by Pandas Dataframe's .loc function into an array

问题

我有一个Google电子表格，成功加载到了一个Pandas数据框中：

Tag1    Tag2    Tag3    Tag4    Tag5    MobileNo
Blue    Yellow  Green   Velvet  Red     12345678
Blue    Yellow  Pink    Grey            234556778
Red     Yellow  Orange  Velvet          4456568
Red     Yellow  Grey    Blue            3454655467

现在我不太熟悉Pandas。
我需要将所有在它们的行中具有一个标签的MobileNo写入一个数组中。

就像这样：

tag_red_results = ['12345678', '4456568', '3454655467']

我该如何实现这一点？

英文:

I have a google spreadsheet which i managed to load into a pandas dataframe:

Tag1	Tag2	Tag3	Tag4	Tag5	MobileNo
Blue	Yellow	Green	Velvet	Red	    12345678
Blue	Yellow	Pink	Grey	        234556778
Red	    Yellow	Orange	Velvet		    4456568
Red	    Yellow	Grey	Blue		    3454655467

Now i am not really familiar with pandas.
I would need all MobileNo which have a tag in one of the 5 tag columns within their rows to be written into an array.

tag_red_results = [&#39;12345678&#39;, &#39;4456568&#39;, &#39;3454655467&#39;]

How can i accomplish this?

答案1

得分: 1

使用 pandas.DataFrame.loc 与 布尔索引 ：

# 是否将MobileNo标记为“Red”？
m = df.filter(like="Tag").eq("Red").any(axis=1)
s = df.loc[m, "MobileNo"]

如果需要一个列表，可以使用 pandas.Series.to_list ：

tag_red_results = s.to_list()
#[12345678, 4456568, 3454655467]

或者，如果你需要一个NumPy数组，可以使用 pandas.Series.to_numpy ：

tag_red_results = s.to_numpy()
#array([  12345678,    4456568, 3454655467], dtype=int64)

英文:

IIUC, use pandas.DataFrame.loc with boolean indexing :

# is the MobileNo tagged as &quot;Red&quot; ?
m = df.filter(like=&quot;Tag&quot;).eq(&quot;Red&quot;).any(axis=1)
s = df.loc[m, &quot;MobileNo&quot;]

If a list is needed, then use pandas.Series.to_list :

tag_red_results = s.to_list()
#[12345678, 4456568, 3454655467]

Or, if you need a numpy array, use pandas.Series.to_numpy :

tag_red_results = s.to_numpy()
#array([  12345678,    4456568, 3454655467], dtype=int64)

答案2

得分: 0

你还可以使用 melt 来展开你的标签列：

&gt;&gt;&gt; df.melt('MobileNo').loc[lambda x: x['value'] == 'Red', 'MobileNo'].tolist()
[4456568, 3454655467, 12345678]

英文:

You can also use melt to flatten your tag columns:

&gt;&gt;&gt; df.melt(&#39;MobileNo&#39;).loc[lambda x: x[&#39;value&#39;] == &#39;Red&#39;, &#39;MobileNo&#39;].tolist()
[4456568, 3454655467, 12345678]

答案3

得分: 0

谢谢Timeless！

你的解决方案完美地运行了！

以下是我的代码：

def readColorsDataFromClientSheet(sheetId, tag):
    ss = sheets[sheetId]
    df = ss.find('Colors').to_frame(index_col='Clients')
    tagged = df.filter(like='Tag').eq(tag).any(axis=1)
    mobile_numbers = df.loc[tagged, "MobileNo"].tolist()
    print(mobile_numbers)
return mobile_numbers

请注意，这里的代码部分没有进行翻译。

英文:

Thank you Timeless!

your solution worked perfectly!

Below is my code:

def readColorsDataFromClientSheet(sheetId, tag):
    ss = sheets[sheetId]
    df = ss.find(&#39;Colors&#39;).to_frame(index_col=&#39;Clients&#39;)
    tagged = df.filter(like=&#39;Tag&#39;).eq(tag).any(axis=1)
    mobile_numbers = df.loc[tagged, &quot;MobileNo&quot;].tolist()
    print(mobile_numbers)
return mobile_numbers

通过集体智慧和协作来改善编程学习和解决问题的方式。致力于成为全球开发者共同参与的知识库，让每个人都能够通过互相帮助和分享经验来进步。

使用Pandas Dataframe的.loc函数找到的数值写入一个数组中。

问题

答案1

答案2

答案3

如何过滤 pandas 数据框（DF）并根据这些条件创建三个新的数据框（DF）？

如何使用Python中的Selenium Webdriver最佳方式登录Gmail？

重复每列的值两次，将它们放在一起。

查找两个列表中共同的最大数 #python

如何在Playwright视觉比较中屏蔽多个定位器？

在C++中，可以使用可变模板参数来检索类型的内部类型。

selenium.common.exceptions.StaleElementReferenceException: Message: stale element reference: stale element not found

Creating and opening a URL to log in to Website via Basic Auth with Robot Framework/Selenium (Python)

AG Grid 在上下文菜单中以大文本形式打开

What's the correct way to type hint an empty list as a literal in python?

如何在Highcharts Gantt中更改本地化的星期名称

如何在同一个流中使用多个过滤器和映射函数？

如何使用Map/Set来将代码优化到O(n)？

.NET MAUI Android在GitHub Actions上构建失败，错误代码为1。