2023年7月24日 00:04:16go评论136阅读模式

英文:

Sorting in Pandas Dataframe

问题

我理解主要的排序是在列名为 b 的列上进行的，但即使在传递了要排序的列名列表后，为什么不会相应地在列 a 上执行二次排序呢？

我在这里理解错了吗？

英文:

Suppose I create a pandas data frame as :

my_newobj = pd.DataFrame({'b': [4, 7, -3, 2], 'a': [0, 4, 7, 1]})

And I try to pass a list to the by parameter for sorting values in each column name as such:

my_newobj.sort_values(by=['b', 'a'])

I understand the primary sorting done on column name b but why isn't secondary sorting performed on column a accordingly as well even after passing a list of column names to sort on?

Am I understanding something wrong here?

答案1

得分: 1

以下是您要的翻译内容：

对于您的数据，第一列永远不会有重复（换句话说，第一列中的值是唯一的），因此不需要关心第二列，请考虑以下示例：

import pandas as pd
df = pd.DataFrame({"b":[1,1,1,0,0,0],"a":[1,7,3,5,3,2]})
print(df.sort_values(by=['b', 'a']))

输出结果如下：

英文:

For your data there is never tie in 1st column (in other words values in 1st column are unique), so no need to care about 2nd column, consider following example

import pandas as pd
df = pd.DataFrame({&quot;b&quot;:[1,1,1,0,0,0],&quot;a&quot;:[1,7,3,5,3,2]})
print(df.sort_values(by=[&#39;b&#39;, &#39;a&#39;]))

gives output

答案2

得分: 0

以下是翻译好的代码部分：

import pandas as pd

my_newobj = pd.DataFrame({'b': [4, 7, -3, 2], 'a': [0, 4, 7, 1]})

分离数据框并删除索引值：

a = my_newobj['a'].sort_values().reset_index(drop=True)
b = my_newobj['b'].sort_values().reset_index(drop=True)

重新合并：

final = pd.DataFrame()
final['b'] = b
final['a'] = a

英文:

This is a little over complicated but it works:

import pandas as pd

my_newobj = pd.DataFrame({&#39;b&#39;: [4, 7, -3, 2], &#39;a&#39;: [0, 4, 7, 1]})

Separate the dataframes and drop the index values:

a = my_newobj[&#39;a&#39;].sort_values().reset_index(drop=True)
b = my_newobj[&#39;b&#39;].sort_values().reset_index(drop=True)
# my_newobj[&#39;b&#39;].sort_values(inplace=False).reset_index(drop=True)

Rejoin:

final = pd.DataFrame()
final[&#39;b&#39;] = b
final[&#39;a&#39;] = a

通过集体智慧和协作来改善编程学习和解决问题的方式。致力于成为全球开发者共同参与的知识库，让每个人都能够通过互相帮助和分享经验来进步。

在Pandas数据框中排序

问题

答案1

答案2

This is a little over complicated but it works:

选择Pandas中的行并迭代它们，以根据另一行中的值更改列中的值。

在数据框中通过另一列上的条件搜索数值。

“Unable to import module ‘lambda_function’: No module named ‘msgspec._core’,”

如何在seaborn中跨多个图中保持色调关联？

如何在Playwright视觉比较中屏蔽多个定位器？

在C++中，可以使用可变模板参数来检索类型的内部类型。

selenium.common.exceptions.StaleElementReferenceException: Message: stale element reference: stale element not found

Creating and opening a URL to log in to Website via Basic Auth with Robot Framework/Selenium (Python)

AG Grid 在上下文菜单中以大文本形式打开

What's the correct way to type hint an empty list as a literal in python?

如何在Highcharts Gantt中更改本地化的星期名称

如何在同一个流中使用多个过滤器和映射函数？

如何使用Map/Set来将代码优化到O(n)？

.NET MAUI Android在GitHub Actions上构建失败，错误代码为1。