2023年3月9日 16:29:57go评论162阅读模式

英文:

How to find the overall average in pandas

问题

1 - 类别 DevTool 平均值计算如下

（A 系统的 1 类别 DevTool 计数 + B 系统的 1 类别 DevTool 计数 + ...等）/ 具有 1 类别 DevTool 的系统数量

= （1+2+1+2）/ 4 = 1.5

类似地
0 - 类别 DevTool 平均值计算如下

（A 系统的 0 类别 DevTool 计数 + B 系统的 0 类别 DevTool 计数 + ...等）/ 具有 0 类别 DevTool 的系统数量

= （1+1+2）/ 3 = 1.33

为了执行这个平均值计算，每次我都将数据移到 Excel 并使用 Excel 内置函数来获取这些 1 和 0 类别的平均值。我不确定如何在 pandas 中直接执行这个平均值计算，并获得分别为 1 和 0 类别的平均值 1.5 和 1.33。

英文:

I have a dataframe below:

# initialize list of lists
data = [[&#39;A&#39;,&#39;Excel&#39;,&#39;1&#39;], [&#39;A&#39;,&#39;Word_soft&#39;,&#39;0&#39;],[&#39;B&#39;,&#39;Excel&#39;,&#39;1&#39;],[&#39;B&#39;,&#39;Word&#39;,&#39;1&#39;],[&#39;C&#39;,&#39;Word&#39;,&#39;1&#39;],[&#39;C&#39;,&#39;Word_soft&#39;,&#39;0&#39;],[&#39;D&#39;,&#39;Java2&#39;,&#39;1&#39;],[&#39;D&#39;,&#39;Java&#39;,&#39;1&#39;],[&#39;E&#39;,&#39;PPT&#39;,&#39;0&#39;], [&#39;E&#39;,&#39;Word_soft&#39;,&#39;0&#39;]]
  
# Create the pandas DataFrame
df = pd.DataFrame(data, columns=[&#39;System&#39;,&#39;App&#39;,&#39;DevTool&#39;])

I performed the below to get the count of DevTool in each System.

df.groupby([&#39;System&#39;,&#39;DevTool&#39;])[&#39;DevTool&#39;].count()

I need to find the overall DevTool Average in each category 1 and 0 as below

1 - Category DevTool average calculation as below

(A System's 1 category DevTool count + B System's 1 category DevTool count + ...soon) / Number of systems which have 1 category DevTool

= (1+2+1+2) / 4 = 1.5

Similarly
0 - Category Devtool average calculation as below

(A System's 0 category DevTool count + B System's 0 category DevTool count + ...soon) / Number of systems which have 0 category DevTool

= (1+1+2) / 3 = 1.33

In order to perform this average calculation, everytime I move the data to excel and use the excel inbuild function to get this average value. I am not sure how to perform this average calculation within pandas directly and get the average values 1.5 and 1.33 respectively for 1 and 0 category.

答案1

得分: 3

使用 groupby.mean 步骤（.groupby('DevTool').mean()）：

(df.groupby(['System','DevTool'])['DevTool'].count()
   .groupby('DevTool').mean()
)

您还可以使用 value_counts 替换第一步：

df[['System','DevTool']].value_counts(sort=False).groupby('DevTool').mean()

输出：

DevTool
0    1.333333
1    1.500000
Name: DevTool, dtype: float64

英文:

Add a groupby.mean step (.groupby('DevTool').mean()):

(df.groupby([&#39;System&#39;,&#39;DevTool&#39;])[&#39;DevTool&#39;].count()
   .groupby(&#39;DevTool&#39;).mean()
)

You can also replace the first step by value_counts:

df[[&#39;System&#39;,&#39;DevTool&#39;]].value_counts(sort=False).groupby(&#39;DevTool&#39;).mean()

Output:

DevTool
0    1.333333
1    1.500000
Name: DevTool, dtype: float64

通过集体智慧和协作来改善编程学习和解决问题的方式。致力于成为全球开发者共同参与的知识库，让每个人都能够通过互相帮助和分享经验来进步。

如何在pandas中找到整体平均值

问题

答案1

如何序列化带有2个或更多外键的Django模型？

如何使用argparse库来解析给定的字符串而不是app_args？

计算Pandas数据帧中与第一个值相关的时间差。

在Python或其他语言中是否可以嵌入一个360度视图器到PDF中？

What's the correct way to type hint an empty list as a literal in python?

如何在Highcharts Gantt中更改本地化的星期名称

如何在同一个流中使用多个过滤器和映射函数？

如何使用Map/Set来将代码优化到O(n)？

.NET MAUI Android在GitHub Actions上构建失败，错误代码为1。

如何在Playwright视觉比较中屏蔽多个定位器？

在C++中，可以使用可变模板参数来检索类型的内部类型。

selenium.common.exceptions.StaleElementReferenceException: Message: stale element reference: stale element not found

Creating and opening a URL to log in to Website via Basic Auth with Robot Framework/Selenium (Python)

AG Grid 在上下文菜单中以大文本形式打开

发表评论