January 6, 2020, 23:27:29

Pandas: Split a specific column values to new columns and find occurrences of a value in all newly created columns

Question

I have two columns named "family" and "severity". I would like to split the unique values of the "severity" column into new columns and count how often each "family" value occurs under each of them.

Initial DataFrame:
```text
df
family severity
AA     High
BB     Critical
CC     Medium
DD     Low
AA     Low
CC     High
```

Output:

```text
df_output
family Critical High Medium Low Total
AA       0       1     0     1    2
BB       1       0     0     0    1
CC       0       1     1     0    2
DD       0       0     0     1    1
Total    1       2     1     2    6
```
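# Answer 1
**Score**: 4
Use [`crosstab`](https://pandas.pydata.org/pandas-docs/stable/reference/api/pandas.crosstab.html) with `margins=True`:
```python
import pandas as pd

# Count each (family, severity) pair; margins=True adds the "Total" row and column.
# rename_axis(None, axis=1) drops the "severity" label from the column axis.
final = pd.crosstab(df['family'], df['severity'], margins=True, margins_name='Total').rename_axis(None, axis=1)
print(final)
```

```text
        Critical  High  Low  Medium  Total
family                                    
AA             0     1    1       0      2
BB             1     0    0       0      1
CC             0     1    0       1      2
DD             0     0    1       0      1
Total          1     2    2       1      6
```

From the docs:

> margins : bool, default False
> Add row/column margins (subtotals).

> margins_name : str, default 'All'
> Name of the row/column that will contain the totals when margins is True.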
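
`crosstab` sorts the column labels alphabetically, so the result above lists `Low` before `Medium`, whereas the question asked for Critical, High, Medium, Low. A minimal, self-contained sketch of one way to restore that order is shown below. The sample data is copied from the question; the `severity_order` list is an assumption introduced here for illustration and is not part of the original answer.

```python
import pandas as pd

# Sample data copied from the question
df = pd.DataFrame({
    "family": ["AA", "BB", "CC", "DD", "AA", "CC"],
    "severity": ["High", "Critical", "Medium", "Low", "Low", "High"],
})

# Count each (family, severity) pair; margins=True adds a "Total" row and column
final = pd.crosstab(
    df["family"], df["severity"], margins=True, margins_name="Total"
).rename_axis(None, axis=1)

# Hypothetical ordering list: reorder the alphabetically sorted columns
# to match the layout requested in the question
severity_order = ["Critical", "High", "Medium", "Low", "Total"]
final = final[severity_order]

print(final)
```

Selecting columns with a list only changes their order; the counts and the `Total` margins computed by `crosstab` are left untouched.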
