2023年2月9日 03:23:55go评论176阅读模式

英文:

How to melt a dataframe so repeated items become the values that correspond to the index

问题

我有这个数据框：

df = pd.DataFrame({'Status':['CO','AD','AD','AD','OT','CO','OT','AD'],
                   'Mutation':['H157Y','R47H','R47H','R67H','R62H','D87N','D39E','D39E']})
print(df)

我想要数据框看起来像这样：

df2 = pd.DataFrame({'Status':['CO','AD','OT'],'H157Y':[1,0,0],'R47H':[0,2,0],'R67H':[0,1,0],
                    'R62H':[0,0,1],'D87N':[1,0,0],'D39E':[1,0,1]})
print(df2)

其中突变是列名，它们的值 - 击中的次数 - 对应于状态。

英文:

I have this dataframe:

df = pd.DataFrame({&#39;Status&#39;:[&#39;CO&#39;,&#39;AD&#39;,&#39;AD&#39;,&#39;AD&#39;,&#39;OT&#39;,&#39;CO&#39;,&#39;OT&#39;,&#39;AD&#39;],
                   &#39;Mutation&#39;:[&#39;H157Y&#39;,&#39;R47H&#39;,&#39;R47H&#39;,&#39;R67H&#39;,&#39;R62H&#39;,&#39;D87N&#39;,&#39;D39E&#39;,&#39;D39E&#39;]})
print(df)
  
  Status Mutation
0     CO    H157Y
1     AD     R47H
2     AD     R47H
3     AD     R67H
4     OT     R62H
5     CO     D87N
6     OT     D39E
7     AD     D39E

I want the dataframe to look like this:

df2 = pd.DataFrame({&#39;Status&#39;:[&#39;CO&#39;,&#39;AD&#39;,&#39;OT&#39;],&#39;H157Y&#39;:[1,0,0],&#39;R47H&#39;:[0,2,0],&#39;R67H&#39;:[0,1,0],
                    &#39;R62H&#39;:[0,0,1],&#39;D87N&#39;:[1,0,0],&#39;D39E&#39;:[1,0,1]})
print(df2)

  Status  H157Y  R47H  R67H  R62H  D87N  D39E
0     CO      1     0     0     0     1     1
1     AD      0     2     1     0     0     0
2     OT      0     0     0     1     0     1

Where mutations are the column names and their values - the number of hits - corresponds to the status.

答案1

得分: 3

这应该可以解决问题：

df.groupby(['Status', 'Mutation']).size().unstack(fill_value=0)

英文:

This should do the trick:

df.groupby([&#39;Status&#39;, &#39;Mutation&#39;]).size().unstack(fill_value=0)

答案2

得分: 2

我们可以像下面这样使用 pd.crosstab：

&gt;&gt;&gt; pd.crosstab(df[&quot;Status&quot;], df[&quot;Mutation&quot;])

Mutation  D39E  D87N  H157Y  R47H  R62H  R67H
Status                                       
AD           1     0      0     2     0     1
CO           0     1      1     0     0     0
OT           1     0      0     0     1     0

或者我们可以像下面这样使用 pd.get_dummies、pandas.DataFrame.groupby 然后使用 pandas.DataFrame.rename 对列进行重命名：

(pd.get_dummies(df, 
                columns=[&#39;Mutation&#39;]
               ).groupby([&#39;Status&#39;]).sum().rename(columns=lambda x: x.split(&#39;_&#39;)[1]))

输出结果：

        D39E  D87N  H157Y  R47H  R62H  R67H
Status                                     
AD         1     0      0     2     0     1
CO         0     1      1     0     0     0
OT         1     0      0     0     1     0

英文:

We can use pd.crosstab like the below:

&gt;&gt;&gt; pd.crosstab(df[&quot;Status&quot;], df[&quot;Mutation&quot;])

Mutation  D39E  D87N  H157Y  R47H  R62H  R67H
Status                                       
AD           1     0      0     2     0     1
CO           0     1      1     0     0     0
OT           1     0      0     0     1     0

Or we can use pd.get_dummies, pandas.DataFrame.groupby then pandas.DataFrame.rename columns like the below:

(pd.get_dummies(df, 
                columns=[&#39;Mutation&#39;]
               ).groupby([&#39;Status&#39;]).sum().rename(columns=lambda x: x.split(&#39;_&#39;)[1]))

Output:

        D39E  D87N  H157Y  R47H  R62H  R67H
Status                                     
AD         1     0      0     2     0     1
CO         0     1      1     0     0     0
OT         1     0      0     0     1     0

通过集体智慧和协作来改善编程学习和解决问题的方式。致力于成为全球开发者共同参与的知识库，让每个人都能够通过互相帮助和分享经验来进步。

如何融化数据框，使重复的项目成为与索引对应的值

问题

答案1

答案2

寻找一个快速的优化算法来解决具有唯一正解的非线性方程。

将值附加到DataFrame行使用lambda if else。

Matlab中是否有根据条件连接数组的函数？

寻找Hough线上最近的点

What's the correct way to type hint an empty list as a literal in python?

如何在Highcharts Gantt中更改本地化的星期名称

如何在同一个流中使用多个过滤器和映射函数？

如何使用Map/Set来将代码优化到O(n)？

.NET MAUI Android在GitHub Actions上构建失败，错误代码为1。

如何在Playwright视觉比较中屏蔽多个定位器？

在C++中，可以使用可变模板参数来检索类型的内部类型。

selenium.common.exceptions.StaleElementReferenceException: Message: stale element reference: stale element not found

Creating and opening a URL to log in to Website via Basic Auth with Robot Framework/Selenium (Python)

AG Grid 在上下文菜单中以大文本形式打开

发表评论