2020年1月6日 19:19:45go评论107阅读模式

英文:

Replace a value by that value divided by number of time that value existing in pandas

问题

我帮你翻译以下部分：

从上面的数据框中，我想要将价格（Price）等于10000的行替换为具有相同ID和价格等于10000的行数，这里的计数为4。

期望的输出：

      ID    Unit_ID       Price
        1     1             50
        2     2             40
        3     1             2500
        3     2             2500
        3     3             2500
        3     4             2500
        6     1             10000
        8     3             10000

英文:

I have dataframe as follows

ID    Unit_ID       Price
1     1             50
2     2             40
3     1             10000
3     2             10000
3     3             10000
3     4             10000
6     1             10000
8     3             10000

From the above dataframe I want to replace the Price = 10000
By the count of rows having same ID and Price = 10000, here that count = 4

Expected Output:

  ID    Unit_ID       Price
    1     1             50
    2     2             40
    3     1             2500
    3     2             2500
    3     3             2500
    3     4             2500
    6     1             10000
    8     3             10000

答案1

得分: 1

创建掩码并将过滤后的行除以True值的计数，使用sum：

mask = df.Price == 10000

df.loc[mask, 'Price'] /= mask.sum()
#print (df)
   ID  Unit_ID   Price
0   1        1    50.0
1   2        2    40.0
2   3        1  2500.0
3   3        2  2500.0
4   3        3  2500.0
5   3        4  2500.0

如果想要将所有值都除以它们的计数：

df['Price'] /= df.groupby(by="Price")['Price'].transform('size')

编辑后：

df['Price'] /= df.groupby(by=["ID", "Price"])['Price'].transform('size')
#print (df)
   ID  Unit_ID    Price
0   1        1     50.0
1   2        2     40.0
2   3        1   2500.0
3   3        2   2500.0
4   3        3   2500.0
5   3        4   2500.0
6   6        1  10000.0
7   8        3  10000.0

英文:

Create mask and divide filtered rows by count of Trues values by sum:

mask = df.Price == 10000

df.loc[mask, &#39;Price&#39;] /= mask.sum()
#same like
#df.loc[mask, &#39;Price&#39;] = df.loc[mask, &#39;Price&#39;] / mask.sum()
print (df)
   ID  Unit_ID   Price
0   1        1    50.0
1   2        2    40.0
2   3        1  2500.0
3   3        2  2500.0
4   3        3  2500.0
5   3        4  2500.0

If want to divide all values by their counts:

df[&#39;Price&#39;] /= df.groupby(by=&quot;Price&quot;)[&#39;Price&#39;].transform(&#39;size&#39;)

EDIT:

df[&#39;Price&#39;] /= df.groupby(by=[&quot;ID&quot;, &quot;Price&quot;])[&#39;Price&#39;].transform(&#39;size&#39;)
print (df)
   ID  Unit_ID    Price
0   1        1     50.0
1   2        2     40.0
2   3        1   2500.0
3   3        2   2500.0
4   3        3   2500.0
5   3        4   2500.0
6   6        1  10000.0
7   8        3  10000.0

答案2

得分: 1

如果您只想将价格为10000的行替换为10000，可以这样做：

df.loc[df.Price==10000, 'Price']=10000/len(df.loc[df.Price==10000])

如果您想将每一行都除以该值的计数，可以使用groupby和transform：

df.Price = df.groupby(by="Price").Price.transform(lambda x: x/len(x))

	ID	Unit_ID	Price
0	1	1		50
1	2	2		40
2	3	1		2500
3	3	2		2500
4	3	3		2500
5	3	4		2500

英文:

If you just want to replace the rows with 10000, you can do:

df.loc[df.Price==10000, &#39;Price&#39;]=10000/len(df.loc[df.Price==10000])

If you want to divide every row with the value count, you can use groupby and transform:

df.Price = df.groupby(by=&quot;Price&quot;).Price.transform(lambda x: x/len(x))


	ID	Unit_ID	Price
0	1	1		50
1	2	2		40
2	3	1		2500
3	3	2		2500
4	3	3		2500
5	3	4		2500

通过集体智慧和协作来改善编程学习和解决问题的方式。致力于成为全球开发者共同参与的知识库，让每个人都能够通过互相帮助和分享经验来进步。

将一个值替换为该值除以该值在 pandas 中存在的次数。

问题

答案1

答案2

Python: 根据观测日期创建图表（而不是作为时间序列）

根据时间填充列数值

在 pandas 数据帧中基于其他列最小值的索引创建新列。

“value of object index” 在 Pandas DataFrame 中是什么意思？

What's the correct way to type hint an empty list as a literal in python?

如何在Highcharts Gantt中更改本地化的星期名称

如何在同一个流中使用多个过滤器和映射函数？

如何使用Map/Set来将代码优化到O(n)？

.NET MAUI Android在GitHub Actions上构建失败，错误代码为1。

如何在Playwright视觉比较中屏蔽多个定位器？

在C++中，可以使用可变模板参数来检索类型的内部类型。

selenium.common.exceptions.StaleElementReferenceException: Message: stale element reference: stale element not found

Creating and opening a URL to log in to Website via Basic Auth with Robot Framework/Selenium (Python)

AG Grid 在上下文菜单中以大文本形式打开

发表评论