April 19, 2023 22:10:19

I want to get the 'False' Count for every segment

Question

I have a data set that I have split into segments based on certain criteria. In another column, my code returns 'True' if a value equals the one before it and 'False' if it does not. I can get the total count of 'False' values for the entire data set, but I want the count of 'False' values per segment.

My code:

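from collections import Counter

# 1 where the value equals the previous row, 0 otherwise
df['cols2'] = df['cols1'].diff().eq(0).replace({False : 0, True : 1})

# total count of 'False' (0) values over the whole data set
counter_obj = Counter(df['cols2'])

false_count = counter_obj[False]

# attempt at a per-segment result
seg = df.groupby('segID')['cols2'].sum()
print(seg)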

Answer 1

Score: 0

import pandas as pd

df = pd.DataFrame({'cols1': [1, 2, 3, 3, 4, 5, 5, 6, 7]})
# 1 where the value equals the previous row, 0 otherwise
df['cols2'] = df['cols1'].diff().eq(0).astype(int)

# the running total of the 1s gives every row a group id
df['cs'] = df['cols2'].cumsum()

I suggest creating a 'cs' column that holds the cumulative sum of 'cols2' and using it to split the dataframe into groups. As I understand it, you need to count only the zeros in each segment.

Input

   cols1  cols2  cs
0      1      0   0
1      2      0   0
2      3      0   0
3      3      1   1
4      4      0   1
5      5      0   1
6      5      1   2
7      6      0   2
8      7      0   2

To aggregate a count by group:

agr = df.groupby('cs').apply(lambda x: x.loc[x['cols2'] == 0, 'cols2'].count())

Output

cs
0    3
1    2
2    2
dtype: int64
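The same per-group count can probably be obtained without `apply`. This is a minimal sketch, not part of the original answer: it flags the zeros with `eq(0)` and sums the flags inside each 'cs' group. The name `zeros_per_group` is only illustrative.

```python
import pandas as pd

df = pd.DataFrame({"cols1": [1, 2, 3, 3, 4, 5, 5, 6, 7]})
df["cols2"] = df["cols1"].diff().eq(0).astype(int)
df["cs"] = df["cols2"].cumsum()

# Flag the zeros as True, then sum the flags within each 'cs' group;
# every True contributes 1 to the group's total.
zeros_per_group = df["cols2"].eq(0).groupby(df["cs"]).sum()
print(zeros_per_group)
```

Summing a boolean mask avoids calling a Python lambda once per group, which tends to matter on larger frames.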

And if you need the count on each row:

df['count'] = df.groupby('cs')['cols2'].transform(lambda x: x[x==0].count())

Output

   cols1  cols2  cs  count
0      1      0   0      3
1      2      0   0      3
2      3      0   0      3
3      3      1   1      2
4      4      0   1      2
5      5      0   1      2
6      5      1   2      2
7      6      0   2      2
8      7      0   2      2
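If the segments are already labelled by the 'segID' column from the question, the same idea may be applied to that column directly. The sketch below is an assumption-laden illustration: the 'segID' values are made up, since the question does not show them, and `false_per_segment` is a hypothetical name.

```python
import pandas as pd

# Hypothetical sample data: the segID values are invented for illustration.
df = pd.DataFrame({
    "segID": ["a", "a", "a", "a", "b", "b", "b", "b", "b"],
    "cols1": [1, 2, 3, 3, 4, 5, 5, 6, 7],
})

# True where a value equals the one before it; the first row is always False
# because diff() yields NaN there.
df["cols2"] = df["cols1"].diff().eq(0)

# Invert the flag and sum per segment, so each False contributes 1.
false_per_segment = (~df["cols2"]).groupby(df["segID"]).sum()
print(false_per_segment)
```

With this sample data, segment 'a' has 3 'False' values and segment 'b' has 4. Note that `diff()` here compares the first row of a segment with the last row of the previous one; `df.groupby('segID')['cols1'].diff()` would restart the comparison inside each segment instead.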
