July 24, 2023, 00:11:25

Counting unique zip codes with pandas

Question

I am trying to count how many times each zip code occurs within a monthly time frame.

```python
df1 = pd.DataFrame(df)
print(df[['Timestamp', 'zip']].value_counts())
```

I get the following output:

Timestamp                 zip  
Front Desk Call Log (Master Data)  NaN      1
2023-07-21 12:22:47.697000       60191    1
2023-07-21 10:55:13.311000       NaN      1
2023-07-21 10:49:06.148000       60187    1
2023-07-21 10:29:08.396000       60189    1
                                       ..
2023-07-21 14:43:23.522000       60187    1
2023-07-21 14:45:12.332000       60440    1
2023-07-21 14:46:46.452000       NaN      1
2023-07-21 17:34:11.631000       60548    1
2023-07-21 17:39:36.358000       60133    1
Length: 8314, dtype: int64

I don't need the number of distinct zip codes; I need the count for each zip code. The output I expect is:

60187 2
60542 1
60540 3

and so on.

English:

I'm trying to count the occurrences of each zip code within a monthly time frame.

df1 = pd.DataFrame(df)
print(df[['Timestamp', 'zip']].nunique())

I get an output of:

Timestamp    8314
zip           343
dtype: int64

I don't need the number of zip codes, I need the count of each zip code.
I was expecting:

60187 2
60542 1
60540 3

etc.

I was asked to post sample rows:

                              Timestamp    zip
0     Front Desk Call Log (Master Data)    NaN
1            2023-07-21 12:22:47.697000  60191
2            2023-07-21 10:55:13.311000    NaN
3            2023-07-21 10:49:06.148000  60187
4            2023-07-21 10:29:08.396000  60189
...                                 ...    ...
8309         2023-07-21 14:43:23.522000  60187
8310         2023-07-21 14:45:12.332000  60440
8311         2023-07-21 14:46:46.452000    NaN
8312         2023-07-21 17:34:11.631000  60548
8313         2023-07-21 17:39:36.358000  60133

[8314 rows x 2 columns]

Answer 1

Score: 1

An alternative to value_counts that gives the same per-zip counts is groupby combined with size(). Group by 'zip' and count the occurrences of each zip code:

df.groupby('zip').size()

Rows where zip is NaN are left out, because groupby drops NaN keys by default.
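The question also asks for counts within a monthly period, which the one-liner above does not cover. The following is a minimal sketch of one way to extend it, not a definitive solution. The data is made up to resemble the sample rows. The `month` column name and the explicit timestamp `format` string are assumptions based on how the sample timestamps look.

```python
import pandas as pd

# Hypothetical data shaped like the sample rows in the question.
df = pd.DataFrame({
    "Timestamp": [
        "Front Desk Call Log (Master Data)",  # stray header row from the sheet
        "2023-07-21 12:22:47.697000",
        "2023-07-21 10:49:06.148000",
        "2023-07-21 14:43:23.522000",
        "2023-06-30 09:15:00.000000",
        "2023-07-21 10:55:13.311000",
    ],
    "zip": [None, "60191", "60187", "60187", "60187", None],
})

# Assumed format, taken from the sample rows. With errors="coerce",
# anything that does not match (such as the header row) becomes NaT.
df["Timestamp"] = pd.to_datetime(
    df["Timestamp"], format="%Y-%m-%d %H:%M:%S.%f", errors="coerce"
)

# Overall count per zip; groupby skips NaN keys by default.
per_zip = df.groupby("zip").size()

# Keep only rows with a usable timestamp, then label each row with its month.
clean = df.dropna(subset=["Timestamp"]).copy()
clean["month"] = clean["Timestamp"].dt.to_period("M")

# Count per zip within each calendar month.
per_month = clean.groupby(["month", "zip"]).size()

print(per_zip)
print(per_month)
```

If the real column contains timestamps without fractional seconds, the assumed `format` would turn them into `NaT`, so it may need adjusting for the actual data.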

Answer 2

Score: 0

Try:

df['zip'].value_counts()

If you want to count the nulls (NaN) as well:

df['zip'].value_counts(dropna=False)
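To make the difference between the two calls concrete, here is a small self-contained sketch. The `zips` Series is invented for illustration and chosen to mirror the expected output in the question. It is not the asker's data.

```python
import pandas as pd

# Hypothetical zip column; None stands in for the NaN entries in the question.
zips = pd.Series(
    ["60187", "60542", "60540", "60187", None, "60540", "60540"],
    name="zip",
)

# Default behaviour: missing values are excluded and the result is
# sorted by count, most frequent first.
counts = zips.value_counts()

# dropna=False adds one more row that counts the missing values.
counts_with_nan = zips.value_counts(dropna=False)

# sort_index() orders the result by zip code instead of by frequency.
by_zip = counts.sort_index()

print(counts)
print(counts_with_nan)
print(by_zip)
```

Here `counts` holds 3 for 60540, 2 for 60187 and 1 for 60542, and `counts_with_nan` has one extra entry for the single missing value.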
