2023年5月13日 20:24:38go评论73阅读模式

英文:

Frequency plot using dots instead of bars?

问题

I'm trying to create the chart in this question, using this answer. I'm open to any solution that works.

Visual borrowed from the original question:

Difference from that question is I've already calculated my bins and frequency values so I don't use numpy or matplotlib to do so.

Here's my sample data, I refer to it as df_fd in my sample code below:

     low_bin   high_bin  frequency
0  13.142857  18.857143          3
1  18.857143  24.571429          5
2  24.571429  30.285714          8
3  30.285714  36.000000          8
4  36.000000  41.714286          7
5  41.714286  47.428571          7
6  47.428571  53.142857          1
7  53.142857  58.857143          1

Based on the cited question, here's my code (df_fd is the DataFrame above):

fig, ax = plt.subplots()
ax.bar(df_fd.low_bin, df_fd.frequency, width= df_fd.high_bin-df_fd.low_bin)
X,Y = np.meshgrid(bins, df_fd['frequency'])
Y = Y.astype(np.float)
Y[Y>df_fd['frequency']] = np.nan
plt.scatter(X,Y)

This Y[Y>df_fd['frequency']] = np.nan statement is what fails, and I don't know how to get around it. I understand what it's trying to do, and the best guess I have is somehow mapping the matrix index to the DataFrame index would help, but I'm not sure how to do that.

Thank you for helping me!

英文:

I'm trying to create the chart in this question, using this answer. I'm open to any solution that works.

Visual borrowed from original question:

Difference from that question is I've already calculated my bins and frequency values so I don't use numpy or matplotlib to do so.

Here's my sample data, I refer to it as df_fd in my sample code below:

     low_bin   high_bin  frequency
0  13.142857  18.857143          3
1  18.857143  24.571429          5
2  24.571429  30.285714          8
3  30.285714  36.000000          8
4  36.000000  41.714286          7
5  41.714286  47.428571          7
6  47.428571  53.142857          1
7  53.142857  58.857143          1

Based off the cited question here's my code (df_fd is the DataFrame above):

fig, ax = plt.subplots()
ax.bar(df_fd.low_bin, df_fd.frequency, width= df_fd.high_bin-df_fd.low_bin)
X,Y = np.meshgrid(bins, df_fd[&#39;frequency&#39;])
Y = Y.astype(np.float)
Y[Y&gt;df_fd[&#39;frequency&#39;]] = np.nan
plt.scatter(X,Y)

This Y[Y>df_fd['frequency']] = np.nan statement is what fails and I don't know how to get around it. I understand what it's trying to do and the best guess I have is somehow mapping the matrix index to the DataFrame index would help, but I'm not sure how to do that.

Thank you for helping me!

答案1

得分: 2

使用散点图的一种巧妙解决方案：

(df.assign(bin=np.mean([df['low_bin'], df['high_bin']], axis=0))
   .loc[lambda d: d.index.repeat(tmp['frequency'])]
   .assign(Y=lambda d: d.groupby(level=0).cumcount())
   .plot.scatter(x='bin', y='Y', s=600)
)

它的工作原理是获取低/高的平均值作为X值，然后将行重复多次，次数等于“frequency”的值，并使用groupby.cumcount递增计数。

输出：

英文:

One hacky solution using a scatter plot:

(df.assign(bin=np.mean([df[&#39;low_bin&#39;], df[&#39;high_bin&#39;]], axis=0))
   .loc[lambda d: d.index.repeat(tmp[&#39;frequency&#39;])]
   .assign(Y=lambda d: d.groupby(level=0).cumcount())
   .plot.scatter(x=&#39;bin&#39;, y=&#39;Y&#39;, s=600)
)

It works by getting the average of low/high as X value, then repeating the rows as many times as the "frequency" value, and incrementing the count with a groupby.cumcount.

Output:

通过集体智慧和协作来改善编程学习和解决问题的方式。致力于成为全球开发者共同参与的知识库，让每个人都能够通过互相帮助和分享经验来进步。

使用点而不是条形来制作频率图？

问题

答案1

使用Fetch API将JSON对象发送到Flask服务器会导致400 Bad Request。

Pyspark 使用多列创建数据透视表

Bokeh ColumnDataSource标识为源时出现错误 – 为什么？

如何在Python中找到一个楼层平面图的外部轮廓

What's the correct way to type hint an empty list as a literal in python?

如何在Highcharts Gantt中更改本地化的星期名称

如何在同一个流中使用多个过滤器和映射函数？

如何使用Map/Set来将代码优化到O(n)？

.NET MAUI Android在GitHub Actions上构建失败，错误代码为1。

如何在Playwright视觉比较中屏蔽多个定位器？

在C++中，可以使用可变模板参数来检索类型的内部类型。

selenium.common.exceptions.StaleElementReferenceException: Message: stale element reference: stale element not found

Creating and opening a URL to log in to Website via Basic Auth with Robot Framework/Selenium (Python)

AG Grid 在上下文菜单中以大文本形式打开

发表评论