如何使我在R中制作的图表看起来像我在Python中有的那样?

huangapple go评论88阅读模式
英文:

How do I make my plot from R look like the one I have on Python?

问题

  1. 我试图让我的R中的图表看起来像我在Python中拥有的那个:
  2. ![左侧是Python(正确的),右侧是R(我正试图更改这个)](https://i.stack.imgur.com/HF83P.jpg)
  3. 这是PythonR的数据框。
  4. Python
  5. # 为了绘制折线图
  6. # 为每一年创建单独的数据框
  7. years = All_Flights_Combined_Month['Year'].unique()
  8. data_frames_month = [All_Flights_Combined_Month[All_Flights_Combined_Month['Year'] == year] for year in years]
  9. # 创建子图
  10. fig, ax = plt.subplots(figsize=(10, 8))
  11. # 为每一年绘制 Delay_count
  12. for i, year in enumerate(years):
  13. color = 'red' if str(year) == '2003' else 'green' if str(year) == '2004' else 'blue'
  14. ax.plot(data_frames_month[i]['Month'], data_frames_month[i]['Delay_count'], label=f"{year} Delay Count", color=color)
  15. # 为每一年绘制 Total_Count
  16. for i, year in enumerate(years):
  17. color = 'orange' if str(year) == '2003' else 'yellow' if str(year) == '2004' else 'purple'
  18. ax.plot(data_frames_month[i]['Month'], data_frames_month[i]['Total_Count'], label=f"{year} Total Count", color=color)
  19. # 设置标题和标签
  20. ax.set_title('Flight Count by Month')
  21. ax.set_xlabel('Month')
  22. ax.set_ylabel('Number of Flights')
  23. # 添加图例
  24. ax.legend(title='Year')
  25. # 将图表保存为pdf文件
  26. plt.savefig('Monthly Flight Comparison Python.pdf', format='pdf')
  27. # 显示图表
  28. plt.show()
  29. R
  30. # 为了绘制折线图
  31. month_plot <- ggplot() + geom_line(data= All_Flights_Combined_Month, aes(x =Month, y=Delay_count, group=Year, color=Year)) +
  32. geom_line(data=All_Flights_Combined_Month, aes(x =Month, y=Total_count, group=Year, color=Year))+ scale_x_discrete(limits = c("Jan","Feb","Mar","Apr","May","Jun","Jul","Aug","Sep","Oct","Nov","Dec"))+
  33. xlab("Months")+
  34. ylab("Number of Flights")+
  35. ggtitle("Flight Count by Month")
  36. # 保存图表为 .pdf
  37. ggplot2::ggsave("Monthly Flight Comparison R.pdf", plot = last_plot(), width = 8, height = 6)

在R代码中,您可以尝试添加 scale_color_manual 来手动设置颜色,但要确保您没有重复使用相同的颜色值。

英文:

I am trying to make a plot on my R look like the one I have on my Python:

如何使我在R中制作的图表看起来像我在Python中有的那样?

This is the data frame for both Python and R.

All_Flights_Combined_Month

Year Month Delay_count Total_count
2003 Jan 151238 552109
2003 Feb 158369 500206
2003 Mar 152156 559342
2003 Apr 125699 527303
2003 May 136551 533782
2003 Jun 163497 536496
2003 Jul 183491 558568
2003 Aug 178979 556984
2003 Sep 113916 527714
2003 Oct 131409 552370
2003 Nov 157157 528171
2003 Dec 206743 555495
2004 Jan 198818 583987
2004 Feb 183658 553876
2004 Mar 183273 601412
2004 Apr 170114 582970
2004 May 191604 594457
2004 Jun 238074 588792
2004 Jul 237670 614166
2004 Aug 215667 623107
2004 Sep 147508 585125
2004 Oct 193951 610037
2004 Nov 197560 584610
2004 Dec 254786 606731
2005 Jan 229809 594924
2005 Feb 184920 545332
2005 Mar 226883 617540
2005 Apr 169221 594492
2005 May 178327 614802
2005 Jun 236724 609195
2005 Jul 268988 627961
2005 Aug 240410 630904
2005 Sep 165541 574253
2005 Oct 186778 592712
2005 Nov 193399 566138
2005 Dec 256861 572343

And these are the codes for Python:

  1. # To plot the line graph
  2. # Create separate data frames for each year
  3. years = All_Flights_Combined_Month[&#39;Year&#39;].unique()
  4. data_frames_month = [All_Flights_Combined_Month[All_Flights_Combined_Month[&#39;Year&#39;] == year] for year in years]
  5. # Create subplots
  6. fig, ax = plt.subplots(figsize=(10, 8))
  7. # Plot Delay_count for each year
  8. for i, year in enumerate(years):
  9. color = &#39;red&#39; if str(year) == &#39;2003&#39; else &#39;green&#39; if str(year) == &#39;2004&#39; else &#39;blue&#39;
  10. ax.plot(data_frames_month[i][&#39;Month&#39;], data_frames_month[i][&#39;Delay_count&#39;], label=f&quot;{year} Delay Count&quot;, color=color)
  11. # Plot Total_Count for each year
  12. for i, year in enumerate(years):
  13. color = &#39;orange&#39; if str(year) == &#39;2003&#39; else &#39;yellow&#39; if str(year) == &#39;2004&#39; else &#39;purple&#39;
  14. ax.plot(data_frames_month[i][&#39;Month&#39;], data_frames_month[i][&#39;Total_Count&#39;], label=f&quot;{year} Total Count&quot;, color=color)
  15. # Set title and labels
  16. ax.set_title(&#39;Flight Count by Month&#39;)
  17. ax.set_xlabel(&#39;Month&#39;)
  18. ax.set_ylabel(&#39;Number of Flights&#39;)
  19. # Add legend
  20. ax.legend(title=&#39;Year&#39;)
  21. # Save the plot as a pdf file
  22. plt.savefig(&#39;Monthly Flight Comparison Python.pdf&#39;, format=&#39;pdf&#39;)
  23. # Show the plot
  24. plt.show()

While this is for R:

  1. {r}
  2. # To plot the line graph
  3. month_plot &lt;- ggplot() + geom_line(data= All_Flights_Combined_Month, aes(x =Month, y=Delay_count, group=Year, color=Year)) +
  4. geom_line(data=All_Flights_Combined_Month, aes(x =Month, y=Total_count, group=Year, color=Year))+ scale_x_discrete(limits = c(&quot;Jan&quot;,&quot;Feb&quot;,&quot;Mar&quot;,&quot;Apr&quot;,&quot;May&quot;,&quot;Jun&quot;,&quot;Jul&quot;,&quot;Aug&quot;,&quot;Sep&quot;,&quot;Oct&quot;,&quot;Nov&quot;,&quot;Dec&quot;))+
  5. xlab(&quot;Months&quot;)+
  6. ylab(&quot;Number of Flights&quot;)+
  7. ggtitle(&quot;Flight Count by Month&quot;)
  8. # To save the plot as .pdf
  9. ggplot2::ggsave(&quot;Monthly Flight Comparison R.pdf&quot;, plot = last_plot(), width = 8, height = 6)

I need the legend and the line colors to match the ones on Python. I hope I have provide sufficient information. Please kindly advice thank you.

I tried adding scale_color_manual to each geom_line but it churned out an error stating that scale_color_manual values has already been used and it will overwrite the previous ones.

答案1

得分: 2

你可以将你的数据转换为长格式,然后使用 paste0gsub 将年份、延误计数和总计数的长格式合并为一个字符串。要获得正确的颜色,你可以使用 scale_color_manual,并使用 breaks 指定正确的顺序,如下所示:

  1. library(ggplot2)
  2. library(dplyr)
  3. library(tidyr)
  4. df %>%
  5. pivot_longer(cols = Delay_count:Total_count) %>%
  6. mutate(Year2 = paste0(Year, " ", gsub("_", " ", name)),
  7. Month = factor(Month, levels = month.abb)) %>%
  8. ggplot(aes(x = Month, y = value, color = Year2, group = Year2)) +
  9. geom_line() +
  10. labs(color = "Year", x = "Month", y = "Number of Flights") +
  11. scale_color_manual(values = c("2003 Delay count" = "red",
  12. "2004 Delay count" = "green",
  13. "2005 Delay count" = "blue",
  14. "2003 Total count" = "orange",
  15. "2004 Total count" = "yellow",
  16. "2005 Total count" = "purple"),
  17. breaks = c("2003 Delay count",
  18. "2004 Delay count",
  19. "2005 Delay count",
  20. "2003 Total count",
  21. "2004 Total count",
  22. "2005 Total count"))

如何使我在R中制作的图表看起来像我在Python中有的那样?

创建于2023-02-19,使用 reprex v2.0.2

英文:

You could transform your data to a longer format and combine the Year and longer format of Delay count and Total count to one string using paste0 and gsub. To get the right colors you could use scale_color_manual, with right order using breaks like this:

  1. library(ggplot2)
  2. library(dplyr)
  3. library(tidyr)
  4. df %&gt;%
  5. pivot_longer(cols = Delay_count:Total_count) %&gt;%
  6. mutate(Year2 = paste0(Year, &quot; &quot;, gsub(&quot;_&quot;, &quot; &quot;, name)),
  7. Month = factor(Month, levels = month.abb)) %&gt;%
  8. ggplot(aes(x = Month, y = value, color = Year2, group = Year2)) +
  9. geom_line() +
  10. labs(color = &quot;Year&quot;, x = &quot;Month&quot;, y = &quot;Number of Flights&quot;) +
  11. scale_color_manual(values = c(&quot;2003 Delay count&quot; = &quot;red&quot;,
  12. &quot;2004 Delay count&quot; = &quot;green&quot;,
  13. &quot;2005 Delay count&quot; = &quot;blue&quot;,
  14. &quot;2003 Total count&quot; = &quot;orange&quot;,
  15. &quot;2004 Total count&quot; = &quot;yellow&quot;,
  16. &quot;2005 Total count&quot; = &quot;purple&quot;),
  17. breaks = c(&quot;2003 Delay count&quot;,
  18. &quot;2004 Delay count&quot;,
  19. &quot;2005 Delay count&quot;,
  20. &quot;2003 Total count&quot;,
  21. &quot;2004 Total count&quot;,
  22. &quot;2005 Total count&quot;))

如何使我在R中制作的图表看起来像我在Python中有的那样?<!-- -->

<sup>Created on 2023-02-19 with reprex v2.0.2</sup>

答案2

得分: 2

  1. 这种类型的问题通常涉及到数据重塑。格式应该是长格式,而数据是宽格式。请参考[这篇帖子](https://stackoverflow.com/questions/2185252/reshaping-data-frame-from-wide-to-long-format)来了解如何将数据从宽格式转换为长格式。
  2. 然后将变量 `Year` `name` 更改为它们之间的交互作用。这是颜色和分组变量。
  3. clrs <- c("2003 Delay Count" = "#e44b3b", "2003 Total Count" = "#edbe70",
  4. "2004 Delay Count" = "#0d720d", "2004 Total Count" = "#f8f867",
  5. "2005 Delay Count" = "#0000cb", "2005 Total Count" = "#6d0469")
  6. All_Flights_Combined_Month %>%
  7. pivot_longer(ends_with("count")) %>%
  8. mutate(Month = factor(Month, levels = month.abb),
  9. Year = interaction(Year, name, sep = " "),
  10. Year = sub("_c", " C", Year)) %>%
  11. select(-name) %>%
  12. ggplot(aes(Month, value, colour = Year, group = Year)) +
  13. geom_line(linewidth = 1.25) +
  14. scale_color_manual(values = clrs) +
  15. theme_minimal()
英文:

This type of problems generally has to do with reshaping the data. The format should be the long format and the data is in wide format. See this post on how to reshape the data from wide to long format.

Then change variable Year or name to the interaction between these two. That's the color and grouping variable.

  1. suppressPackageStartupMessages({
  2. library(dplyr)
  3. library(tidyr)
  4. library(ggplot2)
  5. })
  6. clrs &lt;- c(&quot;2003 Delay Count&quot; = &quot;#e44b3b&quot;, &quot;2003 Total Count&quot; = &quot;#edbe70&quot;,
  7. &quot;2004 Delay Count&quot; = &quot;#0d720d&quot;, &quot;2004 Total Count&quot; = &quot;#f8f867&quot;,
  8. &quot;2005 Delay Count&quot; = &quot;#0000cb&quot;, &quot;2005 Total Count&quot; = &quot;#6d0469&quot;)
  9. All_Flights_Combined_Month %&gt;%
  10. pivot_longer(ends_with(&quot;count&quot;)) %&gt;%
  11. mutate(Month = factor(Month, levels = month.abb),
  12. Year = interaction(Year, name, sep = &quot; &quot;),
  13. Year = sub(&quot;_c&quot;, &quot; C&quot;, Year)) %&gt;%
  14. select(-name) %&gt;%
  15. ggplot(aes(Month, value, colour = Year, group = Year)) +
  16. geom_line(linewidth = 1.25) +
  17. scale_color_manual(values = clrs) +
  18. theme_minimal()

如何使我在R中制作的图表看起来像我在Python中有的那样?<!-- -->

<sup>Created on 2023-02-19 with reprex v2.0.2</sup>


Data

  1. x &lt;- &quot;Year Month Delay_count Total_count
  2. 2003 Jan 151238 552109
  3. 2003 Feb 158369 500206
  4. 2003 Mar 152156 559342
  5. 2003 Apr 125699 527303
  6. 2003 May 136551 533782
  7. 2003 Jun 163497 536496
  8. 2003 Jul 183491 558568
  9. 2003 Aug 178979 556984
  10. 2003 Sep 113916 527714
  11. 2003 Oct 131409 552370
  12. 2003 Nov 157157 528171
  13. 2003 Dec 206743 555495
  14. 2004 Jan 198818 583987
  15. 2004 Feb 183658 553876
  16. 2004 Mar 183273 601412
  17. 2004 Apr 170114 582970
  18. 2004 May 191604 594457
  19. 2004 Jun 238074 588792
  20. 2004 Jul 237670 614166
  21. 2004 Aug 215667 623107
  22. 2004 Sep 147508 585125
  23. 2004 Oct 193951 610037
  24. 2004 Nov 197560 584610
  25. 2004 Dec 254786 606731
  26. 2005 Jan 229809 594924
  27. 2005 Feb 184920 545332
  28. 2005 Mar 226883 617540
  29. 2005 Apr 169221 594492
  30. 2005 May 178327 614802
  31. 2005 Jun 236724 609195
  32. 2005 Jul 268988 627961
  33. 2005 Aug 240410 630904
  34. 2005 Sep 165541 574253
  35. 2005 Oct 186778 592712
  36. 2005 Nov 193399 566138
  37. 2005 Dec 256861 572343&quot;
  38. All_Flights_Combined_Month &lt;- read.table(text = x, header = TRUE)

<sup>Created on 2023-02-19 with reprex v2.0.2</sup>

答案3

得分: 2

以下是代码的翻译部分:

  1. library(tidyverse)
  2. df %>%
  3. pivot_longer(-c(Year, Month)) %>%
  4. mutate(Year = paste(Year, name)) %>%
  5. ggplot(aes(x = Month, y = value, color = factor(Year))) +
  6. geom_line(aes(group = Year)) +
  7. scale_x_discrete(limits = c("Jan", "Feb", "Mar", "Apr", "May", "Jun", "Jul", "Aug", "Sep", "Oct", "Nov", "Dec")) +
  8. scale_color_manual(values = c("purple", "yellow", "gold", "blue", "green", "red")) +
  9. xlab("Months") +
  10. ylab("Number of Flights") +
  11. ggtitle("Flight Count by Month") +
  12. theme_classic()

如何使我在R中制作的图表看起来像我在Python中有的那样?

英文:

Something like this:

  1. library(tidyverse)
  2. df %&gt;%
  3. pivot_longer(-c(Year, Month)) %&gt;%
  4. mutate(Year = paste(Year, name)) %&gt;%
  5. ggplot(aes(x =Month, y=value, color=factor(Year)))+
  6. geom_line(aes(group = Year))+
  7. scale_x_discrete(limits = c(&quot;Jan&quot;,&quot;Feb&quot;,&quot;Mar&quot;,&quot;Apr&quot;,&quot;May&quot;,&quot;Jun&quot;,&quot;Jul&quot;,&quot;Aug&quot;,&quot;Sep&quot;,&quot;Oct&quot;,&quot;Nov&quot;,&quot;Dec&quot;))+
  8. scale_color_manual(values = c(&quot;purple&quot;, &quot;yellow&quot;, &quot;gold&quot;, &quot;blue&quot;, &quot;green&quot;, &quot;red&quot;))+
  9. xlab(&quot;Months&quot;)+
  10. ylab(&quot;Number of Flights&quot;)+
  11. ggtitle(&quot;Flight Count by Month&quot;)+
  12. theme_classic()

如何使我在R中制作的图表看起来像我在Python中有的那样?

答案4

得分: 1

使用基本的R语言功能。首先,将数据重塑为宽格式,然后使用matplot函数自定义axismtext

  1. dat_w <- reshape(dat, idvar='Month', timevar='Year', direction='w')
  2. par(mar=c(5, 6, 4, 2))
  3. matplot(dat_w[, -1], type='l', lty=1, col=2:8, axes=FALSE, ylab='', main='Flight Count By Month')
  4. axis(side=1, at=1:12, labels=dat_w$Month, cex.axis=0.8)
  5. axis(2, axTicks(2), formatC(axTicks(2), format='f', digits=0), las=2, cex.axis=0.8)
  6. mtext('Month', side=1, line=2.5, cex=0.8)
  7. mtext('Number of Flights', side=2, line=4, cex=0.8)
  8. legend('right', c(paste(unique(dat$Year), rep(gsub('_', ' ', names(dat)[3:4]), each=3))),
  9. col=2:8, lty=1, title='Year', cex=0.7)
  10. box()

请注意,我已经保留了代码部分的原文,只翻译了注释和函数参数的内容。

英文:

Using just base R. First, reshape into wide format, then use matplot and customize axis and mtext a little.

  1. dat_w &lt;- reshape(dat, idvar=&#39;Month&#39;, timevar=&#39;Year&#39;, direction=&#39;w&#39;)
  2. par(mar=c(5, 6, 4, 2))
  3. matplot(dat_w[, -1], type=&#39;l&#39;, lty=1, col=2:8, axes=FALSE, ylab=&#39;&#39;, main=&#39;Flight Count By Month&#39;)
  4. axis(side=1, at=1:12, labels=dat_w$Month, cex.axis=.8)
  5. axis(2, axTicks(2), formatC(axTicks(2), format=&#39;f&#39;, digits=0), las=2, cex.axis=.8)
  6. mtext(&#39;Month&#39;, side=1, line=2.5, cex=.8); mtext(&#39;Number of Flights&#39;, 2, 4, cex=.8)
  7. legend(&#39;right&#39;, c(paste(unique(dat$Year), rep(gsub(&#39;_&#39;, &#39; &#39;, names(dat)[3:4]), each=3))),
  8. col=2:8, lty=1, title=&#39;Year&#39;, cex=.7)
  9. box()

如何使我在R中制作的图表看起来像我在Python中有的那样?

huangapple
  • 本文由 发表于 2023年2月19日 17:24:20
  • 转载请务必保留本文链接:https://go.coder-hub.com/75499124.html
匿名

发表评论

匿名网友

:?: :razz: :sad: :evil: :!: :smile: :oops: :grin: :eek: :shock: :???: :cool: :lol: :mad: :twisted: :roll: :wink: :idea: :arrow: :neutral: :cry: :mrgreen:

确定