ggplot line plot: is there a way to depict the data points under or over the line plot depending on what looks better?

huangapple go评论64阅读模式
英文:

ggplot line plot: is there a way to depict the data points under or over the line plot depending on what looks better?

问题

我想根据数据MIA_YEAR创建一个线图。它已经看起来相当不错了,但我想将那些(几乎)触及线的数据点移动到图下方。或者用数学术语来说:如果斜率增加,数据点应该在线图下方,如果斜率减小,数据点应该在线上方,而不是所有数据点都在线上方。 到目前为止的样子如下 -->

这是我的代码。我尝试包括类似这样的内容
vjust = ifelse(diff(Percent) < 0, -0.9, 0.9)
但它在ggplot中不起作用,我得到了这个错误,尽管该对象明确位于MIA_YEAR内:(Error in diff(y = Percent) : object 'Percent' not found)
(也许您还可以解释一下为什么它不起作用?)

谢谢你的帮助!

编辑1. 这是数据(我希望我理解了复制的点)

structure(list(YEAR = structure(1:13, .Label = c("1990", "2000", 
"2010", "2011", "2012", "2013", "2014", "2015", "2016", "2017", 
"2018", "2019", "2020"), class = "factor"), n = c(25L, 27L, 74L, 
95L, 79L, 79L, 98L, 98L, 102L, 79L, 101L, 86L, 99L), Percent = c(26.6, 
25.23, 36.63, 48.72, 44.63, 36.24, 43.75, 42.24, 44.54, 34.96, 
46.98, 35.98, 41.42)), class = c("grouped_df", "tbl_df", "tbl", 
"data.frame"), row.names = c(NA, -13L), groups = structure(list(
    YEAR = structure(1:13, .Label = c("1990", "2000", "2010", 
    "2011", "2012", "2013", "2014", "2015", "2016", "2017", "2018", 
    "2019", "2020"), class = "factor"), .rows = structure(list(
        1L, 2L, 3L, 4L, 5L, 6L, 7L, 8L, 9L, 10L, 11L, 12L, 13L), ptype = integer(0), class = c("vctrs_list_of", 
    "vctrs_vctr", "list"))), class = c("tbl_df", "tbl", "data.frame"
), row.names = c(NA, -13L), .drop = TRUE))
英文:

I wanted to create a line plot based on the data MIA_YEAR. It already looks quite good but I want to move those data points under the plot that (almost) touch the line. Or in mathematial terms: If the slope increases the data points should be under the line plot and if the slope decreases the data points should be over the plot instead of having all data points over the line. that is how it looks like until now -->.

MIA_Year %&gt;%
  ggplot(aes(x = YEAR, y = Percent)) +
  geom_line(group=1, color = &quot;steelblue&quot;, linewidth = 1) + #group = 1 needed when x is a factor
  geom_text(data = MIA_Year, aes(label=n), vjust = -0.9) +
  scale_y_continuous(limits = c(0, 100), breaks = seq(0, 100, by = 10)) +
  labs(x = &quot;&quot;, y = &quot;Percent of Articles&quot;) +
  theme(axis.text.x = element_text(angle = 45, hjust = 1), panel.border = element_rect(fill = NA), panel.background = element_rect(fill = &quot;white&quot;), panel.grid = element_line(colour = &quot;grey85&quot;))

This is my code. I tried including something like this
vjust = ifelse(diff(Percent) &lt; 0, -0.9, 0.9),
but it did not work with ggplot, I got this error, although the object definitely is inside of MIA_YEAR: (Error in diff(y = Percent) : object 'Percent' not found)
(maybe you could also explain to me, why it does not work?)

Thanks a lot for your help!

Edit 1. Here is the data (I hope I got the point with the copying right)

structure(list(YEAR = structure(1:13, .Label = c(&quot;1990&quot;, &quot;2000&quot;, 
&quot;2010&quot;, &quot;2011&quot;, &quot;2012&quot;, &quot;2013&quot;, &quot;2014&quot;, &quot;2015&quot;, &quot;2016&quot;, &quot;2017&quot;, 
&quot;2018&quot;, &quot;2019&quot;, &quot;2020&quot;), class = &quot;factor&quot;), n = c(25L, 27L, 74L, 
95L, 79L, 79L, 98L, 98L, 102L, 79L, 101L, 86L, 99L), Percent = c(26.6, 
25.23, 36.63, 48.72, 44.63, 36.24, 43.75, 42.24, 44.54, 34.96, 
46.98, 35.98, 41.42)), class = c(&quot;grouped_df&quot;, &quot;tbl_df&quot;, &quot;tbl&quot;, 
&quot;data.frame&quot;), row.names = c(NA, -13L), groups = structure(list(
    YEAR = structure(1:13, .Label = c(&quot;1990&quot;, &quot;2000&quot;, &quot;2010&quot;, 
    &quot;2011&quot;, &quot;2012&quot;, &quot;2013&quot;, &quot;2014&quot;, &quot;2015&quot;, &quot;2016&quot;, &quot;2017&quot;, &quot;2018&quot;, 
    &quot;2019&quot;, &quot;2020&quot;), class = &quot;factor&quot;), .rows = structure(list(
        1L, 2L, 3L, 4L, 5L, 6L, 7L, 8L, 9L, 10L, 11L, 12L, 13L), ptype = integer(0), class = c(&quot;vctrs_list_of&quot;, 
    &quot;vctrs_vctr&quot;, &quot;list&quot;))), class = c(&quot;tbl_df&quot;, &quot;tbl&quot;, &quot;data.frame&quot;
), row.names = c(NA, -13L), .drop = TRUE))

答案1

得分: 2

以下是翻译好的代码部分:

library(ggplot2)
library(dplyr, warn = FALSE)

MIA_Year %>%
  ungroup() %>%
  arrange(YEAR) %>%
  mutate(
    sign = c(0, diff(Percent)) < 0,
    lead_sign = lead(sign, default = FALSE),
    nudge_y = 3 * ifelse(sign & !lead_sign, -1, 1),
    vjust = ifelse(sign & !lead_sign, 1, 0)
  ) %>%
  ggplot(aes(x = YEAR, y = Percent)) +
  geom_line(group = 1, color = "steelblue", linewidth = 1) +
  geom_text(aes(y = Percent + nudge_y, label = n, vjust = vjust)) +
  scale_y_continuous(limits = c(0, 100), breaks = seq(0, 100, by = 10)) +
  labs(x = "", y = "文章百分比") +
  theme(
    axis.text.x = element_text(angle = 45, hjust = 1),
    panel.border = element_rect(fill = NA),
    panel.background = element_rect(fill = "white"),
    panel.grid = element_line(colour = "grey85")
  )

请注意,我将"Percent of Articles"翻译为了"文章百分比"。

英文:

Building on your idea, here is a working approach where - as mentioned in my comment - I "nudge" the y position of the labels and use vjust for the alignment. Also note, that I have taken the lead first differences or slopes into account to place the labels.

library(ggplot2)
library(dplyr, warn = FALSE)

MIA_Year |&gt;
  ungroup() |&gt;
  arrange(YEAR) |&gt;
  mutate(
    sign = c(0, diff(Percent)) &lt; 0,
    lead_sign = lead(sign, default = FALSE),
    nudge_y = 3 * ifelse(sign &amp; !lead_sign, -1, 1),
    vjust = ifelse(sign &amp; !lead_sign, 1, 0)
  ) |&gt;
  ggplot(aes(x = YEAR, y = Percent)) +
  geom_line(group = 1, color = &quot;steelblue&quot;, linewidth = 1) +
  geom_text(aes(y = Percent + nudge_y, label = n, vjust = vjust)) +
  scale_y_continuous(limits = c(0, 100), breaks = seq(0, 100, by = 10)) +
  labs(x = &quot;&quot;, y = &quot;Percent of Articles&quot;) +
  theme(
    axis.text.x = element_text(angle = 45, hjust = 1),
    panel.border = element_rect(fill = NA),
    panel.background = element_rect(fill = &quot;white&quot;),
    panel.grid = element_line(colour = &quot;grey85&quot;)
  )

ggplot line plot: is there a way to depict the data points under or over the line plot depending on what looks better?<!-- -->

huangapple
  • 本文由 发表于 2023年6月15日 21:05:02
  • 转载请务必保留本文链接:https://go.coder-hub.com/76482794.html
匿名

发表评论

匿名网友

:?: :razz: :sad: :evil: :!: :smile: :oops: :grin: :eek: :shock: :???: :cool: :lol: :mad: :twisted: :roll: :wink: :idea: :arrow: :neutral: :cry: :mrgreen:

确定