July 7, 2023, 00:53:42

for loop in R - repeating a block of code iteratively

Question

I have a block of code that extracts the top 50 values from a specific column in my dataframe, outputting a new dataframe named accordingly. I want to repeat this process for every column in my dataframe (50-100 columns), as follows. How can I automate this?

column1_top50 <- dataframe %>%
  arrange(desc(column1)) %>%
  slice_head(n = 50) %>%
  select(sample_name, column1)

column2_top50 <- dataframe %>%
  arrange(desc(column2)) %>%
  slice_head(n = 50) %>%
  select(sample_name, column2)

column3_top50 <- dataframe %>%
  arrange(desc(column3)) %>%
  slice_head(n = 50) %>%
  select(sample_name, column3)

Answer 1

Score: 1

I can't test this without sample data, but here's an option using purrr::map_dfr: you "loop" through each column in the data.frame and return its top 50 numeric values.

library(dplyr)
library(purrr)
set.seed(100)
tmp = data.frame(
  col1 = rnorm(100),
  col2 = rnorm(100),
  col3 = rnorm(100)
)
tmp %>%
  map_dfr(~ head(sort(.x, decreasing = TRUE), 50))
# A tibble: 50 × 3
    col1  col2  col3
   <dbl> <dbl> <dbl>
 1  2.58  2.17  2.73
 2  2.45  1.90  2.61
 3  2.31  1.65  2.55
 4  1.90  1.62  1.88
 5  1.82  1.58  1.79
 6  1.76  1.36  1.35
 7  1.73  1.35  1.35
 8  1.65  1.27  1.24
 9  1.43  1.24  1.23
10  1.40  1.03  1.14
# … with 40 more rows
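
Note that mapping over the columns this way sorts each column independently, so the result no longer records which sample each value came from. If the sample names are needed, as in the question, something along these lines might work. It is only a sketch: it assumes the identifier column is called sample_name, and tmp2, value_cols and top50 are made-up names for illustration. It returns a named list with one data frame per column rather than separate objects.

```r
library(dplyr)
library(purrr)

set.seed(100)
# Hypothetical example data: an identifier column plus three numeric columns
tmp2 <- data.frame(
  sample_name = paste0("s", 1:100),
  col1 = rnorm(100),
  col2 = rnorm(100),
  col3 = rnorm(100)
)

# Every column except the identifier
value_cols <- setdiff(names(tmp2), "sample_name")

# One data frame per column: the 50 rows with the largest values,
# keeping sample_name next to the values
top50 <- value_cols %>%
  set_names(paste0(value_cols, "_top50")) %>%
  map(function(col) {
    tmp2 %>%
      arrange(desc(.data[[col]])) %>%
      slice_head(n = 50) %>%
      select(sample_name, all_of(col))
  })
```

Individual results are then available as top50$col1_top50, top50$col2_top50, and so on. If separate objects in the workspace really are required, list2env(top50, envir = globalenv()) would create them from the list.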

Answer 2

Score: 1

I'm not sure a for loop is the most efficient way to do this (purrr would probably be faster). I'll also note that creating a data frame inside a for loop is generally frowned upon (setting up an empty placeholder before the loop and filling it in would be much faster), but without any sample data I tried to generalize this for the 50-100 columns you said you have:

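library(tidyverse)
df = data.frame(sample_name = rep("example", 100),
                x1 = rnorm(100),
                x2 = rnorm(100),
                x3 = rnorm(100))
# Start at column 2 so that sample_name itself is not ranked
for (i in 2:ncol(df)) {
  # Sort by the i-th column, largest first, and keep the top 50 rows
  df1 <- df %>% arrange(desc(.[[i]])) %>%
    slice_head(n = 50)
  # Creates x1_top50, x2_top50, ... holding sample_name and column i
  assign(paste0(colnames(df)[[i]], "_top50"), df1[, c(1, i)])
}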
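
The remark about setting up a placeholder before the loop could be sketched roughly as follows. This is one possible reading, not the answerer's code: it preallocates a named list with one slot per column and fills it in the loop, instead of creating a separate object with assign() on each pass. It uses base R only, assumes the identifier column is called sample_name, and value_cols and results are made-up names.

```r
set.seed(1)
# Hypothetical example data, same shape as in the answer above
df <- data.frame(
  sample_name = paste0("sample", 1:100),
  x1 = rnorm(100),
  x2 = rnorm(100),
  x3 = rnorm(100)
)

# Every column except the identifier
value_cols <- setdiff(names(df), "sample_name")

# Preallocate one slot per value column and name the slots up front
results <- vector("list", length(value_cols))
names(results) <- paste0(value_cols, "_top50")

for (i in seq_along(value_cols)) {
  col <- value_cols[i]
  # Row indices of the 50 largest values in this column
  idx <- head(order(df[[col]], decreasing = TRUE), 50)
  # Keep the identifier next to the ranked column
  results[[i]] <- df[idx, c("sample_name", col)]
}
```

Keeping the results in one list means later steps can iterate over them, for example with lapply(results, head), without looking objects up by constructed names.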
