February 24, 2023 03:56:20

Remove rows from a data frame that match on multiple criteria

Question

I wish to remove rows of my data frame that contain a specific pattern and I wish to use tidyverse syntax if possible.

I want to remove rows where col1 is "cat" and any of col2:col4 contains one of the following words: dog, fox, or cow. For this example, that removes rows 1 and 4 from the original data.

Here's a sample dataset:

df <- data.frame(col1 = c("cat", "fox", "dog", "cat", "pig"),
                 col2 = c("lion", "tiger", "elephant", "dog", "cow"),
                 col3 = c("bird", "cow", "sheep", "fox", "dog"),
                 col4 = c("dog", "cat", "cat", "cow", "fox"))

I've tried a number of across variants but constantly run into issues. Here is my latest attempt:

filtered_df <- df %>%
  filter(!(col1 == "cat" & !any(cowfoxdog <- across(col2:col4, ~ . %in% c("cow", "fox", "dog")))))

This returns the following error:

Error in `filter()`:
! Problem while computing `..1 = !...`.
Caused by error in `FUN()`:
! only defined on a data frame with all numeric variables

Answer 1

Score: 5

You can use if_any(). For a more robust test, I first added a row where col1 == "cat" but "dog", "fox", or "cow" don't appear in columns 2-4.

library(dplyr)

# Add a row where col1 is "cat" but no target word appears in col2:col4
df <- df %>%
  add_row(col1 = "cat", col2 = "sheep", col3 = "lion", col4 = "tiger")

# Drop rows where col1 is "cat" and any of col2:col4 holds a target word
df %>%
  filter(!(col1 == "cat" & if_any(col2:col4, \(x) x %in% c("dog", "fox", "cow"))))

  col1     col2  col3  col4
1  fox    tiger   cow   cat
2  dog elephant sheep   cat
3  pig      cow   dog   fox
4  cat    sheep  lion tiger
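
As a quick sanity check, the same filter can be run against the original five-row data from the question, before the extra row is added. This is only a minimal sketch: it assumes dplyr 1.0.4 or later (where if_any() is available) and R 4.1 or later (for the \(x) lambda syntax), and the names targets and result are introduced here purely for illustration.

```r
library(dplyr)

# Original sample data from the question
df <- data.frame(col1 = c("cat", "fox", "dog", "cat", "pig"),
                 col2 = c("lion", "tiger", "elephant", "dog", "cow"),
                 col3 = c("bird", "cow", "sheep", "fox", "dog"),
                 col4 = c("dog", "cat", "cat", "cow", "fox"))

# Illustrative name (not from the original post): words to look for in col2:col4
targets <- c("dog", "fox", "cow")

# Drop rows where col1 is "cat" AND any of col2:col4 holds a target word
result <- df %>%
  filter(!(col1 == "cat" & if_any(col2:col4, \(x) x %in% targets)))

# Inspect which col1 values survive the filter
result$col1
```

Rows 1 and 4 are the only ones that satisfy both conditions, so they are dropped and the fox, dog, and pig rows remain, which is what the question asks for.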

Answer 2

Score: 1

One way is to use the filter() function and combine the conditions with logical operators:

library(tidyverse)

pattern1 <- c("cat")
pattern2 <- c("dog", "fox", "cow")

df %>%
  filter(!(col1 == pattern1 &
             (col2 %in% pattern2 |
              col3 %in% pattern2 |
              col4 %in% pattern2)))

  col1     col2  col3 col4
1  fox    tiger   cow  cat
2  dog elephant sheep  cat
3  pig      cow   dog  fox
