如何使用grep或grepl来识别变量?

huangapple go评论198阅读模式
英文:

How to use grep or grepl in order to identify variables?

问题

下午好。以下是情况:

我有一个名为"dep"的变量,定义如下:

  1. dep <- c("txn_amt", "txn_cnt", "acct_cnt")

我将fn_form定义为:

  1. fn_form = c(dep, paste("log(", dep, "+1)", sep=""), paste("log(", dep, ")", sep=""))

当我运行:

  1. fnform <- "count"
  2. if (length(grep("log", fn_form)) > 0) {
  3. fnform <- "log"
  4. }
  5. if (grepl("log\\(\\+1\\)", fn_form) > 0) {
  6. fnform <- "log+1"
  7. }

它没有保存"count"、"log"和"log+1"。它只保存了"count"和"log",因为R没有识别出"log"和"log+1"之间的区别。我该如何修复这个问题,以便保存这三个值?

英文:

good afternoon. Here is the situation:

I have a variable called "dep" that is defined as:

  1. dep &lt;- c(&quot;txn_amt&quot;, &quot;txn_cnt&quot;, &quot;acct_cnt&quot;)

I defined fn_form as:

  1. fn_form = c(dep, paste(&quot;log(&quot;,dep,&quot;+1)&quot;,sep=&quot;&quot;),paste(&quot;log(&quot;,dep,&quot;)&quot;,sep=&quot;&quot;))

When I ran:

  1. fnform &lt;- &quot;count&quot;
  2. if(length(grep(&quot;log&quot;,fn_form))&gt;0) {
  3. fnform &lt;- &quot;log&quot;
  4. }
  5. if(grepl(&quot;log\\(\\+1\\)&quot;,fn_form)&gt;0) {
  6. fnform &lt;- &quot;log+1&quot;
  7. }

it is not saving "count", "log" and "log+1". It is only saving "count" and "log" because R is not recognizing the difference between log and log+1. How can I fix this in order to save for these 3 values?

答案1

得分: 1

有一些问题。首先,您尝试评估fn_form的所有元素,但然后将fn_form替换为单个值。更重要的是,您提出的问题是,您正在搜索"log(+1)",但您应该搜索"log(.*+1)",因为变量名位于附加到log的括号内。以下是我如何做的:

  1. dep <- c("txn_amt", "txn_cnt", "acct_cnt")
  2. fn_form = c(dep, paste("log(",dep,"+1)",sep=""),paste("log(",dep,")",sep=""))
  3. sapply(fn_form, function(x){
  4. out <- "count"
  5. if(grepl("log\\(", x)) out <- "log"
  6. if(grepl("log\\(.*\\+1", x)) out <- "log+1"
  7. out
  8. })
  9. #> txn_amt txn_cnt acct_cnt log(txn_amt+1) log(txn_cnt+1)
  10. #> "count" "count" "count" "log+1" "log+1"
  11. #> log(acct_cnt+1) log(txn_amt) log(txn_cnt) log(acct_cnt)
  12. #> "log+1" "log" "log" "log"

创建于2023年06月01日,使用reprex v2.0.2

英文:

There were a couple of problems. First, you're trying to evaluate all elements of fn_form, but then replace fn_form with a single value. More to the point you were raising though, is that you were searching for "log(+1)", when you should have been searching for "log(.*+1)" - because the variable name is inside the parentheses attached to log. Here's how I would do it:

  1. dep &lt;- c(&quot;txn_amt&quot;, &quot;txn_cnt&quot;, &quot;acct_cnt&quot;)
  2. fn_form = c(dep, paste(&quot;log(&quot;,dep,&quot;+1)&quot;,sep=&quot;&quot;),paste(&quot;log(&quot;,dep,&quot;)&quot;,sep=&quot;&quot;))
  3. sapply(fn_form, function(x){
  4. out &lt;- &quot;count&quot;
  5. if(grepl(&quot;log\\(&quot;, x))out &lt;- &quot;log&quot;
  6. if(grepl(&quot;log\\(.*\\+1&quot;, x))out &lt;- &quot;log+1&quot;
  7. out
  8. })
  9. #&gt; txn_amt txn_cnt acct_cnt log(txn_amt+1) log(txn_cnt+1)
  10. #&gt; &quot;count&quot; &quot;count&quot; &quot;count&quot; &quot;log+1&quot; &quot;log+1&quot;
  11. #&gt; log(acct_cnt+1) log(txn_amt) log(txn_cnt) log(acct_cnt)
  12. #&gt; &quot;log+1&quot; &quot;log&quot; &quot;log&quot; &quot;log&quot;

<sup>Created on 2023-06-01 with reprex v2.0.2</sup>

huangapple
  • 本文由 发表于 2023年6月2日 02:14:24
  • 转载请务必保留本文链接:https://go.coder-hub.com/76384630.html
匿名

发表评论

匿名网友

:?: :razz: :sad: :evil: :!: :smile: :oops: :grin: :eek: :shock: :???: :cool: :lol: :mad: :twisted: :roll: :wink: :idea: :arrow: :neutral: :cry: :mrgreen:

确定