2023年6月5日 19:00:20go评论102阅读模式

英文:

Improve the performance of recursive sampling function

问题

以下是您要翻译的内容：

作为对我的先前问题的后续，我对改进现有的递归抽样函数性能感兴趣。

通过递归抽样，我指的是随机选择多达n个独特的未暴露ID，用于给定的已暴露ID，然后随机选择多达n个独特的未暴露ID，用于另一个已暴露ID的剩余未暴露ID。如果对于给定的已暴露ID没有剩余的未暴露ID，那么该已暴露ID将被排除在外。

原始函数如下：

recursive_sample <- function(data, n) {
 
 groups <- unique(data[["exposed"]])
 out <- data.frame(exposed = character(), unexposed = character())
 
 for (group in groups) {
  
  chosen <- data %>%
   filter(exposed == group,
          !unexposed %in% out$unexposed) %>%
   group_by(unexposed) %>%
   slice(1) %>%
   ungroup() %>%
   sample_n(size = min(n, nrow(.))) 
  
  out <- rbind(out, chosen)
  
 }
 
 out
 
}

我能够创建一个更有效的版本，如下所示：

recursive_sample2 <- function(data, n) {
 
 groups <- unique(data[["exposed"]])
 out <- tibble(exposed = integer(), unexposed = integer())
 
 for (group in groups) {
  
  chosen <- data %>%
   filter(exposed == group,
          !unexposed %in% out$unexposed) %>%
   filter(!duplicated(unexposed)) %>%
   sample_n(size = min(n, nrow(.))) 
  
  out <- bind_rows(out, chosen)
  
 }
 
 out
 
}

示例数据和性能基准测试：

set.seed(123)
df <- tibble(exposed = rep(1:100, each = 100),
             unexposed = sample(1:7000, 10000, replace = TRUE))
microbenchmark(f1 = recursive_sample(df, 5),
               f2 = recursive_sample2(df, 5),
               times = 10)
Unit: milliseconds
 expr       min        lq      mean    median        uq      max neval cld
   f1 1307.7198 1316.5276 1379.0533 1371.3952 1416.6360 1540.955    10   b
   f2  839.0086  865.2547  914.8327  901.2288  970.9518 1036.170    10  a

然而，对于我的实际数据集，我需要一个更高效（即更快）的函数。欢迎提供更高效版本的任何想法，无论是在data.table中，涉及并行化还是其他方法。

英文:

As a follow-up to my previous question, I'm interested in improving the performance of the existing recursive sampling function.

By recursive sampling I mean randomly choosing up to n unique unexposed IDs for a given exposed ID, and the randomly choosing up to n unique unexposed IDs from the remaining unexposed IDs for another exposed ID. If there are no remaining unexposed IDs for a given exposed ID, then the exposed ID is left out.

The original function is as follows:

recursive_sample &lt;- function(data, n) {
 
 groups &lt;- unique(data[[&quot;exposed&quot;]])
 out &lt;- data.frame(exposed = character(), unexposed = character())
 
 for (group in groups) {
  
  chosen &lt;- data %&gt;%
   filter(exposed == group,
          !unexposed %in% out$unexposed) %&gt;%
   group_by(unexposed) %&gt;%
   slice(1) %&gt;%
   ungroup() %&gt;%
   sample_n(size = min(n, nrow(.))) 
  
  out &lt;- rbind(out, chosen)
  
 }
 
 out
 
}

I was able to create a more efficient one as follows:

recursive_sample2 &lt;- function(data, n) {
 
 groups &lt;- unique(data[[&quot;exposed&quot;]])
 out &lt;- tibble(exposed = integer(), unexposed = integer())
 
 for (group in groups) {
  
  chosen &lt;- data %&gt;%
   filter(exposed == group,
          !unexposed %in% out$unexposed) %&gt;%
   filter(!duplicated(unexposed)) %&gt;%
   sample_n(size = min(n, nrow(.))) 
  
  out &lt;- bind_rows(out, chosen)
  
 }
 
 out
 
}

Sample data and bechmarking:

set.seed(123)
df &lt;- tibble(exposed = rep(1:100, each = 100),
             unexposed = sample(1:7000, 10000, replace = TRUE))
microbenchmark(f1 = recursive_sample(df, 5),
               f2 = recursive_sample2(df, 5),
               times = 10)
Unit: milliseconds
 expr       min        lq      mean    median        uq      max neval cld
   f1 1307.7198 1316.5276 1379.0533 1371.3952 1416.6360 1540.955    10   b
   f2  839.0086  865.2547  914.8327  901.2288  970.9518 1036.170    10  a

However, for my actual dataset, I would need an even more efficient (i.e., quicker) function. Any ideas for a more efficient version, whether in data.table, involving parallelisation or other approaches are welcome.

答案1

得分: 2

以下是翻译好的代码部分：

# 更新，改进更多
更简洁的解决方案可能是使用 `Reduce` + `split`，其中我们首先对 `data` 的行进行洗牌，然后我们按组进行**迭代**抽样
```R
ftic <- function(data, n) {
    Reduce(
        \(x, y) {
            rbind(x, head(subset(y, !unexposed %in% x$unexposed), n))
        },
        split(data[sample(1:nrow(data)), ], ~exposed)
    )
}

下面是一个更具挑战性的性能测试，即具有1e6行的data，其中的方法包括：

ftmfmnk <- function(data, n) {
    groups <- unique(data[["exposed"]])
    out <- tibble(exposed = integer(), unexposed = integer())
    for (group in groups) {
        chosen <- data %>%
            filter(
                exposed == group,
                !unexposed %in% out$unexposed
            ) %>%
            filter(!duplicated(unexposed)) %>%
            sample_n(size = min(n, nrow(.)))
        out <- bind_rows(out, chosen)
    }
    out
}
fminem <- function(data, n) {
    groups <- unique(data[["exposed"]])
    # working on vectors is faster
    id <- 1:nrow(data)
    i <- vector("integer")
    unexposed2 <- vector(class(data$unexposed))
    ex <- data$exposed
    ux <- data$unexposed
    for (group in groups) {
        f1 <- ex == group # first filter
        f2 <- !ux[f1] %in% unexposed2 # 2nd filter (only on those that match 1st)
        id3 <- id[f1][f2][!duplicated(ux[f1][f2])] # check duplicates only on needed
        # and select necesary row ids
        is <- sample(id3, size = min(length(id3), n)) # sample row ids
        i <- c(i, is) # add to list
        unexposed2 <- ux[i] # resave unexposed2
    }
    out <- data[i, ] # only one data.frame subset
    out$id <- NULL
    out
}
ftic <- function(data, n) {
    Reduce(
        \(x, y) {
            rbind(x, head(subset(y, !unexposed %in% x$unexposed), n))
        },
        split(data[sample(1:nrow(data)), ], ~exposed)
    )
}

以下是基准测试：

set.seed(123)
df <- tibble(
    exposed = rep(1:1000, each = 1000),
    unexposed = sample(1:70000, 1000000, replace = TRUE)
)
mbm <- microbenchmark(
    tmfmnk = ftmfmnk(df, 5),
    minem = fminem(df, 5),
    tic = ftic(df, 5),
    times = 10
)
boxplot(mbm)

我们将会看到：

> mbm
Unit: milliseconds
   expr        min         lq       mean     median         uq        max neval
 tmfmnk 36809.9563 44276.3545 43780.8407 44897.2661 46175.1031 46948.8906    10
  minem  5361.2796  5932.7752  5923.8811  6010.7775  6047.3716  6233.2919    10
    tic   504.5749   519.5997   641.7935   607.2825   729.4545   868.1283    10

先前的朴素方法

我在这里没有任何高级技巧，只是使用了for循环的动态规划方案，我相信一定有比我更高效的方法

dp <- function(df, n) {
    d <- table(df)
    out <- list()
    rnm <- row.names(d)
    cnm <- colnames(d)
    for (i in 1:nrow(d)) {
        v <- which(d[i, ] > 0)
        l <- length(v)
        idx <- v[sample(l, min(l, n))]
        out[[i]] <- data.frame(exposed = rnm[i], unexposed = cnm[idx])
        d[, idx] <- 0
    }
    do.call(rbind, out)
}

基准测试如下：

set.seed(123)
df <- tibble(
    exposed = rep(1:100, each = 100),
    unexposed = sample(1:7000, 10000, replace = TRUE)
)
mbm <- microbenchmark(
    f1 = recursive_sample(df, 5),
    f2 = recursive_sample2(df, 5),
    f3 = dp(df, 5),
    times = 10
)
boxplot(mbm)

结果如下：

> mbm
Unit: milliseconds
 expr       min        lq      mean    median        uq       max neval
   f1 1271.0135 1302.4310 1449.2193 1326.7630 1686.4329 1888.4549    10
   f2  507.9350  516.8854  617.0313  559.0422  706.4300  801.0124    10
   f3  212.8944  247.0066  278.1792  271.9010  309.7377  354.4320    10

此外，要检查结果 res <- dp(df, 5)，我们可以使用以下代码：

> table(res$exposed)
  1  10 100  11  12  13  14  15  16  17  18  19   2  20  21  22  23  24  25  26
  5   5   5   5   5   5   5   5   5   5   5   5   5   5   5   5   5   5   5   5
 27  28  29   3  30  31  32  33  34  
<details>
<summary>英文:</summary>
# Update, with More Improvement
A more concise solution might be using `Reduce` + `split`, where we shuffle the rows of `data` first and then we samples by groups **iteratively**

ftic <- function(data, n) {
Reduce(
(x, y) {
rbind(x, head(subset(y, !unexposed %in% x$unexposed), n))
},
split(data[sample(1:nrow(data)), ], ~exposed)
)
}

and below is a ***tougher*** pressure test, i.e., **`data` of `1e6` rows**, where the approaches include:

ftmfmnk <- function(data, n) {
groups <- unique(data[["exposed"]])
out <- tibble(exposed = integer(), unexposed = integer())

for (group in groups) {
    chosen &lt;- data %&gt;%
        filter(
            exposed == group,
            !unexposed %in% out$unexposed
        ) %&gt;%
        filter(!duplicated(unexposed)) %&gt;%
        sample_n(size = min(n, nrow(.)))
    out &lt;- bind_rows(out, chosen)
}
out

}

fminem <- function(data, n) {
groups <- unique(data[["exposed"]])
# working on vectors is faster
id <- 1:nrow(data)
i <- vector("integer")
unexposed2 <- vector(class(data$unexposed))
ex <- data$exposed
ux <- data$unexposed

for (group in groups) {
    f1 &lt;- ex == group # first filter
    f2 &lt;- !ux[f1] %in% unexposed2 # 2nd filter (only on those that match 1st)
    id3 &lt;- id[f1][f2][!duplicated(ux[f1][f2])] # check duplicates only on needed
    # and select necesary row ids
    is &lt;- sample(id3, size = min(length(id3), n)) # sample row ids
    i &lt;- c(i, is) # add to list
    unexposed2 &lt;- ux[i] # resave unexposed2
}
out &lt;- data[i, ] # only one data.frame subset
out$id &lt;- NULL
out

}

ftic <- function(data, n) {
Reduce(
(x, y) {
rbind(x, head(subset(y, !unexposed %in% x$unexposed), n))
},
split(data[sample(1:nrow(data)), ], ~exposed)
)
}


The benchmarking is as below

set.seed(123)
df <- tibble(
exposed = rep(1:1000, each = 1000),
unexposed = sample(1:70000, 1000000, replace = TRUE)
)

mbm <- microbenchmark(
tmfmnk = ftmfmnk(df, 5),
minem = fminem(df, 5),
tic = ftic(df, 5),
times = 10
)

boxplot(mbm)

and we will see that

> mbm
Unit: milliseconds
expr min lq mean median uq max neval
tmfmnk 36809.9563 44276.3545 43780.8407 44897.2661 46175.1031 46948.8906 10
minem 5361.2796 5932.7752 5923.8811 6010.7775 6047.3716 6233.2919 10
tic 504.5749 519.5997 641.7935 607.2825 729.4545 868.1283 10


[![enter image description here][1]][1]
-------------------------------------------
# Previous Na&#239;ve Approach 
I don&#39;t have any advanced technique here, but just a dynamic programming scheme with `for` loops, and I believe there must be more performant approaches than mine

dp <- function(df, n) {
d <- table(df)
out <- list()
rnm <- row.names(d)
cnm <- colnames(d)
for (i in 1:nrow(d)) {
v <- which(d[i, ] > 0)
l <- length(v)
idx <- v[sample(l, min(l, n))]
out[[i]] <- data.frame(exposed = rnm[i], unexposed = cnm[idx])
d[, idx] <- 0
}
do.call(rbind, out)
}

and the benchmarking

set.seed(123)
df <- tibble(
exposed = rep(1:100, each = 100),
unexposed = sample(1:7000, 10000, replace = TRUE)
)

mbm <- microbenchmark(
f1 = recursive_sample(df, 5),
f2 = recursive_sample2(df, 5),
f3 = dp(df, 5),
times = 10
)

boxplot(mbm)

shows

> mbm
Unit: milliseconds
expr min lq mean median uq max neval
f1 1271.0135 1302.4310 1449.2193 1326.7630 1686.4329 1888.4549 10
f2 507.9350 516.8854 617.0313 559.0422 706.4300 801.0124 10
f3 212.8944 247.0066 278.1792 271.9010 309.7377 354.4320 10


[![enter image description here][2]][2]
Also, to check the result `res &lt;- dp(df, 5)`, we can use

> table(res$exposed)

1 10 100 11 12 13 14 15 16 17 18 19 2 20 21 22 23 24 25 26
5 5 5 5 5 5 5 5 5 5 5 5 5 5 5 5 5 5 5 5
27 28 29 3 30 31 32 33 34 35 36 37 38 39 4 40 41 42 43 44
5 5 5 5 5 5 5 5 5 5 5 5 5 5 5 5 5 5 5 5
45 46 47 48 49 5 50 51 52 53 54 55 56 57 58 59 6 60 61 62
5 5 5 5 5 5 5 5 5 5 5 5 5 5 5 5 5 5 5 5
63 64 65 66 67 68 69 7 70 71 72 73 74 75 76 77 78 79 8 80
5 5 5 5 5 5 5 5 5 5 5 5 5 5 5 5 5 5 5 5
81 82 83 84 85 86 87 88 89 9 90 91 92 93 94 95 96 97 98 99
5 5 5 5 5 5 5 5 5 5 5 5 5 5 5 5 5 5 5 5

> anyDuplicated(res$unexposed)
1 0


  [1]: https://i.stack.imgur.com/HUvI0.png
  [2]: https://i.stack.imgur.com/60wE3.png
</details>
# 答案2
**得分**: 2
以下是已翻译的代码部分：
```R
Working on vectors is much faster:
recursive_sample3 <- function(data, n) {
  groups <- unique(data[["exposed"]])
  # working on vectors is faster
  id <- 1:nrow(data)
  i <- vector('integer')
  unexposed2 <- vector(class(data$unexposed))
  ex <- data$exposed
  ux <- data$unexposed
  
  for (group in groups) {
    f1 <- ex == group # first filter
    f2 <- !ux[f1] %in% unexposed2 # 2nd filter (only on those that match 1st)
    id3 <- id[f1][f2][!duplicated(ux[f1][f2])] # check duplicates only on needed
    # and select necesary row ids
    is <- sample(id3, size = min(length(id3), n)) # sample row ids
    i <- c(i, is) # add to list
    unexposed2 <- ux[i] # resave unexposed2
  }
  out <- data[i, ] # only one data.frame subset
  out$id <- NULL
  out
}
benchmarks:
microbenchmark(f1 = recursive_sample(df, 5),
               f2 = recursive_sample2(df, 5),
               f3 = recursive_sample3(df, 5),
               times = 3)
# Unit: milliseconds
# expr       min        lq        mean    median         uq       max neval cld
#   f1 1399.8988 1407.1939 1422.008133 1414.4889 1433.06280 1451.6367     3 a  
#   f2  667.0813  673.7229  678.106400  680.3645  683.61895  686.8734     3  b 
#   f3    6.2399    6.2625    9.531267    6.2851   11.17695   16.0688     3   c
Iterating on `recursive_sample3` & incorporating concerns of `sample`:
f_minem <- function(data, n) {
  i <- vector('integer')
  unexposed2 <- vector(class(data$unexposed))
  ux <- data$unexposed
  exl <- split(1:nrow(data), data$exposed)
  for (ii in exl) {
    f2 <- !ux[ii] %in% unexposed2
    f12 <- ii[f2]
    dn <- !duplicated(ux[f12])
    id3 <- f12[dn]
    is <- id3[sample.int(min(length(id3), n))]
    i <- c(i, is)
    unexposed2 <- ux[i]
  }
  out <- data[i, ]
  out
}
benchmarks nr2:
microbenchmark::microbenchmark(
  recursive_sample3 = recursive_sample3(df, 5L),
  recursive_sample4 = recursive_sample4(setDT(df), 5L),
  f_minem = f_minem(df, 5L),
  setup = {df <- copy(data)}
  , times = 10
)
# Unit: milliseconds
#              expr    min     lq    mean  median      uq     max neval cld
# recursive_sample3 6.2102 6.2974 9.63296 6.43245 16.3367 17.0746    10  a 
# recursive_sample4 3.5145 3.6249 3.67077 3.67075  3.7513  3.7970    10   b
#           f_minem 2.1705 2.1920 2.27510 2.23215  2.3784  2.4585    10   b

希望这对你有所帮助！如果需要进一步的翻译，请告诉我。

英文:

Working on vectors is much faster:

recursive_sample3 &lt;- function(data, n) {
  groups &lt;- unique(data[[&quot;exposed&quot;]])
  # working on vectors is faster
  id &lt;- 1:nrow(data)
  i &lt;- vector(&#39;integer&#39;)
  unexposed2 &lt;- vector(class(data$unexposed))
  ex &lt;- data$exposed
  ux &lt;- data$unexposed
  
  for (group in groups) {
    f1 &lt;- ex == group # first filter
    f2 &lt;- !ux[f1] %in% unexposed2 # 2nd filter (only on those that match 1st)
    id3 &lt;- id[f1][f2][!duplicated(ux[f1][f2])] # check duplicates only on needed
    # and select necesary row ids
    is &lt;- sample(id3, size = min(length(id3), n)) # sample row ids
    i &lt;- c(i, is) # add to list
    unexposed2 &lt;- ux[i] # resave unexposed2
  }
  out &lt;- data[i, ] # only one data.frame subset
  out$id &lt;- NULL
  out
}

benchmarks:

microbenchmark(f1 = recursive_sample(df, 5),
               f2 = recursive_sample2(df, 5),
               f3 = recursive_sample3(df, 5),
               times = 3)
# Unit: milliseconds
# expr       min        lq        mean    median         uq       max neval cld
#   f1 1399.8988 1407.1939 1422.008133 1414.4889 1433.06280 1451.6367     3 a  
#   f2  667.0813  673.7229  678.106400  680.3645  683.61895  686.8734     3  b 
#   f3    6.2399    6.2625    9.531267    6.2851   11.17695   16.0688     3   c

Iterating on recursive_sample3 & incorporating concerns of sample:

f_minem &lt;- function(data, n) {
  i &lt;- vector(&#39;integer&#39;)
  unexposed2 &lt;- vector(class(data$unexposed))
  ux &lt;- data$unexposed
  exl &lt;- split(1:nrow(data), data$exposed)
  for (ii in exl) {
    f2 &lt;- !ux[ii] %in% unexposed2
    f12 &lt;- ii[f2]
    dn &lt;- !duplicated(ux[f12])
    id3 &lt;- f12[dn]
    is &lt;- id3[sample.int(min(length(id3), n))]
    i &lt;- c(i, is)
    unexposed2 &lt;- ux[i]
  }
  out &lt;- data[i, ]
  out
}

benchmarks nr2:

microbenchmark::microbenchmark(
  recursive_sample3 = recursive_sample3(df, 5L),
  recursive_sample4 = recursive_sample4(setDT(df), 5L),
  f_minem = f_minem(df, 5L),
  setup = {df &lt;- copy(data)}
  , times = 10
)
# Unit: milliseconds
#              expr    min     lq    mean  median      uq     max neval cld
# recursive_sample3 6.2102 6.2974 9.63296 6.43245 16.3367 17.0746    10  a 
# recursive_sample4 3.5145 3.6249 3.67077 3.67075  3.7513  3.7970    10   b
#           f_minem 2.1705 2.1920 2.27510 2.23215  2.3784  2.4585    10   b

答案3

得分: 2

以下是您要翻译的代码部分：

A `data.table` solution that keeps a running list of sampled values that are used in `setdiff` (or %!in% from `collapse`):
    library(data.table)
    library(collapse) # for %!in%
    recursive_sample4 <- function(data, n) {
      sampled <- vector("list", uniqueN(data$exposed))
      data[
        ,.(
          unexposed = {
            x <- setdiff(unexposed, unlist(sampled))
            sampled[[.GRP]] <- x[sample.int(min(length(x), n))]
          }
        ), exposed
      ]
    }
    recursive_sample5 <- function(data, n) {
      sampled <- vector("list", uniqueN(data$exposed))
      data[
        ,.(
          unexposed = {
            x <- unexposed[unexposed %!in% unlist(sampled)]
            sampled[[.GRP]] <- x[sample.int(min(length(x), n))]
          }
        ), exposed
      ]
    }
Timing (including `recursive_sample3` by @minem):
    data <- copy(df)
    
    microbenchmark::microbenchmark(
      recursive_sample2 = recursive_sample2(df, 5L),
      recursive_sample3 = recursive_sample3(df, 5L),
      recursive_sample4 = recursive_sample4(setDT(df), 5L),
      recursive_sample5 = recursive_sample5(setDT(df), 5L),
      setup = {df <- copy(data)}
    )
    #> Unit: milliseconds
    #>               expr      min        lq       mean    median        uq      max neval
    #>  recursive_sample2 416.5425 427.38700 452.520780 436.58280 459.79430 614.6392   100
    #>  recursive_sample3   4.5211   5.16330   6.765060   5.79820   6.95425  14.0693   100
    #>  recursive_sample4   3.2038   3.57650   4.676284   4.41120   4.90855  11.6975   100
    #>  recursive_sample5   2.2327   2.58255   3.384131   3.27405   3.93265   8.7091   100
Note that `recursive_sample3` can give erroneous results due to the behavior of `sample` when the first argument is of length 1:
    set.seed(123)
    df <- tibble(exposed = rep(1:100, each = 100),
                 unexposed = sample(1:700, 10000, replace = TRUE))
    nrow(recursive_sample3(df, 10L))
    #> [1] 704

英文:

A data.table solution that keeps a running list of sampled values that are used in setdiff (or %!in% from collapse):

library(data.table)
library(collapse) # for %!in%
recursive_sample4 &lt;- function(data, n) {
  sampled &lt;- vector(&quot;list&quot;, uniqueN(data$exposed))
  data[
    ,.(
      unexposed = {
        x &lt;- setdiff(unexposed, unlist(sampled))
        sampled[[.GRP]] &lt;- x[sample.int(min(length(x), n))]
      }
    ), exposed
  ]
}
recursive_sample5 &lt;- function(data, n) {
  sampled &lt;- vector(&quot;list&quot;, uniqueN(data$exposed))
  data[
    ,.(
      unexposed = {
        x &lt;- unexposed[unexposed %!in% unlist(sampled)]
        sampled[[.GRP]] &lt;- x[sample.int(min(length(x), n))]
      }
    ), exposed
  ]
}

Timing (including recursive_sample3 by @minem):

data &lt;- copy(df)
microbenchmark::microbenchmark(
  recursive_sample2 = recursive_sample2(df, 5L),
  recursive_sample3 = recursive_sample3(df, 5L),
  recursive_sample4 = recursive_sample4(setDT(df), 5L),
  recursive_sample5 = recursive_sample5(setDT(df), 5L),
  setup = {df &lt;- copy(data)}
)
#&gt; Unit: milliseconds
#&gt;               expr      min        lq       mean    median        uq      max neval
#&gt;  recursive_sample2 416.5425 427.38700 452.520780 436.58280 459.79430 614.6392   100
#&gt;  recursive_sample3   4.5211   5.16330   6.765060   5.79820   6.95425  14.0693   100
#&gt;  recursive_sample4   3.2038   3.57650   4.676284   4.41120   4.90855  11.6975   100
#&gt;  recursive_sample5   2.2327   2.58255   3.384131   3.27405   3.93265   8.7091   100

Note that recursive_sample3 can give erroneous results due to the behavior of sample when the first argument is of length 1:

set.seed(123)
df &lt;- tibble(exposed = rep(1:100, each = 100),
             unexposed = sample(1:700, 10000, replace = TRUE))
nrow(recursive_sample3(df, 10L))
#&gt; [1] 704

通过集体智慧和协作来改善编程学习和解决问题的方式。致力于成为全球开发者共同参与的知识库，让每个人都能够通过互相帮助和分享经验来进步。

提升递归抽样函数的性能

问题

答案1

先前的朴素方法

答案3

在Go语言中，可变参数函数会导致不必要的堆分配。

如何分组落在一定数值范围内的变量

在R中编写JSON标量。

R：自动化解决数据框中的许多方程组

如何在Playwright视觉比较中屏蔽多个定位器？

在C++中，可以使用可变模板参数来检索类型的内部类型。

selenium.common.exceptions.StaleElementReferenceException: Message: stale element reference: stale element not found

Creating and opening a URL to log in to Website via Basic Auth with Robot Framework/Selenium (Python)

AG Grid 在上下文菜单中以大文本形式打开

What's the correct way to type hint an empty list as a literal in python?

如何在Highcharts Gantt中更改本地化的星期名称

如何在同一个流中使用多个过滤器和映射函数？

如何使用Map/Set来将代码优化到O(n)？

.NET MAUI Android在GitHub Actions上构建失败，错误代码为1。