February 16, 2023, 03:22:41

In R, change the values of some items in a matrix without causing a copy of the entire matrix?

Question

I have a "small" square matrix that I want to add to a "big" matrix. The big matrix contains all the rows and columns of the small matrix plus extras. I want to add the values where the indices are in common and keep the big matrix's values wherever an index is not present in the small one. Unfortunately, all the data is copied on the addition, so it takes a long time and can temporarily spike memory when the matrices are large.

I have tried adding subsets using matrices and data.frames, as well as a data.table method using rbindlist. Both the data.frame and matrix methods seem to cause a memory copy (why?), and the rbindlist method is not ideal because it requires a melt and a dcast, which temporarily spikes memory by inflating the number of rows.

Is there any way to just change the values of some items in a matrix without causing a copy of the entire matrix?

Here are my attempts:


library(data.table)  # setnames(), rbindlist() and dcast() below come from data.table

# M1 and M2 are square numeric matrices with row and column names (not shown here)
MList <- list(M1, M2)
unionCols <- Reduce(union, lapply(MList, colnames))
MTotal <- matrix(as.double(rep(0, (length(unionCols))^2)), nrow = length(unionCols))
rownames(MTotal) <- colnames(MTotal) <- unionCols
DFTotal <- as.data.frame(MTotal)
DFList <- lapply(MList, as.data.frame)
for (i in seq_along(MList)) {
  tracemem(MTotal)
  tracemem(DFTotal)
  mCol <- match(colnames(MList[[i]]), colnames(MTotal))
  MTotal[mCol, mCol] <- MTotal[mCol, mCol] + MList[[i]] # this causes a copy
  DFTotal[mCol, mCol] <- DFTotal[mCol, mCol] + DFList[[i]] # this causes a copy
}
M1
M2
MTotal
# rbindlist method
.AggDMCMatsSingleM2 <- function(M1, M2) {
  # Melt a matrix to long format with columns row, col, value
  .MyMelt <- function(M) {
    setnames(reshape2::melt(M, id.vars = colnames(M)), c('Var1', 'Var2'), c('row', 'col'))
  }
  # Stack both long tables, then sum duplicates while casting back to wide
  M_total <- as.matrix(data.table::dcast(rbindlist(lapply(list(M1, M2), .MyMelt)),
                                         formula = row ~ col,
                                         value.var = 'value',
                                         fun.aggregate = sum,
                                         fill = 0),
                       rownames = 'row')
  return(M_total)
}
M1
M2
.AggDMCMatsSingleM2(M1, M2)
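
The attempts above leave M1 and M2 undefined, so here is a minimal base-R sketch of the matrix version with two small hypothetical inputs (the values and the names "a" to "d" are assumptions for illustration, not from the original post). It calls tracemem() only when the R build supports memory profiling. Whether a copy is reported can depend on the R version and on whether anything else holds a reference to MTotal, so no tracemem output is shown.

```r
# Hypothetical inputs: small named square matrices (not from the original post)
M1 <- matrix(as.double(1:4), nrow = 2,
             dimnames = list(c("a", "b"), c("a", "b")))
M2 <- matrix(as.double(1:9), nrow = 3,
             dimnames = list(c("b", "c", "d"), c("b", "c", "d")))

MList <- list(M1, M2)
unionCols <- Reduce(union, lapply(MList, colnames))

# Zero-filled double matrix covering the union of all names
MTotal <- matrix(0, nrow = length(unionCols), ncol = length(unionCols),
                 dimnames = list(unionCols, unionCols))

# tracemem() needs an R build with memory profiling; skip it otherwise
can_trace <- capabilities("profmem")
if (can_trace) invisible(tracemem(MTotal))

for (i in seq_along(MList)) {
  # Positions of this matrix's names inside the total matrix
  mCol <- match(colnames(MList[[i]]), colnames(MTotal))
  MTotal[mCol, mCol] <- MTotal[mCol, mCol] + MList[[i]]
}

if (can_trace) untracemem(MTotal)
MTotal
```

Any line starting with `tracemem[` that appears while the loop runs indicates that R duplicated the traced matrix at that step.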

Answer 1

Score: 1

If I follow what you are asking, we can add to the big matrix and write the result straight back into it by indexing with the row and column names of the small matrix:

big_matrix <- matrix(data = rep(1, 25), nrow = 5,
                     dimnames = list(c(LETTERS[1:5]),
                                     c(letters[1:5])))
#  a b c d e
#A 1 1 1 1 1
#B 1 1 1 1 1
#C 1 1 1 1 1
#D 1 1 1 1 1
#E 1 1 1 1 1
small_matrix <- matrix(data = c(1:9), nrow = 3,
                       dimnames = list(c(LETTERS[2:4]),
                                       c(letters[2:4])))
#  b c d
#B 1 4 7
#C 2 5 8
#D 3 6 9
# Add the small matrix to the matching cells of the big matrix
big_matrix[rownames(small_matrix), colnames(small_matrix)] <-
  big_matrix[rownames(small_matrix), colnames(small_matrix)] + small_matrix
big_matrix
#  a b c  d e
#A 1 1 1  1 1
#B 1 2 5  8 1
#C 1 3 6  9 1
#D 1 4 7 10 1
#E 1 1 1  1 1

More complex test:

big_matrix <- matrix(data = rep(1, 25), nrow = 5,
                     dimnames = list(c(LETTERS[1:5]),
                                     c(letters[1:5])))
#  a b c d e
#A 1 1 1 1 1
#B 1 1 1 1 1
#C 1 1 1 1 1
#D 1 1 1 1 1
#E 1 1 1 1 1
small_matrix <- matrix(data = c(1:9), nrow = 3,
                       dimnames = list(c("A", "D", "C"),
                                       c(letters[c(2:4)])))
#  b c d
#A 1 4 7
#D 2 5 8
#C 3 6 9
big_matrix[rownames(small_matrix), colnames(small_matrix)] <-
  big_matrix[rownames(small_matrix), colnames(small_matrix)] + small_matrix
big_matrix
#  a b c  d e
#A 1 2 5  8 1
#B 1 1 1  1 1
#C 1 4 7 10 1
#D 1 3 6  9 1
#E 1 1 1  1 1
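
The question indexes with integer positions from match(), while this answer indexes by name. As a rough sketch, the snippet below runs both on the second test case and checks that they agree. The variable names `r`, `k`, `by_position` and `by_name` are made up for the example, and the `stopifnot()` guard is an added assumption about how missing names should be handled. The two working copies exist only so the results can be compared.

```r
big_matrix <- matrix(1, nrow = 5, ncol = 5,
                     dimnames = list(LETTERS[1:5], letters[1:5]))
small_matrix <- matrix(as.double(1:9), nrow = 3,
                       dimnames = list(c("A", "D", "C"), letters[2:4]))

# Guard: every row/column name of the small matrix must exist in the big one
stopifnot(all(rownames(small_matrix) %in% rownames(big_matrix)),
          all(colnames(small_matrix) %in% colnames(big_matrix)))

# Translate names to integer positions once, then reuse them on both sides
r <- match(rownames(small_matrix), rownames(big_matrix))
k <- match(colnames(small_matrix), colnames(big_matrix))

# Position-based update (the question's style), on a working copy
by_position <- big_matrix
by_position[r, k] <- by_position[r, k] + small_matrix

# Name-based update (this answer's style), on a second working copy
by_name <- big_matrix
by_name[rownames(small_matrix), colnames(small_matrix)] <-
  by_name[rownames(small_matrix), colnames(small_matrix)] + small_matrix

# Both approaches should touch the same cells with the same values
identical(by_position, by_name)
```

Computing the positions once avoids repeating the name lookup on both sides of the assignment. Neither form, by itself, says anything about whether R copies the big matrix.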
