2023年7月4日 22:08:15go评论97阅读模式

英文:

Manipulating Single Values in R to Column values

问题

我已经从Excel导入了一些由他人提供的数据。它相当混乱，所以我正在尝试整理它以进行分析，但代码的格式使这变得困难。以下是数据的最小示例：

Contraption 1
Attempt 1
#s AX AY AZ Distance
3->6 162.4 17.3 175.6 9.1
3->11 67.3 5.3 67.3 32.7
Contraption 1
Attempt 2
#s AX AY AZ Distance
3->6 162.4 17.3 175.6 9.1
3->11 67.3 5.3 67.3 32.7
Contraption 2
Attempt 1
#s AX AY AZ Distance
3->6 162.4 17.3 175.6 9.1
3->11 67.3 5.3 67.3 32.7


我想要的是：

#s AX AY AZ Distance Contraption Attempt
3->6 162.4 17.3 175.6 9.1 1 1
3->11 67.3 5.3 67.3 32.7 1 1
3->6 162.4 17.3 175.6 9.1 1 2
3->11 67.3 5.3 67.3 32.7 1 2
3->6 162.4 17.3 175.6 9.1 2 1
3->11 67.3 5.3 67.3 32.7 2 1


我可以显然根据行号创建一个列表，然后将值重新分配到新列，但我想尝试找到一种可以用新数据重复的方法，因为我有相当多的数据并且可能会有更新。
另外，最好是一个dplyr解决方案。
提前感谢！

这是一个稍微修改过的dput：

structure(list( #s = c("GROUND TRUTH", " #s", "3->6",
"3->11", "3->14", "6->11", "6->14", "11->14", "Contraption 1",
"Attempt 1", " #s", "3->6", "3->11",
"3->14", "6->11", "6->14", "11->14", "Attempt 2",
" #s", "3->6"), AX = c(NA, "AX", "162.4435258", "67.325616600000004",
"97.847449400000002", "115.628574", "76.795228199999997", "164.19540980000002",
NA, NA, "AX", "17.729424000000002", "45.376750999999999", "20.891473000000001",
"50.795566999999998", "7.6219530000000004", "58.288466999999997",
NA, "AX", "160.67087599999999"), AY = c(NA, "AY", "17.371937600000003",
"5.3626663999999993", "17.839726800000001", "12.1478062", "5.5127848000000004",
"13.4053042", NA, NA, "AY", "17.269538000000001", "5.4522810000000002",
"17.783394999999999", "12.026367", "6.0820509999999999", "13.563755",
NA, "AY", "17.594677000000001"), AZ = c(NA, "AZ", "175.6576848",
"67.382918000000004", "99.333905000000001", "116.01154019999998",
"76.799018200000006", "162.97033279999999", NA, NA, "AZ", "6.0498320000000003",
"45.120047999999997", "15.705375999999999", "50.471809999999998",
"9.6571890000000007", "59.333387999999999", NA, "AZ", "167.606852"
), DX = c(NA, "DX", "9.1008000000000013", "32.729599999999998",
"47.664360000000002", "23.628800000000002", "38.563539999999996",
"14.93474", NA, NA, "DX", "-11.773400000000001", "-35.472099999999998",
"-48.059600000000003", "-23.698699999999999", "-36.286200000000001",
"-12.587

英文:

I have imported some data provided by someone else from Excel. It's pretty messy, so I'm trying to get it into shape for analysis, but the format of the code is making it difficult. Here is a minimal example of the data as it is:

Contraption 1
Attempt 1							
#s	    AX	        AY	    AZ	    Distance
3-&gt;6	162.4       17.3	175.6	9.1
3-&gt;11	67.3	    5.3     67.3	32.7
Contraption 1
Attempt 2						
#s	    AX	        AY	    AZ	    Distance
3-&gt;6	162.4       17.3	175.6	9.1
3-&gt;11	67.3	    5.3     67.3	32.7
Contraption 2
Attempt 1							
#s	    AX	        AY	    AZ	    Distance
3-&gt;6	162.4       17.3	175.6	9.1
3-&gt;11	67.3	    5.3     67.3	32.7

What I'd like to get is:

#s	    AX	        AY	    AZ	    Distance    Contraption    Attempt
3-&gt;6	162.4       17.3	175.6	9.1         1              1
3-&gt;11	67.3	    5.3     67.3	32.7        1              1
3-&gt;6	162.4       17.3	175.6	9.1         1              2
3-&gt;11	67.3	    5.3     67.3	32.7        1              2
3-&gt;6	162.4       17.3	175.6	9.1         2              1
3-&gt;11	67.3	    5.3     67.3	32.7        2              1

I can obviously just go by row numbers and create a list and then reassign the values to a new column, but I'd like to try to find a way for it to be repeatable with new data, as I've got quite a lot of it and will probably get updates.

Also, ideally, a dplyr solution is preferred.

Thanks in advance!

EDIT: Here is a slightly modified dput:

structure(list(` #s` = c(&quot;GROUND TRUTH&quot;, &quot; #s&quot;, &quot;3-&gt;6&quot;,
                              &quot;3-&gt;11&quot;, &quot;3-&gt;14&quot;, &quot;6-&gt;11&quot;, &quot;6-&gt;14&quot;, &quot;11-&gt;14&quot;, &quot;Contraption 1&quot;,
                              &quot;Attempt 1&quot;, &quot; #s&quot;, &quot;3-&gt;6&quot;, &quot;3-&gt;11&quot;,
                              &quot;3-&gt;14&quot;, &quot;6-&gt;11&quot;, &quot;6-&gt;14&quot;, &quot;11-&gt;14&quot;, &quot;Attempt 2&quot;,
                              &quot; #s&quot;, &quot;3-&gt;6&quot;), AX = c(NA, &quot;AX&quot;, &quot;162.4435258&quot;, &quot;67.325616600000004&quot;,
                                                          &quot;97.847449400000002&quot;, &quot;115.628574&quot;, &quot;76.795228199999997&quot;, &quot;164.19540980000002&quot;,
                                                          NA, NA, &quot;AX&quot;, &quot;17.729424000000002&quot;, &quot;45.376750999999999&quot;, &quot;20.891473000000001&quot;,
                                                          &quot;50.795566999999998&quot;, &quot;7.6219530000000004&quot;, &quot;58.288466999999997&quot;,
                                                          NA, &quot;AX&quot;, &quot;160.67087599999999&quot;), AY = c(NA, &quot;AY&quot;, &quot;17.371937600000003&quot;,
                                                                                                  &quot;5.3626663999999993&quot;, &quot;17.839726800000001&quot;, &quot;12.1478062&quot;, &quot;5.5127848000000004&quot;,
                                                                                                  &quot;13.4053042&quot;, NA, NA, &quot;AY&quot;, &quot;17.269538000000001&quot;, &quot;5.4522810000000002&quot;,
                                                                                                  &quot;17.783394999999999&quot;, &quot;12.026367&quot;, &quot;6.0820509999999999&quot;, &quot;13.563755&quot;,
                                                                                                  NA, &quot;AY&quot;, &quot;17.594677000000001&quot;), AZ = c(NA, &quot;AZ&quot;, &quot;175.6576848&quot;,
                                                                                                                                          &quot;67.382918000000004&quot;, &quot;99.333905000000001&quot;, &quot;116.01154019999998&quot;,
                                                                                                                                          &quot;76.799018200000006&quot;, &quot;162.97033279999999&quot;, NA, NA, &quot;AZ&quot;, &quot;6.0498320000000003&quot;,
                                                                                                                                          &quot;45.120047999999997&quot;, &quot;15.705375999999999&quot;, &quot;50.471809999999998&quot;,
                                                                                                                                          &quot;9.6571890000000007&quot;, &quot;59.333387999999999&quot;, NA, &quot;AZ&quot;, &quot;167.606852&quot;
), DX = c(NA, &quot;DX&quot;, &quot;9.1008000000000013&quot;, &quot;32.729599999999998&quot;,
          &quot;47.664360000000002&quot;, &quot;23.628800000000002&quot;, &quot;38.563539999999996&quot;,
          &quot;14.93474&quot;, NA, NA, &quot;DX&quot;, &quot;-11.773400000000001&quot;, &quot;-35.472099999999998&quot;,
          &quot;-48.059600000000003&quot;, &quot;-23.698699999999999&quot;, &quot;-36.286200000000001&quot;,
          &quot;-12.5875&quot;, NA, &quot;DX&quot;, &quot;-11.7559&quot;), DY = c(NA, &quot;DY&quot;, &quot;-1.2008800000000002&quot;,
                                                    &quot;-0.66572000000000009&quot;, &quot;1.0446199999999999&quot;, &quot;0.53513999999999995&quot;,
                                                    &quot;2.2454800000000001&quot;, &quot;1.71034&quot;, NA, NA, &quot;DY&quot;, &quot;-19.358799999999999&quot;,
                                                    &quot;-18.635200000000001&quot;, &quot;0.52759999999999996&quot;, &quot;0.72360000000000002&quot;,
                                                    &quot;19.886500000000002&quot;, &quot;19.1629&quot;, NA, &quot;DY&quot;, &quot;-19.334599999999998&quot;
), DZ = c(NA, &quot;DZ&quot;, &quot;-20.673919999999999&quot;, &quot;-23.138440000000003&quot;,
          &quot;-5.8264999999999993&quot;, &quot;-2.4645199999999998&quot;, &quot;14.847440000000001&quot;,
          &quot;17.31194&quot;, NA, NA, &quot;DZ&quot;, &quot;1.09E-2&quot;, &quot;-1.9400000000000001E-2&quot;,
          &quot;-1.24E-2&quot;, &quot;-3.04E-2&quot;, &quot;-2.3400000000000001E-2&quot;, &quot;7.0000000000000001E-3&quot;,
          NA, &quot;DZ&quot;, &quot;-9.7000000000000003E-3&quot;), Distance = c(NA, &quot;Distance&quot;,
                                                            &quot;22.6203&quot;, &quot;40.088119999999989&quot;, &quot;48.030499999999996&quot;, &quot;23.762999999999998&quot;,
                                                            &quot;41.383979999999994&quot;, &quot;22.927579999999999&quot;, NA, NA, &quot;Distance&quot;,
                                                            &quot;22.657900000000001&quot;, &quot;40.069200000000002&quot;, &quot;48.0625&quot;, &quot;23.709700000000002&quot;,
                                                            &quot;41.3782&quot;, &quot;22.927299999999999&quot;, NA, &quot;Distance&quot;, &quot;22.628&quot;)), row.names = c(NA,
                                                                                                                                       -20L), class = c(&quot;tbl_df&quot;, &quot;tbl&quot;, &quot;data.frame&quot;))

答案1

得分: 1

以下是代码的翻译部分：

# 我不得不添加NAs，使read.table()能够识别它
df <- read.table(text="
Contraption 1 NA NA NA
Attempt 1 NA NA NA                     
#s      AX          AY      AZ      Distance
3->6    162.4       17.3    175.6   9.1
3->11   67.3        5.3     67.3    32.7
Contraption 1 NA NA NA
Attempt 2 NA NA NA                
#s      AX          AY      AZ      Distance
3->6    162.4       17.3    175.6   9.1
3->11   67.3        5.3     67.3    32.7
Contraption 2 NA NA NA
Attempt 1 NA NA NA                          
#s      AX          AY      AZ      Distance
3->6    162.4       17.3    175.6   9.1
3->11   67.3        5.3     67.3    32.7", header=F)
df %>%
  mutate(Contraption = ifelse(V1 == "Contraption", V2, NA),
          Attempt = ifelse(V1 == "Attempt", V2, NA),
          Group = rep(1:3, each=4)) %>%
          group_by(Group) %>%
          mutate(Contraption = Contraption[1],
                 Attempt = Attempt[2]) %>%
          filter(!is.na(V3)) %>%
          ungroup() %>%
          select("#s" = V1, AX = V2, AY = V3, AZ = V4, Distance = V5, Contraption, Attempt)
# 一个数据表: 6 × 7
  `#s`     AX    AY    AZ Distance Contraption Attempt
  <chr> <dbl> <dbl> <dbl>    <dbl>       <dbl>   <dbl>
1 3->6  162.   17.3 176.       9.1           1       1
2 3->11  67.3   5.3  67.3     32.7           1       1
3 3->6  162.   17.3 176.       9.1           1       2
4 3->11  67.3   5.3  67.3     32.7           1       2
5 3->6  162.   17.3 176.       9.1           2       1
6 3->11  67.3   5.3  67.3     32.7           2       1

如果有其他问题，请随时告诉我。

英文:

# I had to add NAs into it to make read.table() recognise it
df &lt;- read.table(text=&quot;
Contraption 1 NA NA NA
Attempt 1 NA NA NA                     
#s      AX          AY      AZ      Distance
3-&gt;6    162.4       17.3    175.6   9.1
3-&gt;11   67.3        5.3     67.3    32.7
Contraption 1 NA NA NA
Attempt 2 NA NA NA                
#s      AX          AY      AZ      Distance
3-&gt;6    162.4       17.3    175.6   9.1
3-&gt;11   67.3        5.3     67.3    32.7
Contraption 2 NA NA NA
Attempt 1 NA NA NA                          
#s      AX          AY      AZ      Distance
3-&gt;6    162.4       17.3    175.6   9.1
3-&gt;11   67.3        5.3     67.3    32.7&quot;, header=F)
df %&gt;%
mutate(Contraption = ifelse(V1 == &quot;Contraption&quot;, V2, NA),
Attempt = ifelse(V1 == &quot;Attempt&quot;, V2, NA),
Group = rep(1:3, each=4)) %&gt;%
group_by(Group) %&gt;%
mutate(Contraption = Contraption[1],
Attempt = Attempt[2]) %&gt;%
filter(!is.na(V3)) %&gt;%
ungroup() %&gt;%
select(&quot;#s&quot; = V1, AX = V2, AY = V3, AZ = V4, Distance = V5, Contraption, Attempt)
# A tibble: 6 &#215; 7
`#s`     AX    AY    AZ Distance Contraption Attempt
&lt;chr&gt; &lt;dbl&gt; &lt;dbl&gt; &lt;dbl&gt;    &lt;dbl&gt;       &lt;dbl&gt;   &lt;dbl&gt;
1 3-&gt;6  162.   17.3 176.       9.1           1       1
2 3-&gt;11  67.3   5.3  67.3     32.7           1       1
3 3-&gt;6  162.   17.3 176.       9.1           1       2
4 3-&gt;11  67.3   5.3  67.3     32.7           1       2
5 3-&gt;6  162.   17.3 176.       9.1           2       1
6 3-&gt;11  67.3   5.3  67.3     32.7           2       1

Let me know if you have any questions/thoughts/things that could be improved!

通过集体智慧和协作来改善编程学习和解决问题的方式。致力于成为全球开发者共同参与的知识库，让每个人都能够通过互相帮助和分享经验来进步。

Manipulating Single Values in R to Column values

问题

答案1

“在设置显式唯一行名称后，设置‘row.names’时出现‘非唯一值’错误。”

如何显示具有不同列’b’值的列’a’的重复项

从单元格中移除具有多个出现的字符串部分

从列表中提取对象的属性并将它们写入数据框中。

如何在Playwright视觉比较中屏蔽多个定位器？

在C++中，可以使用可变模板参数来检索类型的内部类型。

selenium.common.exceptions.StaleElementReferenceException: Message: stale element reference: stale element not found

Creating and opening a URL to log in to Website via Basic Auth with Robot Framework/Selenium (Python)

AG Grid 在上下文菜单中以大文本形式打开

What's the correct way to type hint an empty list as a literal in python?

如何在Highcharts Gantt中更改本地化的星期名称

如何在同一个流中使用多个过滤器和映射函数？

如何使用Map/Set来将代码优化到O(n)？

.NET MAUI Android在GitHub Actions上构建失败，错误代码为1。