2023年6月18日 21:33:30go评论97阅读模式

英文:

Join results of marginaleffects::predictions() back to main df?

问题

我运行了一个lm模型，然后在输出上运行了marginaleffects中的predictions()。我想将这个输出与我输入到lm中的主要数据框连接起来，但我看不到在这种情况下应该使用哪个选项。

有人知道我需要在这里做什么吗？predictions()的输出具有一个rowid，但根据索引连接（可能已更改顺序）似乎是一种冒险的方式。

例如，看下面的代码（来自文档）：

mod <- lm(mpg ~ hp + factor(cyl), data = mtcars)
pred <- predictions(mod)
pred %>% head()
 Estimate Std. Error    z Pr(>|z|) 2.5 % 97.5 %
     20.0      1.204 16.6   <0.001  17.7   22.4
     20.0      1.204 16.6   <0.001  17.7   22.4
     26.4      0.962 27.5   <0.001  24.5   28.3
     20.0      1.204 16.6   <0.001  17.7   22.4
     15.9      0.992 16.0   <0.001  14.0   17.9
     20.2      1.219 16.5   <0.001  17.8   22.5
Columns: rowid, estimate, std.error, statistic, p.value, conf.low, conf.high, mpg, hp, cyl 
> mtcars %>% head()
                   mpg cyl disp  hp drat    wt  qsec vs am gear carb
Mazda RX4         21.0   6  160 110 3.90 2.620 16.46  0  1    4    4
Mazda RX4 Wag     21.0   6  160 110 3.90 2.875 17.02  0  1    4    4
Datsun 710        22.8   4  108  93 3.85 2.320 18.61  1  1    4    1
Hornet 4 Drive    21.4   6  258 110 3.08 3.215 19.44  1  0    3    1
Hornet Sportabout 18.7   8  360 175 3.15 3.440 17.02  0  0    3    2
Valiant           18.1   6  225 105 2.76 3.460 20.22  1  0    3    1

在pred中，每行mtcars都有1个预测，但我如何将它们连接起来？

英文:

I've run a lm model and then run predictions() from marginaleffects on the output. I want to join this back to by main df (that I fed into the lm) but I can't see what option is the right one to use in this case.

Does anyone know what I need to do here? The output of predictions() has a rowid but joining on an index (which may have changed order) seems like a risky way forward.

For example, take the following code (from the documentation):

mod &lt;- lm(mpg ~ hp + factor(cyl), data = mtcars)
pred &lt;- predictions(mod)
pred %&gt;% head()
 Estimate Std. Error    z Pr(&gt;|z|) 2.5 % 97.5 %
     20.0      1.204 16.6   &lt;0.001  17.7   22.4
     20.0      1.204 16.6   &lt;0.001  17.7   22.4
     26.4      0.962 27.5   &lt;0.001  24.5   28.3
     20.0      1.204 16.6   &lt;0.001  17.7   22.4
     15.9      0.992 16.0   &lt;0.001  14.0   17.9
     20.2      1.219 16.5   &lt;0.001  17.8   22.5
Columns: rowid, estimate, std.error, statistic, p.value, conf.low, conf.high, mpg, hp, cyl 
&gt; mtcars %&gt;% head()
                   mpg cyl disp  hp drat    wt  qsec vs am gear carb
Mazda RX4         21.0   6  160 110 3.90 2.620 16.46  0  1    4    4
Mazda RX4 Wag     21.0   6  160 110 3.90 2.875 17.02  0  1    4    4
Datsun 710        22.8   4  108  93 3.85 2.320 18.61  1  1    4    1
Hornet 4 Drive    21.4   6  258 110 3.08 3.215 19.44  1  0    3    1
Hornet Sportabout 18.7   8  360 175 3.15 3.440 17.02  0  0    3    2
Valiant           18.1   6  225 105 2.76 3.460 20.22  1  0    3    1

There is 1 prediction in pred for each row in mtcars but how do I join these?

答案1

得分: 2

Ah, I got it! It seems you just need to pass the original df through to the newdata argument of predictions. E.g.

mod &lt;- lm(mpg ~ hp + factor(cyl), data = mtcars)
pred &lt;- predictions(mod, newdata = mtcars)
pred %&gt;% head()
 估计值 标准误差    z Pr(>|z|) 2.5 % 97.5 % cyl disp  hp drat   wt qsec vs am gear
     20.0      1.204 16.6   &lt;0.001  17.7   22.4   6  160 110 3.90 2.62 16.5  0  1    4
     20.0      1.204 16.6   &lt;0.001  17.7   22.4   6  160 110 3.90 2.88 17.0  0  1    4
     26.4      0.962 27.5   &lt;0.001  24.5   28.3   4  108  93 3.85 2.32 18.6  1  1    4
     20.0      1.204 16.6   &lt;0.001  17.7   22.4   6  258 110 3.08 3.21 19.4  1  0    3
     15.9      0.992 16.0   &lt;0.001  14.0   17.9   8  360 175 3.15 3.44 17.0  0  0    3
     20.2      1.219 16.5   &lt;0.001  17.8   22.5   6  225 105 2.76 3.46 20.2  1  0    3
 carb
    4
    4
    1
    1
    2
    1
Columns: rowid, estimate, std.error, statistic, p.value, conf.low, conf.high, mpg, cyl, disp, hp, drat, wt, qsec, vs, am, gear, carb

英文:

Ah, I got it! It seems you just need to pass the original df through to the newdata argument of predictions. E.g.

mod &lt;- lm(mpg ~ hp + factor(cyl), data = mtcars)
pred &lt;- predictions(mod, newdata = mtcars)
pred %&gt;% head()
 Estimate Std. Error    z Pr(&gt;|z|) 2.5 % 97.5 % cyl disp  hp drat   wt qsec vs am gear
     20.0      1.204 16.6   &lt;0.001  17.7   22.4   6  160 110 3.90 2.62 16.5  0  1    4
     20.0      1.204 16.6   &lt;0.001  17.7   22.4   6  160 110 3.90 2.88 17.0  0  1    4
     26.4      0.962 27.5   &lt;0.001  24.5   28.3   4  108  93 3.85 2.32 18.6  1  1    4
     20.0      1.204 16.6   &lt;0.001  17.7   22.4   6  258 110 3.08 3.21 19.4  1  0    3
     15.9      0.992 16.0   &lt;0.001  14.0   17.9   8  360 175 3.15 3.44 17.0  0  0    3
     20.2      1.219 16.5   &lt;0.001  17.8   22.5   6  225 105 2.76 3.46 20.2  1  0    3
 carb
    4
    4
    1
    1
    2
    1
Columns: rowid, estimate, std.error, statistic, p.value, conf.low, conf.high, mpg, cyl, disp, hp, drat, wt, qsec, vs, am, gear, carb

通过集体智慧和协作来改善编程学习和解决问题的方式。致力于成为全球开发者共同参与的知识库，让每个人都能够通过互相帮助和分享经验来进步。

将marginaleffects::predictions()的结果与主数据框连接起来？

问题

答案1

对于一个RMarkdown报告，泛化变量名称

创建一个从两个数据集中生成的’表格’。

选择在R中的一组组中具有最大值的行如何？

Ordering days of the week (consecutive days) – 排列星期几（连续的天）

如何在Playwright视觉比较中屏蔽多个定位器？

在C++中，可以使用可变模板参数来检索类型的内部类型。

selenium.common.exceptions.StaleElementReferenceException: Message: stale element reference: stale element not found

Creating and opening a URL to log in to Website via Basic Auth with Robot Framework/Selenium (Python)

AG Grid 在上下文菜单中以大文本形式打开

What's the correct way to type hint an empty list as a literal in python?

如何在Highcharts Gantt中更改本地化的星期名称

如何在同一个流中使用多个过滤器和映射函数？

如何使用Map/Set来将代码优化到O(n)？

.NET MAUI Android在GitHub Actions上构建失败，错误代码为1。