February 14, 2023, 06:58:16

Matching observations across two string variables

Question

I have two continuous indicators that are measured at the country level:

GDP per capita
democracy score

I have two string variables that essentially use the same country coding system, such as AFG for Afghanistan. However, I have only 184 observations under the country variable for the GDP data, yet 249 observations under the code variable for the democracy_score data.

I would like to match GDP and democracy score data for observations where the data for both continuous indicators are complete. For instance, the data in the first row below is

"AFG" 2079.9219 "ABW" "0.813"

And I would like to match it with the democracy score data from the third row for observations where the country code is the same, "AFG".

"ALB" 13655.665 "AFG" "0.174"

And the correct data structure would be as follows for AFG:

country gdp_adj democracy_score
"AFG" 2079.9219 "0.174"

Here is a data example:

dataex country gdp_adj code democracy_score

output:

* Example generated by -dataex-. For more info, type help dataex
clear
input str3 country float gdp_adj str3 code str5 democracy_score
"AFG" 2079.9219 "ABW" "0.813"
"AGO"  6602.424 "ADO" "#N/A"
"ALB" 13655.665 "AFG" "0.174"
"ARE"  71782.16 "AIA" "#N/A"
"ARG"  22071.75 "ALB" "0.576"
"ARM" 14317.553 "ANT" "#N/A"
"ATG"  23035.66 "ARE" "0.232"
"AUS"  49379.09 "ARG" "0.632"
"AUT"  55806.44 "ARM" "0.496"
"AZE"  14442.04 "ASM" "#N/A"
"BDI"  729.6584 "ATG" "#N/A"
"BEL"  51977.18 "AUS" "0.861"
"BEN"  3156.439 "AUT" "0.852"
"BFA" 2110.0623 "AZE" "0.200"
"BGD"  5467.208 "BDI" "0.170"
"BGR" 23270.225 "BEL" "0.820"
"BHR"  49768.98 "BEN" "0.473"
"BHS" 35161.832 "BFA" "0.358"
"BIH" 14634.738 "BGD" "0.388"
"BLR" 19279.209 "BGR" "0.602"
"BLZ"  9028.552 "BHR" "0.190"
"BOL"  8528.749 "BHS" "0.688"
"BRA" 14685.128 "BIH" "0.399"
end

Answer 1

Score: 2

You can do it by stacking and reshaping back to wide:

destring democracy_score, replace ignore("#N/A")
stack country gdp_adj code democracy_score, into(country outcome) clear
reshape wide outcome, i(country) j(_stack)
rename (outcome1 outcome2) (gdp_adj democracy_score)
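
The question asks for observations where both indicators are complete, while the reshape above keeps every code that appears in either column. A minimal sketch of the extra filtering step, assuming the commands above have already run so that gdp_adj and democracy_score are both numeric:

```stata
* Keep only countries with both indicators present.
* missing() returns 1 if any of its arguments is missing.
keep if !missing(gdp_adj, democracy_score)

* Optional check: country should uniquely identify each remaining row.
isid country
count
```

isid exits with an error if country does not uniquely identify the rows, so it doubles as a sanity check on the reshape.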

I converted the score from string to double under the assumption that you would want to do some analysis on it. If not, then you can tostring it back.

I also had to tweak the GDP storage to double to avoid some precision issues:

input str3 country double gdp_adj str3 code str5 democracy_score
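
An alternative that avoids stacking is to split the two column pairs into separate datasets and merge them on the country code. This is only a sketch and not the approach used in this answer. It assumes the data are laid out as in the dataex example, that each code appears at most once in each column, and that rows with a blank code (padding where one column is shorter than the other) can be dropped. The tempfile name democracy is arbitrary.

```stata
* Assumes the dataex example (or the full data) is in memory.
destring democracy_score, replace ignore("#N/A")

* Set aside the democracy columns as their own dataset, keyed by country code
preserve
keep code democracy_score
rename code country
drop if missing(country)
tempfile democracy
save `democracy'
restore

* Keep the GDP columns and merge the scores on by country code
keep country gdp_adj
drop if missing(country)
merge 1:1 country using `democracy'

* _merge == 3 marks codes found in both columns;
* also require both indicators to be nonmissing
keep if _merge == 3 & !missing(gdp_adj, democracy_score)
drop _merge
```

Because merge 1:1 refuses to run when country is duplicated on either side, this route fails loudly if a code is repeated. Tabulating _merge before the keep shows how many codes appear in only one of the two sources.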
