Question

I have a DataFrame with one JSON column (Vals):

                            Identity                                                                                                      Vals
                  2fc9d38d-0fe4-c7be       {"$id":"2","Address":"22.44","Location":{"Code":"TN"},"Asset":false,"Roles":["A"],"Type":"webfile"}
                        abd77d57ac29 {"$id":"3","Address":"40.1","Location":{"Code":"SS"},"Asset":false,"Roles":["Attacker"],"Type":"webfile"}
                           c7be-4a37                  {"$id":"4","AppId":11161,"SaasId":11161,"Name":"Office 365","InstanceId":0,"Type":"app"}
              916a-8051-8fd1721385ae                              {"$id":"3","Address":"213.85","Asset":false,"Roles":["tm"],"Type":"webfile"}
                   8051-8fd1721385ae                     {"$id":"4","Address":"198.137","Asset":false,"Roles":["Contextual"],"Type":"webfile"}
                        8fd1721385ae                             {"$id":"5","AppId":26324,"sId":26324,"Name":"MB","InstanceId":0,"Type":"app"}
                       58a51721385ae                      {"$id":"6","Address":".225.0","Asset":false,"Roles":["Contextual"],"Type":"webfile"}
964fb17e-a352-dbd4-d5b7-374172d811aa                                   {"$id":"2","Name":"AD561-SA","DisplayName":"AD561-SA","Type":"account"}

I want to create a new column that holds the "Type" value of each row ("webfile", "app", etc.). I ran the code below:

df_test["new_col"] = df_test['Vals'].apply(lambda x: x['Type'] if 'Type' in x else None)

But I get this error:

TypeError: list indices must be integers or slices, not str

Can someone help?

Answer 1

Score: 3

Because your Vals column contains JSON strings, you have to decode them before extracting the Type field:

import json
import pandas as pd

# Parse each JSON string into a dict, flatten the dicts into columns, then keep only Type
df_test['Type'] = pd.json_normalize(df_test['Vals'].apply(json.loads))['Type']

Output:

>>> df_test[['Identity', 'Type']]
                               Identity     Type
0                    2fc9d38d-0fe4-c7be  webfile
1                          abd77d57ac29  webfile
2                             c7be-4a37      app
3                916a-8051-8fd1721385ae  webfile
4                     8051-8fd1721385ae  webfile
5                          8fd1721385ae      app
6                         58a51721385ae  webfile
7  964fb17e-a352-dbd4-d5b7-374172d811aa  account
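
The one-liner above assumes every Vals cell is a JSON string with a Type key. The error in the question mentions list indices, which suggests (this is a guess, not something the question confirms) that at least one cell may hold a list instead. The sketch below is a more defensive variant under that assumption: the sample frame, the third row, and the helper name extract_type are all hypothetical, and taking the first element of a list is an illustrative choice rather than a known property of the data.

```python
import json

import pandas as pd

# Hypothetical sample data modeled on the question's Vals column.
# The third row is invented to show a list-valued cell without a Type key.
df_test = pd.DataFrame(
    {
        "Identity": ["2fc9d38d-0fe4-c7be", "c7be-4a37", "no-type-row"],
        "Vals": [
            '{"$id":"2","Address":"22.44","Location":{"Code":"TN"},'
            '"Asset":false,"Roles":["A"],"Type":"webfile"}',
            '{"$id":"4","AppId":11161,"SaasId":11161,"Name":"Office 365",'
            '"InstanceId":0,"Type":"app"}',
            '[{"$id":"9","Name":"wrapped"}]',
        ],
    }
)


def extract_type(raw):
    """Return the 'Type' value of one Vals cell, or None if it is absent."""
    # Decode JSON text; leave already-parsed objects untouched.
    value = json.loads(raw) if isinstance(raw, str) else raw
    # Assumption: a list-valued cell wraps the record of interest
    # as its first element.
    if isinstance(value, list):
        value = value[0] if value else None
    # Anything that is not a dict (None, numbers, ...) has no Type.
    return value.get("Type") if isinstance(value, dict) else None


df_test["Type"] = df_test["Vals"].apply(extract_type)
print(df_test[["Identity", "Type"]])
```

Rows without a usable Type end up as missing values instead of raising, so they can be inspected afterwards with df_test[df_test["Type"].isna()].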
