2023年7月24日 15:05:22go评论114阅读模式

英文:

Why reaching nested meta gives NaN when normalizing a json with pandas?

问题

以下是您要翻译的内容：

我的输入是一个Python字典（类似JSON）：
d = {
    "type": "type1",
    "details": {
        "name": "foo",
        "date": {
            "timestamp": "01/02/2023 21:42:44",
            "components": {
                "day": 2,
                "month": 1,
                "year": 2023,
                "time": "21:42:44"
            }
        }
    },
    "infos": {
        "records": [
            {
                "field1": "qux",
                "field2": "baz",
            }
        ],
        "class": "P"
    }
}
我使用以下代码：
df = pd.json_normalize(
    d,
    record_path=["infos", "records"],
    meta=[
        "type",
        ["details", "date", "timestamp"],
        ["details", "date", "components", "year"],
        ["infos", "class"]
    ],
    errors="ignore"
)
这给了我以下输出：
field1 field2   type details.date.timestamp details.date.components.year infos.class
0    qux    baz  type1                    NaN                          NaN           P
但我期望得到这个输出：
field1 field2   type details.date.timestamp details.date.components.year infos.class
0    qux    baz  type1    01/02/2023 21:42:44                         2023           P
老实说，我对`meta`参数感到非常困惑！我不知道我做错了什么...
您能解释一下它的逻辑吗？

英文:

My input is a Python dictionnary (json-like) :

d = {
    &quot;type&quot;: &quot;type1&quot;,
    &quot;details&quot;: {
        &quot;name&quot;: &quot;foo&quot;,
        &quot;date&quot;: {
            &quot;timestamp&quot;: &quot;01/02/2023 21:42:44&quot;,
            &quot;components&quot;: {
                &quot;day&quot;: 2,
                &quot;month&quot;: 1,
                &quot;year&quot;: 2023,
                &quot;time&quot;: &quot;21:42:44&quot;
            }
        }
    },
    &quot;infos&quot;: {
        &quot;records&quot;: [
            {
                &quot;field1&quot;: &quot;qux&quot;,
                &quot;field2&quot;: &quot;baz&quot;,
            }
        ],
        &quot;class&quot;: &quot;P&quot;
    }
}

I'm using the code below :

df = pd.json_normalize(
    d,
    record_path=[&quot;infos&quot;, &quot;records&quot;],
    meta=[
        &quot;type&quot;,
        [&quot;details&quot;, &quot;date&quot;, &quot;timestamp&quot;],
        [&quot;details&quot;, &quot;date&quot;, &quot;components&quot;, &quot;year&quot;],
        [&quot;infos&quot;, &quot;class&quot;]
    ],
    errors=&quot;ignore&quot;
)

Which gives me this output :

  field1 field2   type details.date.timestamp details.date.components.year infos.class
0    qux    baz  type1                    NaN                          NaN           P

But I'm expecting this one :

  field1 field2   type details.date.timestamp details.date.components.year infos.class
0    qux    baz  type1    01/02/2023 21:42:44                         2023           P

To be honest, I'm going crazy with the meta parameter! I ignore what I'm doing wrong..

Can you explain its logic, please ?

答案1

得分: 2

我认为你应该在`record_path=`中额外添加`[]`：
```py
df = pd.json_normalize(
    d,
    record_path=[["infos", "records"]],  # &lt;-- 在这里加上 []
    meta=[
        "type",
        ["details", "date", "timestamp"],
        ["details", "date", "components", "year"],
        ["infos", "class"],
    ],
    errors="ignore",
)
print(df)

打印：

  field1 field2   type details.date.timestamp details.date.components.year infos.class
0    qux    baz  type1    01/02/2023 21:42:44                         2023           P


<details>
<summary>英文:</summary>
I think you should put extra `[]` in `record_path=`:
```py
df = pd.json_normalize(
    d,
    record_path=[[&quot;infos&quot;, &quot;records&quot;]],  # &lt;-- put [] here
    meta=[
        &quot;type&quot;,
        [&quot;details&quot;, &quot;date&quot;, &quot;timestamp&quot;],
        [&quot;details&quot;, &quot;date&quot;, &quot;components&quot;, &quot;year&quot;],
        [&quot;infos&quot;, &quot;class&quot;],
    ],
    errors=&quot;ignore&quot;,
)
print(df)

Prints:

  field1 field2   type details.date.timestamp details.date.components.year infos.class
0    qux    baz  type1    01/02/2023 21:42:44                         2023           P

通过集体智慧和协作来改善编程学习和解决问题的方式。致力于成为全球开发者共同参与的知识库，让每个人都能够通过互相帮助和分享经验来进步。

为什么在使用 pandas 规范化 JSON 时，访问嵌套元数据会得到 NaN？

问题

答案1

pytest与@cache结合使用时无法按预期工作。

有没有办法在Pygame Zero中改变声音的音量？

遍历JSON数组

如何在返回到Dash标签时保留相同的内容

如何在Playwright视觉比较中屏蔽多个定位器？

在C++中，可以使用可变模板参数来检索类型的内部类型。

selenium.common.exceptions.StaleElementReferenceException: Message: stale element reference: stale element not found

Creating and opening a URL to log in to Website via Basic Auth with Robot Framework/Selenium (Python)

AG Grid 在上下文菜单中以大文本形式打开

What's the correct way to type hint an empty list as a literal in python?

如何在Highcharts Gantt中更改本地化的星期名称

如何在同一个流中使用多个过滤器和映射函数？

如何使用Map/Set来将代码优化到O(n)？

.NET MAUI Android在GitHub Actions上构建失败，错误代码为1。