2023年6月5日 14:52:45go评论72阅读模式

英文:

How can I control the order of attributes when converting dataclasses to a dict in Python?

问题

我想在我的项目中使用命名数据，但最终我需要将数据转换为CSV文件。

我的方法是在代码中使用数据类的数据容器列表。最后，数据类的列表转换为字典列表，然后转换为CSV文件。

我希望能控制最终数据对象中属性出现的顺序。这是我的代码：

from dataclasses import dataclass

@dataclass
class MyClass:
    c: str
    b: str
    a: str = "Default"

objdata = [
    MyClass(c="Foo", b="Bar"),
    MyClass(c="Test", b="Test"),
    MyClass(c="asdf", b="yxcv")
]

dictdata = []
attributes = [a for a in dir(objdata[0]) if not a.startswith('__')]
for entry in objdata:
    csv_entry = {}
    for attribute in attributes:
        csv_entry[attribute] = getattr(entry, attribute)
    dictdata.append(csv_entry)

我有几个具有不同属性的数据类。我想重用将数据类转换为字典的代码，因此我循环遍历类中的属性。问题是dir按字母顺序返回属性（所以是a、b、c），而不是我在类中声明它们的顺序（它们是c、b、a）。有办法控制这个顺序吗？

英文:

i want to used named data in my project but in the end i have to convert the data to a CSV file.

My approach is to use list of dataclasses in the code as data containers. In the end the list of dataclasses is converted to a list of dicts and from that to a CSV file.

I want to control the order in which the attributes appear in the final data object. Here is my code:

from dataclasses import dataclass

@dataclass
class MyClass:
    c: str
    b: str
    a: str = &quot;Default&quot;

objdata = [
    MyClass(c=&quot;Foo&quot;, b=&quot;Bar&quot;),
    MyClass(c=&quot;Test&quot;, b=&quot;Test&quot;),
    MyClass(c=&quot;asdf&quot;, b=&quot;yxcv&quot;)
    ]

dictdata = []
attributes = [a for a in dir(objdata[0]) if not a.startswith(&#39;__&#39;)]
for entry in objdata:
    csv_entry = {}
    for attribute in attributes:
        csv_entry[attribute] = getattr(entry, attribute)
    dictdata.append(csv_entry)

I have several dataclasses with different attributes. I want to reuse the code converting the dataclass to a dict, so i loop over the attributes in the class. The problem is dir returns the attributes in alphabetical order (so a, b c) and not in the order i declared them in the class (which is c, b, a). Is there a way to control this?

答案1

得分: 1

在Python 3.11.3中，属性可以按照声明的顺序获取，如下所示：

from dataclasses import dataclass

@dataclass
class A:
    c: str
    b: str
    a: str

a = A(a='A', c='C', b='B')

for key in a.__dict__:
    print(key)

输出:

c
b
a

英文:

In Python 3.11.3 the attributes can be acquired in order of declaration thus:

from dataclasses import dataclass

@dataclass
class A:
    c: str
    b: str
    a: str

a = A(a=&#39;A&#39;, c=&#39;C&#39;, b=&#39;B&#39;)

for key in a.__dict__:
    print(key)

Output:

c
b
a

答案2

得分: 1

所以，你只需要使用dataclasses.asdict，它为任何由使用dataclasses.dataclass代码生成器装饰的类创建的对象实例实现了这种行为。

import dataclasses

@dataclasses.dataclass
class MyClass:
    c: str
    b: str
    a: str = "Default"

objdata = [
    MyClass(c="Foo", b="Bar"),
    MyClass(c="Test", b="Test"),
    MyClass(c="asdf", b="yxcv")
]

for obj in objdata:
    print(dataclasses.asdict(obj))

注意，这是通过使用dataclasses.fields(obj)来实现的，它最终只是访问了__dataclass_fields__类变量。详情可以在源代码中找到。

如果你只想将你知道是某个特定dataclass类型的每个对象写入CSV文件，那么无需创建中间字典。你可以这样做：

import operator
import csv

fields = [f.name for f in dataclasses.fields(MyClass)]
getter_func = operator.attrgetter(*fields)

with open("data.csv", "w") as f:
    writer = csv.writer(f)
    writer.writerow(fields)  # 如果你需要标题行
    writer.writerows(map(getter_func, objdata))

英文:

So, you should just use dataclasses.asdict, which implements this behavior for any object that is an instance of a class created by a class that was decorated with the dataclasses.dataclass code generator.

&gt;&gt;&gt; import dataclasses
&gt;&gt;&gt; @dataclasses.dataclass
... class MyClass:
...     c: str
...     b: str
...     a: str = &quot;Default&quot;
...
&gt;&gt;&gt; objdata = [
...     MyClass(c=&quot;Foo&quot;, b=&quot;Bar&quot;),
...     MyClass(c=&quot;Test&quot;, b=&quot;Test&quot;),
...     MyClass(c=&quot;asdf&quot;, b=&quot;yxcv&quot;)
...     ]
&gt;&gt;&gt;
&gt;&gt;&gt; for obj in objdata:
...     print(dataclasses.asdict(obj))
...
{&#39;c&#39;: &#39;Foo&#39;, &#39;b&#39;: &#39;Bar&#39;, &#39;a&#39;: &#39;Default&#39;}
{&#39;c&#39;: &#39;Test&#39;, &#39;b&#39;: &#39;Test&#39;, &#39;a&#39;: &#39;Default&#39;}
{&#39;c&#39;: &#39;asdf&#39;, &#39;b&#39;: &#39;yxcv&#39;, &#39;a&#39;: &#39;Default&#39;}

Note, this is implemented by using dataclasses.fields(obj), which ultimately just access the __dataclass_fields__ class variable. The details are all pretty easy to follow in the source code

Note, if you simply wanted to write every object that you know is of some specific dataclass type to a csv, I wouldn't create intermediate dicts. You could just do:

import operator
import csv

fields = [f.name for f in dataclasses.fields(MyClass)]
getter_func = operator.attrgetter(*fields)

with open(&quot;data.csv&quot;) as f:
    writer = csv.writer(f)
    writer.writerow(fields) # If you want a header
    writer.writerows(map(getter_func, objdata))

通过集体智慧和协作来改善编程学习和解决问题的方式。致力于成为全球开发者共同参与的知识库，让每个人都能够通过互相帮助和分享经验来进步。

你可以控制在Python中将dataclasses转换为字典时属性的顺序吗？

问题

答案1

答案2

"getaddrinfo() argument 1 must be string or None" error starting dev_appserver.py on Linux

创建一个包含变量名称和值的字典列表。

如何改进我用于在时间序列中分类间歇信号的Python代码？

Pydantic根验证器无法访问类属性。

What's the correct way to type hint an empty list as a literal in python?

如何在Highcharts Gantt中更改本地化的星期名称

如何在同一个流中使用多个过滤器和映射函数？

如何使用Map/Set来将代码优化到O(n)？

.NET MAUI Android在GitHub Actions上构建失败，错误代码为1。

如何在Playwright视觉比较中屏蔽多个定位器？

在C++中，可以使用可变模板参数来检索类型的内部类型。

selenium.common.exceptions.StaleElementReferenceException: Message: stale element reference: stale element not found

Creating and opening a URL to log in to Website via Basic Auth with Robot Framework/Selenium (Python)

AG Grid 在上下文菜单中以大文本形式打开

发表评论