2023-05-21 03:34:59

Grouping a list of dictionaries (with common values) from 3 lists of dictionaries

Question

list1 = [
  {'a': 1, 'fruit': 'apple', 'thing': 'aeroplane'}, 
  {'b': 2, 'fruit': 'banana', 'thing': 'bicycle'}, 
  {'c': 3, 'fruit': 'cherry', 'thing': 'chair'}
]

list2 = [
  {'fruit': 'apple', 'color': 'green'}, 
  {'fruit': 'banana', 'color': 'yellow'}, 
  {'fruit': 'cherry', 'color': 'red'}
]

list3 = [
  {'thing': 'aeroplane', 'capacity': 100}, 
  {'thing': 'bicycle', 'capacity': 2}, 
  {'thing': 'chair', 'capacity': 1}
]

what_i_want = [
  [
    {'a': 1, 'fruit': 'apple', 'thing': 'aeroplane'}, 
    {'fruit': 'apple', 'color':'green'}, 
    {'thing': 'aeroplane', 'capacity': 100}
  ],
  [
    {'b': 2, 'fruit': 'banana', 'thing': 'bicycle'}, 
    {'fruit': 'banana', 'color':'yellow'}, 
    {'thing': 'bicycle', 'capacity': 2}
  ],
  [
    {'c': 3, 'fruit': 'cherry', 'thing': 'chair'}, 
    {'fruit': 'cherry', 'color': 'red'}, 
    {'thing': 'chair', 'capacity': 1}
  ]
]

The grouping should follow the order of list1. The lists I am working with contain more than 100 dict objects in no particular order. Also, the common values are computer-generated ids, not the alphabetical values used in the example (in case anybody was thinking of sorting alphabetically). I want to do this in the most Pythonic way possible, with as few iterations as possible.

I have looked at similar questions. They suggest using defaultdict, groupby and itemgetter, but I'm not sure whether these will work in my use case, since those questions deal with a single list while I am dealing with 3-4 lists.

Answer 1

Score: 0

We can solve this with a list comprehension with some nested list comprehensions to select matching items from list2 and list3.

[
  [
    x, 
    *[y for y in list2 if x['fruit'] == y['fruit']],
    *[z for z in list3 if x['thing'] == z['thing']]
  ]
  for x in list1
]

Result:

[
  [
    {'a': 1, 'fruit': 'apple', 'thing': 'aeroplane'},
    {'fruit': 'apple', 'color': 'green'},
    {'thing': 'aeroplane', 'capacity': 100}
  ],
  [
    {'b': 2, 'fruit': 'banana', 'thing': 'bicycle'},
    {'fruit': 'banana', 'color': 'yellow'},
    {'thing': 'bicycle', 'capacity': 2}
  ],
  [
    {'c': 3, 'fruit': 'cherry', 'thing': 'chair'},
    {'fruit': 'cherry', 'color': 'red'},
    {'thing': 'chair', 'capacity': 1}
  ]
]

This works well for small samples of data, but performance is poor for large data sets due to repeated iterations over list2 and list3. This would be O(len(list1) * (len(list2) + len(list3))) or simplified O(n^2), which is not ideal.
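
To make that cost concrete, here is a rough sketch that counts the equality checks the nested comprehensions perform. The synthetic ids, the `same` helper and the `counter` dict are assumptions made up for illustration; they are not part of the original data.

```python
# Synthetic data: n rows per list, with made-up ids standing in for real ones.
n = 100
list1 = [{'fruit': f'f{i}', 'thing': f't{i}'} for i in range(n)]
list2 = [{'fruit': f'f{i}'} for i in range(n)]
list3 = [{'thing': f't{i}'} for i in range(n)]

counter = {'count': 0}  # mutable holder so the helper can update it


def same(a, b):
    """Compare two ids and record that a comparison happened."""
    counter['count'] += 1
    return a == b


grouped = [
    [
        x,
        *[y for y in list2 if same(x['fruit'], y['fruit'])],
        *[z for z in list3 if same(x['thing'], z['thing'])],
    ]
    for x in list1
]

# Every row of list1 is compared against every row of list2 and list3.
print(counter['count'])
```

With 100 rows per list this performs 100 * (100 + 100) comparisons, and the count grows quadratically as the lists grow together.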

Instead, using collections.defaultdict we can create a dictionary where the keys are either the 'fruit' or 'thing' values and the values are lists of any dictionaries that have those attributes.

This is slightly fragile, as it doesn't handle something like 'apple' as a value for a 'thing' key very well. To solve this we might have a dictionary of defaultdicts for each key name we're searching for. I leave that further exercise to the reader.

This approach is O(max(len(list1), len(list2) + len(list3))) or in simpler form O(n) since it only iterates once over any of the lists involved. This scales much better.

from collections import defaultdict

d = defaultdict(list)

for x in list2 + list3:
  key = x.get('fruit') or x.get('thing')
  if key:
    d[key].append(x)

# {'apple':     [{'fruit': 'apple', 'color': 'green'}],
#  'banana':    [{'fruit': 'banana', 'color': 'yellow'}],
#  'cherry':    [{'fruit': 'cherry', 'color': 'red'}],
#  'aeroplane': [{'thing': 'aeroplane', 'capacity': 100}],
#  'bicycle':   [{'thing': 'bicycle', 'capacity': 2}],
#  'chair':     [{'thing': 'chair', 'capacity': 1}]}

grouped_data = [
  [
    x,
    *d[x['fruit']],
    *d[x['thing']]
  ]
  for x in list1
]

# [
#   [
#     {'a': 1, 'fruit': 'apple', 'thing': 'aeroplane'},
#     {'fruit': 'apple', 'color': 'green'},
#     {'thing': 'aeroplane', 'capacity': 100}
#   ],
#   [
#     {'b': 2, 'fruit': 'banana', 'thing': 'bicycle'},
#     {'fruit': 'banana', 'color': 'yellow'},
#     {'thing': 'bicycle', 'capacity': 2}
#   ],
#   [
#     {'c': 3, 'fruit': 'cherry', 'thing': 'chair'},
#     {'fruit': 'cherry', 'color': 'red'},
#     {'thing': 'chair', 'capacity': 1}
#   ]
# ]
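
The fragility mentioned above (an id such as 'apple' appearing under both the 'fruit' and the 'thing' key) could be handled with one index per key name. The following is only a sketch of that idea: the `build_index` helper, the `indexes` dict and the altered sample data are assumptions introduced for illustration.

```python
from collections import defaultdict

# Sample data altered so that 'apple' is used both as a fruit id and a thing id.
list1 = [
    {'a': 1, 'fruit': 'apple', 'thing': 'aeroplane'},
    {'b': 2, 'fruit': 'banana', 'thing': 'apple'},
]
list2 = [
    {'fruit': 'apple', 'color': 'green'},
    {'fruit': 'banana', 'color': 'yellow'},
]
list3 = [
    {'thing': 'aeroplane', 'capacity': 100},
    {'thing': 'apple', 'capacity': 0},
]


def build_index(rows, key_name):
    """Map each value found under key_name to the list of rows carrying it."""
    index = defaultdict(list)
    for row in rows:
        if key_name in row:
            index[row[key_name]].append(row)
    return index


# One separate index per key name, so ids cannot collide across keys.
indexes = {
    'fruit': build_index(list2, 'fruit'),
    'thing': build_index(list3, 'thing'),
}

# .get(..., []) avoids adding empty entries to the indexes for unknown ids.
grouped_data = [
    [
        x,
        *indexes['fruit'].get(x['fruit'], []),
        *indexes['thing'].get(x['thing'], []),
    ]
    for x in list1
]
```

Each list is still traversed only once, so the linear behaviour is kept, while the second group picks up only the 'apple' thing and not the 'apple' fruit.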

Answer 2

Score: 0

For your sample data, you can simply zip the lists together, as the ordering in each list matches the others:

res = list(zip(list1, list2, list3))

If that is not going to be the case, you can build dicts from the list2 and list3 variables (using their fruit and thing properties as the keys), then use a list comprehension, indexing into those dicts using the fruit and thing properties from each list1 dict:

dict2 = {d['fruit']: d for d in list2}
dict3 = {d['thing']: d for d in list3}
res = [[d, dict2.get(d['fruit']), dict3.get(d['thing'])] for d in list1]

Output:

[
  [
    {'a': 1, 'fruit': 'apple', 'thing': 'aeroplane'},
    {'fruit': 'apple', 'color': 'green'},
    {'thing': 'aeroplane', 'capacity': 100}
  ],
  [
    {'b': 2, 'fruit': 'banana', 'thing': 'bicycle'},
    {'fruit': 'banana', 'color': 'yellow'},
    {'thing': 'bicycle', 'capacity': 2}
  ],
  [
    {'c': 3, 'fruit': 'cherry', 'thing': 'chair'},
    {'fruit': 'cherry', 'color': 'red'},
    {'thing': 'chair', 'capacity': 1}
  ]
]
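
Since zip pairs items purely by position (and yields tuples rather than lists), it only gives the wanted result when all three lists are already aligned. The sketch below runs the dict lookup on deliberately shuffled lists to show that the grouping still follows list1. The shuffled data, the missing 'bicycle' row and the filtering of None entries are assumptions added for illustration, not part of the answer above.

```python
list1 = [
    {'a': 1, 'fruit': 'apple', 'thing': 'aeroplane'},
    {'b': 2, 'fruit': 'banana', 'thing': 'bicycle'},
    {'c': 3, 'fruit': 'cherry', 'thing': 'chair'},
]
# list2 and list3 are out of order, and list3 has no row for 'bicycle'.
list2 = [
    {'fruit': 'cherry', 'color': 'red'},
    {'fruit': 'apple', 'color': 'green'},
    {'fruit': 'banana', 'color': 'yellow'},
]
list3 = [
    {'thing': 'chair', 'capacity': 1},
    {'thing': 'aeroplane', 'capacity': 100},
]

# Build one lookup table per list, keyed by the shared id.
# If an id occurs more than once in a list, only the last row is kept.
dict2 = {d['fruit']: d for d in list2}
dict3 = {d['thing']: d for d in list3}

# Look up the matches for each list1 row; drop lookups that found nothing.
res = [
    [m for m in (d, dict2.get(d['fruit']), dict3.get(d['thing'])) if m is not None]
    for d in list1
]
```

Whether to drop missing matches or keep a None placeholder depends on what the consumer of `res` expects; keeping None preserves a fixed group length of three.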
