February 27, 2023, 09:46:08

Convert a list of multiple JSON files into a pandas DataFrame

Question

This is how I parsed multiple JSON files into a single list:

import os
import pandas as pd

base_dir = 'jsons_final_folder/'
data_list = []
for file in os.listdir(base_dir):
    if 'json' in file:
        json_path = os.path.join(base_dir, file)
        json_data = pd.read_json(json_path, lines=True)
        data_list.append(json_data)

I got a list that looks like this:

print(data_list)
output:
[                                                   0
0  {"general":{"key":"value","q":"...,                                          0
0  {"general":{"key":"value","q":"...,                                          0
0  {"general":{"key":"value","q":"...,                                          0
0  {"general":{"key":"value","q":"...,                                          0
0  {"general":{"key":"value","q":"...,                                          0
0  {"general":{"key":"value","q":"...,                                          0
0  {"general":{"key":"value","q":"...,]                                         0

This is my code to convert it to a DataFrame:

import csv

with open("f.csv", "w") as f:
    wr = csv.writer(f)
    wr.writerow(data_list)

But I get a DataFrame (type pandas.core.frame.DataFrame) like this:

{"general":{"key":"value","q":"...,	{"general":{"key":"value","q":"...,	{"general":{"key":"value","q":"...,	{"general":{"key":"value","q":"...,

with a shape of n columns and 0 rows.

What I am trying to do is build a DataFrame from this list, which contains only JSON documents with specific queries, but I don't know what the problem is.

I also tried adding a delimiter.

I want the final shape to look like this:

json
{"general":{"key":"value","q":"...,
{"general":{"key":"value","q":"...,

Thank you

Answer 1

Score: 0

Have you tried df = pd.DataFrame({'json': data_list})?
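
A minimal sketch of that suggestion, assuming each file holds a single JSON document and that the raw text should end up in one json column. Here data_list holds plain strings rather than the DataFrames returned by pd.read_json; the temporary folder, file names, and sample contents are made up for illustration.

```python
import json
import tempfile
from pathlib import Path

import pandas as pd

# Hypothetical sample data: two small JSON files in a temporary folder.
base_dir = Path(tempfile.mkdtemp())
for i in range(2):
    doc = {"general": {"key": "value", "q": f"query {i}"}}
    (base_dir / f"sample_{i}.json").write_text(json.dumps(doc), encoding="utf-8")

# Collect the raw text of every .json file (sorted for a stable order).
data_list = [
    path.read_text(encoding="utf-8")
    for path in sorted(base_dir.glob("*.json"))
]

# One column named "json", one row per file.
df = pd.DataFrame({"json": data_list})
print(df.shape)
```

If data_list held DataFrames instead, as in the question, each cell of the json column would contain a whole DataFrame object, which is probably not the intended result.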

Answer 2

Score: 0

I found the solution in this video.

A function that returns the file paths:

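import glob
import os

def get_files(filepath):
    """Return absolute paths of every .json file under filepath."""
    all_files = []
    # Walk each directory below filepath and collect its .json files.
    for root, dirs, files in os.walk(filepath):
        for f in glob.glob(os.path.join(root, '*.json')):
            all_files.append(os.path.abspath(f))
    return all_files

j_files = get_files("../path goes here/")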
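
The same recursive search can probably also be written with pathlib. This is an alternative sketch, not the code from the answer; get_json_files and the temporary folder layout are hypothetical names chosen for the example.

```python
import tempfile
from pathlib import Path


def get_json_files(folder):
    """Return absolute paths of all .json files under folder, recursively."""
    return sorted(str(p.resolve()) for p in Path(folder).rglob("*.json"))


# Hypothetical layout, built in a temporary directory for illustration.
root = Path(tempfile.mkdtemp())
(root / "nested").mkdir()
(root / "a.json").write_text("{}", encoding="utf-8")
(root / "nested" / "b.json").write_text("{}", encoding="utf-8")
(root / "notes.txt").write_text("not json", encoding="utf-8")

j_files = get_json_files(root)
print(len(j_files))
```

Sorting the paths is optional; it only makes the row order of the later DataFrame reproducible between runs.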

Here we read each file and append its parsed contents to a list:

import json

j_files_list = []
for j_file in j_files:
    # Parse each file into a Python object (a dict for a JSON object).
    with open(j_file) as doc:
        j_files_list.append(json.load(doc))

Here we convert the list into a DataFrame (with pandas imported as pd):

df = pd.DataFrame(j_files_list)
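
One thing to be aware of: with nested documents like the sample in the question, pd.DataFrame keeps each top-level key as a single column of dicts. If flat columns are wanted instead, pd.json_normalize may be a better fit. The sketch below assumes records shaped like the question's truncated sample; the "q" values are invented.

```python
import pandas as pd

# Hypothetical records shaped like the question's truncated sample.
j_files_list = [
    {"general": {"key": "value", "q": "first"}},
    {"general": {"key": "value", "q": "second"}},
]

nested = pd.DataFrame(j_files_list)     # one "general" column holding dicts
flat = pd.json_normalize(j_files_list)  # columns "general.key" and "general.q"

print(list(nested.columns))
print(list(flat.columns))
```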

And then save the DataFrame to CSV:

df.to_csv('json_files_in_df.csv')
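
By default to_csv also writes the row index as an extra first column. If only the data columns are wanted, passing index=False should leave it out. A small round-trip sketch, using made-up rows and a temporary output path:

```python
import os
import tempfile

import pandas as pd

df = pd.DataFrame({"json": ['{"a": 1}', '{"a": 2}']})  # made-up rows

out_path = os.path.join(tempfile.mkdtemp(), "json_files_in_df.csv")
df.to_csv(out_path, index=False)  # index=False leaves out the 0, 1, ... column

# Read the file back to check that the shape survived the round trip.
round_trip = pd.read_csv(out_path)
print(round_trip.shape)
```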

Thanks for helping out

