Question

I am trying to write a script that iterates over the .csv files in a folder, runs some calculations on each one, and saves the results to a new .csv file. This works fine when I compute overall means and percentages, but I get a KeyError when I try to add per-condition calculations.

Here is the code I have produced so far:

import os
import pandas as pd
import csv

folder_path = 'D:/Libraries/Documents/data'

rtAvg = 'block1_respo.rt'
cond = 'block1_trigger'

output_file = 'results_compiled.csv'

df_list = []

for filename in os.listdir(folder_path):
    if filename.endswith('.csv'):
        # Load the CSV file into a data frame
        df = pd.read_csv(os.path.join(folder_path, filename))

        # This code will check if the column is missing, and will populate that row with "9999" if it is
        if rtAvg not in df.columns:
            df[rtAvg] = 9999

        rt_mean_b1 = df[rtAvg].mean()

        con1 = df.loc[df[cond] == 101][rtAvg].mean()
        con2 = df.loc[df[cond] == 102][rtAvg].mean()
        con3 = df.loc[df[cond] == 103][rtAvg].mean()
        con4 = df.loc[df[cond] == 104][rtAvg].mean()

        # Create a new row for the summary data
        summary_row = pd.DataFrame({
            'csv_file': filename,
            'rt_average': rt_mean_b1,
            '101_rt': con1,
            '102_rt': con2,
            '103_rt': con3,
            '104_rt': con4
        }, index=[0])

        # Append the summary data to the list of data frames
        df_list.append(summary_row)

summary_df = pd.concat(df_list)

summary_df.to_csv(output_file, index=False)

Here is the error I am receiving:

con1 = df.loc[df[cond]==101][rtAvg].mean()
  File "C:\Program Files\PsychoPy\lib\site-packages\pandas\core\frame.py", line 3458, in __getitem__
    indexer = self.columns.get_loc(key)
  File "C:\Program Files\PsychoPy\lib\site-packages\pandas\core\indexes\base.py", line 3363, in get_loc
    raise KeyError(key) from err
KeyError: 'block1_trigger'

block1_trigger definitely exists. I have printed the lists and it is in there. I had also previously tried removing the white space with code but it made no difference. I also tried cond = ' block1_trigger' and 'block1_trigger ' and neither produced the expected results.

Answer 1

Score: 0

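I have figured out the error. Some of the files were missing this column, so the KeyError was raised for those files even though the column exists in the others. I have now added a check that creates the column when it is absent: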

if cond not in df.columns:
    df[cond] = 9999
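
One thing to keep in mind with the 9999 placeholder: filling the trigger column with 9999 means no row matches 101 to 104, so the per-condition means come out as NaN, while filling the RT column with 9999 makes rt_average equal 9999 for that file. As a rough sketch of an alternative, the per-file summary could record NaN directly and note which columns were absent. The names summarize, RT_COL, COND_COL, CONDITIONS, and the missing_columns field are hypothetical and not part of the original script, and the two small data frames stand in for real files.

```python
import numpy as np
import pandas as pd

# Column names taken from the question; the constant names are assumptions.
RT_COL = 'block1_respo.rt'
COND_COL = 'block1_trigger'
CONDITIONS = (101, 102, 103, 104)


def summarize(df, filename):
    """Return one summary row (a dict) for a single file's data frame."""
    # Record which of the expected columns this file lacks.
    missing = [c for c in (RT_COL, COND_COL) if c not in df.columns]
    row = {'csv_file': filename, 'missing_columns': ';'.join(missing)}

    # Overall mean RT, or NaN when the RT column is absent.
    row['rt_average'] = df[RT_COL].mean() if RT_COL in df.columns else np.nan

    # Per-condition mean RT, or NaN when either column is absent.
    for code in CONDITIONS:
        if missing:
            row[f'{code}_rt'] = np.nan
        else:
            row[f'{code}_rt'] = df.loc[df[COND_COL] == code, RT_COL].mean()
    return row


# Stand-in data: one complete file and one without the trigger column.
complete = pd.DataFrame({COND_COL: [101, 101, 102], RT_COL: [0.5, 0.7, 0.9]})
no_trigger = pd.DataFrame({RT_COL: [0.4, 0.6]})

rows = [summarize(complete, 'complete.csv'),
        summarize(no_trigger, 'no_trigger.csv')]
summary_df = pd.DataFrame(rows)
print(summary_df)
```

In the original loop, the same function could be called on each df returned by pd.read_csv, with the resulting dicts collected in a list and turned into a single data frame at the end. The missing_columns field then shows at a glance which files triggered the KeyError.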
