2023年5月29日 23:09:54go评论154阅读模式

英文:

Updating existing Excel file with Pandas and Openpyxl throws an AttributeError: property 'book' of 'OpenpyxlWriter' object has no setter

问题

我一直在尝试更新现有Excel中的数据，过程如下：

从Excel读取数据
使用pandas将其与新数据合并
将合并后的数据帧保存到原始文件中

它一直返回以下错误，我认为这是由于在读取文件时写入（更新）同一文件导致的：

AttributeError                            Traceback (most recent call last)
Cell In[14], line 19
     18 with pd.ExcelWriter(file_path, engine='openpyxl') as writer:
---> 19     writer.book = book
     20     writer.sheets = {ws.title: ws for ws in book.worksheets}
AttributeError: property 'book' of 'OpenpyxlWriter' object has no setter

我的代码在这里 - 运行它还会擦除原始文件的数据并使文件无法使用：

# Load the Excel file
file_path = 'original.xlsx'
update = 'update.xlsx'
# Open the file in read-only mode to prevent any locks
with open(file_path, "rb") as file:
    book = load_workbook(file)
# Combine original with update file
for sheet_name in ['sheet1', ...]:
    df1 = pd.read_excel(file_path, sheet_name=sheet_name)
    df2 = pd.read_excel(update, sheet_name=sheet_name)
    df2 = df2.iloc[::-1]
    df1 = pd.concat([df1, df2], ignore_index=True)
    df1 = df1.drop_duplicates(subset='column1', keep='last')
    # Write combined data to the sheet
    with pd.ExcelWriter(file_path, engine='openpyxl') as writer:
        writer.book = book
        writer.sheets = {ws.title: ws for ws in book.worksheets}
        
        # Set the sheet as the active sheet
        book.active = book.sheetnames.index(sheet_name)
        
        df1.to_excel(writer, sheet_name=sheet_name, index=False, startrow=1)
    print(f"Successfully updated '{sheet_name}' sheet in '{file_path}'.")

请注意，这是你的原始代码的中文翻译部分。

英文:

I have been trying to update data in an existing Excel--the process as follows:

1)read the data from Excel

2)combine it with a new data using pandas

3)save the combined dataframe into the original file

It keeps return the following error, which I think it is coming from writing(updating) the same file while reading it:

AttributeError                            Traceback (most recent call last)
Cell In[14], line 19
     18 with pd.ExcelWriter(file_path, engine=&#39;openpyxl&#39;) as writer:
---&gt; 19     writer.book = book
     20     writer.sheets = {ws.title: ws for ws in book.worksheets}
AttributeError: property &#39;book&#39; of &#39;OpenpyxlWriter&#39; object has no setter

My code is here--running it also erases the data of the original file and make the file unusable:

# Load the Excel file
file_path = &#39;original.xlsx&#39;
update = &#39;update.xlsx&#39;
# Open the file in read-only mode to prevent any locks
with open(file_path, &quot;rb&quot;) as file:
    book = load_workbook(file)
# Combine original with update file
for sheet_name in [&#39;sheet1&#39;, ...]:
    df1 = pd.read_excel(file_path, sheet_name = sheet_name)
    df2 = pd.read_excel(update, sheet_name = sheet_name)
    df2 = df2.iloc[::-1]
    df1 = pd.concat([df1, df2], ignore_index = True)
    df1 = df1.drop_duplicates(subset = &#39;column1&#39;, keep = &#39;last&#39;)
    # Write combined data to the sheet
    with pd.ExcelWriter(file_path, engine=&#39;openpyxl&#39;) as writer:
        writer.book = book
        writer.sheets = {ws.title: ws for ws in book.worksheets}
        
        # Set the sheet as the active sheet
        book.active = book.sheetnames.index(sheet_name)
        
        df1.to_excel(writer, sheet_name = sheet_name, index = False, startrow = 1)
    print(f&quot;Successfully updated &#39;{sheet_name}&#39; sheet in &#39;{file_path}&#39;.&quot;)

答案1

得分: 0

这是因为 book 是只读属性。

由于您只是在更新文件，可以尝试使用 Pandas 提供的附加模式和 if_sheet_exists 标志，示例文档请查看：docs

结果将类似于以下内容：

# 将原始数据与更新文件合并
for sheet_name in ['sheet1', 'sheet2']:
    df1 = pd.read_excel(file_path, sheet_name=sheet_name)
    df2 = pd.read_excel(update, sheet_name=sheet_name)
    df2 = df2.iloc[::-1]
    df1 = pd.concat([df1, df2], ignore_index=True)
    df1 = df1.drop_duplicates(subset='column1', keep='last')
    # 将合并后的数据写入工作表
    with pd.ExcelWriter(file_path, mode='a', if_sheet_exists='replace', engine='openpyxl') as writer:
        df1.to_excel(writer, sheet_name=sheet_name, index=False, startrow=0)
    print(f"成功更新了 '{sheet_name}' 工作表在 '{file_path}' 中。")

英文:

This is because book is read-only property.

Since you're only updating a file, you may try append mode with flags a and if_sheet_exists, provided by Pandas: docs

Result will look similar to that:

# Combine original with update file
for sheet_name in [&#39;sheet1&#39;, &#39;sheet2&#39;]:
    df1 = pd.read_excel(file_path, sheet_name=sheet_name)
    df2 = pd.read_excel(update, sheet_name=sheet_name)
    df2 = df2.iloc[::-1]
    df1 = pd.concat([df1, df2], ignore_index=True)
    df1 = df1.drop_duplicates(subset=&#39;column1&#39;, keep=&#39;last&#39;)
    # Write combined data to the sheet
    with pd.ExcelWriter(file_path, mode=&#39;a&#39;, if_sheet_exists=&#39;replace&#39;, engine=&#39;openpyxl&#39;) as writer:
        df1.to_excel(writer, sheet_name=sheet_name, index=False, startrow=0)
    print(f&quot;Successfully updated &#39;{sheet_name}&#39; sheet in &#39;{file_path}&#39;.&quot;)

通过集体智慧和协作来改善编程学习和解决问题的方式。致力于成为全球开发者共同参与的知识库，让每个人都能够通过互相帮助和分享经验来进步。

Updating existing Excel file with Pandas and Openpyxl throws an AttributeError: property 'book' of 'OpenpyxlWriter' object has no setter

问题

答案1

No module named 'pydantic_core._pydantic_core' in AWS Lambda though library is installed for fast api based code

将JavaScript加密算法转换为Python。

如何使用scatter_kws自定义pairplot中的标记样式

my drawing in pygame is covered by something and I want to animate it what does not work

如何在Playwright视觉比较中屏蔽多个定位器？

在C++中，可以使用可变模板参数来检索类型的内部类型。

selenium.common.exceptions.StaleElementReferenceException: Message: stale element reference: stale element not found

Creating and opening a URL to log in to Website via Basic Auth with Robot Framework/Selenium (Python)

AG Grid 在上下文菜单中以大文本形式打开

What's the correct way to type hint an empty list as a literal in python?

如何在Highcharts Gantt中更改本地化的星期名称

如何在同一个流中使用多个过滤器和映射函数？

如何使用Map/Set来将代码优化到O(n)？

.NET MAUI Android在GitHub Actions上构建失败，错误代码为1。