2023年2月14日 05:28:08go评论68阅读模式

英文:

Checking if file to be copied already exists in specified directory and if so skip the file and move onto next

问题

我正在遍历一个主目录，其中包含许多子目录，每个子目录都包含它们自己的子目录。我想要复制扩展名为 .xlsx 的文件从主目录到一个新目录，以汇总所有文件在一个地方。每个文件都有唯一的名称，每天都会添加新文件。

一旦文件被复制到新目录，我希望脚本通过比较文件名来防止其被覆盖，基于已经包含在主目录中的内容，例如：

今天主目录包含 test1.xlsx 和 test2.xlsx，它们被复制到我指定的新目录。

两天后，主目录包含 test1.xlsx、test2.xlsx 和 test3.xlsx。在这种情况下，一旦执行代码，我希望遍历主目录和子目录，并识别只有 test3.xlsx 是新的，基于主目录中的文件搜索和我复制文件到的指定目录之间的比较。

抱歉，我是 StackOverflow 和 Python 新手，英语是我的第二语言，所以不太确定我是否解释得够清楚，但希望有人能理解。

我尝试了以下代码，但它一直在覆盖我希望复制 .xlsx 文件的指定目录中的文件：

import os
import shutil
from os.path import isfile

#count = 0

for root, dirs, files in os.walk('Checklists'):
    for file in files:
        if file.endswith('.xlsx'):
            #print(file)
            if isfile('Checklist'):
                print("文件已存在")
            else:
                #print(os.path.join(root, file))
                #count +=1
                #print(count)
                #if not os.path.exists(os.path.join('Checklists', file)):
                shutil.copy(os.path.abspath(root + '/' + file), 'Checklist', follow_symlinks=True)

（注意：我已经删除了代码中的 HTML 实体字符以及多余的注释。）

英文:

I am iterating through a master directory with numerous sub-directories each containing their own sub-directories. I am looking to copy files of extension type .xlsx from the master directory to a new directory to collate all the files in a single locations. Each file has a unique name with new files being added daily.

Once a file is copied to the new directory I would like the script to prevent it from being over-written by comparing file names based on what is already contained within the master directory eg:

Master directory today contains test1.xlsx and test2.xlsx which is copied to the new directory I specified.

2 Days later the master directory contains test1.xlsx, test 2.xlsx and test 3.xlsx. In this instance once I execute the code, I would like to iterate through the master directory and sub dirs and identify that only test 3.xlsx is new based on a comparison between the file search in the master directory and the specified directory where I copy the files to.

Apologies new to StackOverFlow and Python with English being a second language so not too sure if I explained it too well but hopefully someone will get the gist.

I have tried the following code but it keeps overwriting my files in my specified directory where I wish to copy the found .xlsx files to.

import os
import shutil
from os.path import isfile

#count = 0

for root, dirs, files in os.walk(&#39;Checklists&#39;):
    for file in files:
       if file.endswith(&#39;.xlsx&#39;):
        #print(file)
        if isfile(&#39;Checklist&#39;):
            print(&quot;File exists&quot;)
        else:
        #print(os.path.join(root, file))
        #count +=1
        #print(count)
        #if not os.path.exists(os.path.join(&#39;Checklists&#39;, file)):
            shutil.copy(os.path.abspath(root + &#39;/&#39; + file), &#39;Checklist&#39;, follow_symlinks=True)

答案1

得分: 0

我用以下的附加代码成功解决了这个谜题：

for root, dirs, files in os.walk('Checklists'):
for file in files:
   if file.endswith('.xlsx'):
    #print(file)
    if os.path.exists('Checklist'):
        pass
        print("文件已存在")
    else:
    #print(os.path.join(root, file))
    #count +=1
    #print(count)
    #if not os.path.exists(os.path.join('Checklists', file)):
        shutil.copy(os.path.abspath(root + '/' + file), 'Checklist', follow_symlinks=True)

感谢那些抽出时间查看我的问题的人

英文:

I managed to solve this riddle myself with the following addition:

for root, dirs, files in os.walk(&#39;Checklists&#39;):
for file in files:
   if file.endswith(&#39;.xlsx&#39;):
    #print(file)
    **if os.path.exists(&#39;Checklist&#39;):**
        pass
        print(&quot;File exists&quot;)
    else:
    #print(os.path.join(root, file))
    #count +=1
    #print(count)
    #if not os.path.exists(os.path.join(&#39;Checklists&#39;, file)):
        shutil.copy(os.path.abspath(root + &#39;/&#39; + file), &#39;Checklist&#39;, follow_symlinks=True)

Thanks to the folks who took the time out to look at my question at least

通过集体智慧和协作来改善编程学习和解决问题的方式。致力于成为全球开发者共同参与的知识库，让每个人都能够通过互相帮助和分享经验来进步。

Checking if file to be copied already exists in specified directory and if so skip the file and move onto next

问题

答案1

Apply tf.ensure_shape for multiple outputs.

如何使用Python检查网站是否使用WordPress编写？

按组合并并将每次出现保存在列中

Python继承更改子类字段

What's the correct way to type hint an empty list as a literal in python?

如何在Highcharts Gantt中更改本地化的星期名称

如何在同一个流中使用多个过滤器和映射函数？

如何使用Map/Set来将代码优化到O(n)？

.NET MAUI Android在GitHub Actions上构建失败，错误代码为1。

如何在Playwright视觉比较中屏蔽多个定位器？

在C++中，可以使用可变模板参数来检索类型的内部类型。

selenium.common.exceptions.StaleElementReferenceException: Message: stale element reference: stale element not found

Creating and opening a URL to log in to Website via Basic Auth with Robot Framework/Selenium (Python)

AG Grid 在上下文菜单中以大文本形式打开

发表评论