February 9, 2023, 03:20:06

Open all the files in a directory and pass each of them to separate lists

Question


This is my current code.

my_file = open("/content/txts/txt1.txt", "r")
data = my_file.read()
l1 = clean_data(data)
my_file = open("/content/txts/txt2.txt", "r")
data = my_file.read()
l2 = clean_data(data)
my_file = open("/content/txts/txt3.txt", "r")
data = my_file.read()
l3 = clean_data(data)
my_file = open("/content/txts/txt4.txt", "r")
data = my_file.read()
l4 = clean_data(data)

But I don't want to apply the same functions over and over again.
To create a separate list for each of my txt files, I have tried an alternative:

import os
pathToFolder = '/content/txts'
fileList = os.listdir(pathToFolder)
dataDict = {}
for i in range(len(fileList)-1):
    with open(fileList[i], "r") as f:
        data = f.read()
        dataDict['l' + str(i)] = clean_data(data)
        f.close()

But I get an error. This is my txts folder.

Answer 1

Score: 0


Use os.listdir() to list the files in the folder, then loop through the list. os.listdir() returns bare file names, so each one has to be joined with the folder path before it is opened. If you process the folder in several passes, save the index of the file you left off on so you can resume from the same place. The following code loops from that index to the end of the folder and saves all the cleaned data into a dictionary whose keys are named like the variables in your question.

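import os

pathToFolder = '/content/txts'  # folder from the question
WhereYouLeftOffAt = 0  # index of the first file to process; 0 starts from the beginning
# os.listdir() returns names in arbitrary order, so sort them to make the index meaningful
fileList = sorted(os.listdir(pathToFolder))
dataDict = {}
# range() stops at len(fileList), not len(fileList)-1, so the last file is included
for i in range(WhereYouLeftOffAt, len(fileList)):
    # join the bare file name with the folder path before opening it
    with open(os.path.join(pathToFolder, fileList[i]), "r") as f:
        data = f.read()
    # clean_data is the function from the question; the with block closes the file
    dataDict['l' + str(i)] = clean_data(data)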
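As an alternative to indexing into os.listdir(), the same idea can be sketched with pathlib, which yields full paths and so avoids the path-joining step. This is only a sketch under stated assumptions: the clean_data below is a hypothetical stand-in for the function in the question (which is not shown), load_cleaned is a made-up helper name, and the demo runs on a temporary folder that mimics /content/txts rather than on the real one.

```python
import tempfile
from pathlib import Path


def clean_data(text):
    """Hypothetical stand-in for the asker's clean_data(): lowercase words."""
    return text.lower().split()


def load_cleaned(folder, pattern="*.txt"):
    """Return {file stem: cleaned data} for every matching file in folder."""
    cleaned = {}
    # Path.glob() yields full paths, so no manual joining is needed;
    # sorted() gives a stable, name-based order.
    for path in sorted(Path(folder).glob(pattern)):
        cleaned[path.stem] = clean_data(path.read_text())
    return cleaned


# Demo on a throwaway folder that mimics /content/txts.
with tempfile.TemporaryDirectory() as tmp:
    for n in range(1, 5):
        Path(tmp, f"txt{n}.txt").write_text(f"Sample TEXT {n}")
    data_by_file = load_cleaned(tmp)

print(sorted(data_by_file))   # ['txt1', 'txt2', 'txt3', 'txt4']
print(data_by_file["txt2"])   # ['sample', 'text', '2']
```

Keying the dictionary by file stem (txt1, txt2, ...) instead of by loop index keeps each list tied to its file even if files are added or removed later. With the real data, the call would be load_cleaned('/content/txts') using your own clean_data.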
