2023年2月16日 04:14:00go评论63阅读模式

英文:

EasyOCR - Batch processing images with Python

问题

I am attempting to write a bit of python that uses EasyOCR to write the numbers it sees in the images into a text file. My goal is to batch process all images in a directory, rather than a single images at a time, as I have several thousand images to process.

The python code:

import cv2
import os
import io

reader = easyocr.Reader(['en'])

for image_name in os.listdir("ocr-source"):
        image = cv2.imread(f'ocr-source/{image_name}')
        result = reader.readtext(image, allowlist='0123456789', detail=0)
        
print(image_name, " ", result, file=open('output.txt', 'w'))

My test ocr-source directory contains about 10 images.

The resulting output.txt file only contains the results from a single image.

How to I get it to properly iterate through the entire directory?

英文:

The python code:

import cv2
import os
import io

reader = easyocr.Reader([&#39;en&#39;])

for image_name in os.listdir(&quot;ocr-source&quot;):
        image = cv2.imread(f&#39;ocr-source/{image_name}&#39;)
        result = reader.readtext(image, allowlist=&#39;0123456789&#39;, detail=0)
        
print(image_name, &quot; &quot;, result, file=open(&#39;output.txt&#39;, &#39;w&#39;))

My test ocr-source directory contains about 10 images.

The resulting output.txt file only contains the results from a single image.

How to I get it to properly iterate through the entire directory?

答案1

得分: 1

EasyOCR似乎最近支持了批量推断：https://github.com/JaidedAI/EasyOCR/pull/458

英文:

Easyocr seem to have recently supported batch inference:
https://github.com/JaidedAI/EasyOCR/pull/458

答案2

得分: 0

Simple fix: Instead of writing over the file each loop, I needed to append.

import cv2
import os
import io

reader = easyocr.Reader(['en'])

for image_name in os.listdir("ocr-source"):
    image = cv2.imread(f'ocr-source/{image_name}')
    result = reader.readtext(image, allowlist='0123456789', detail=0)

print(image_name, " ", result, file=open('output.txt', 'a'))

Note the 'a' in the print call.

英文:

Simple fix: Instead of writing over the file each loop, I needed to append.

import cv2
import os
import io

reader = easyocr.Reader([&#39;en&#39;])

for image_name in os.listdir(&quot;ocr-source&quot;):
        image = cv2.imread(f&#39;ocr-source/{image_name}&#39;)
        result = reader.readtext(image, allowlist=&#39;0123456789&#39;, detail=0)
        
print(image_name, &quot; &quot;, result, file=open(&#39;output.txt&#39;, &#39;a&#39;))

Note the 'a' in the print call

通过集体智慧和协作来改善编程学习和解决问题的方式。致力于成为全球开发者共同参与的知识库，让每个人都能够通过互相帮助和分享经验来进步。

EasyOCR – 使用Python批处理图像

问题

答案1

答案2

为什么我尝试添加新字典时，列表中的先前字典会发生变化？

python字符串解析问题在将SQL命令保存到文件时

如何在Docker容器内使用正确的安全证书下载NLTK包？

SQLAlchemy：在Python中指定连接URL的正确方法是什么？

What's the correct way to type hint an empty list as a literal in python?

如何在Highcharts Gantt中更改本地化的星期名称

如何在同一个流中使用多个过滤器和映射函数？

如何使用Map/Set来将代码优化到O(n)？

.NET MAUI Android在GitHub Actions上构建失败，错误代码为1。

如何在Playwright视觉比较中屏蔽多个定位器？

在C++中，可以使用可变模板参数来检索类型的内部类型。

selenium.common.exceptions.StaleElementReferenceException: Message: stale element reference: stale element not found

Creating and opening a URL to log in to Website via Basic Auth with Robot Framework/Selenium (Python)

AG Grid 在上下文菜单中以大文本形式打开

发表评论