2023年2月8日 22:37:25go评论119阅读模式

英文:

use the result of YOLOv8 for pyzbar

问题

以下是翻译的代码部分：

我想将来自YOLOv8的结果传递给解码函数，以便从中读取条形码。
我的程序代码是：
    model = YOLO("yolov8n.pt")
    
    cap = cv2.VideoCapture(0)
    while True:
        ret, frame = cap.read()
        results = model.predict(source=frame, show=True, conf=0.70, stream=True, device=0)
        decode(results.numpy())
        if cv2.waitKey(10) & 0xFF == ord('q'):
            break
    cap.release()
    cv2.destroyAllWindows()
当我这样做时，我收到以下错误消息：
    AttributeError: 'generator' object has no attribute 'numpy'
此外，我想使用kraken.binarization.nlbin()对帧进行预处理，这是可能的吗？如果可以的话，应该如何操作？

英文:

I want to pass the result from the YOLOv8 to the decode function so that the barcodes are read from it.

My program code is:

model = YOLO(&quot;yolov8n.pt&quot;)
cap = cv2.VideoCapture(0)
while True:
    ret, frame = cap.read()
    results = model.predict(source=frame, show=True, conf=0.70, stream=True, device=0)
    decode(results.numpy())
    if cv2.waitKey(10) &amp; 0xFF == ord(&#39;q&#39;):
        break
cap.release()
cv2.destroyAllWindows()

When I do this, I get the following error message:

AttributeError: &#39;generator&#39; object has no attribute &#39;numpy&#39;

Additionally I want to preprocess the frame with kraken.binarization.nlbin() is this possible, if so how?

答案1

得分: 0

如果您阅读Ultralytics的预测文档，您会发现返回结果中不包含任何图像。

您需要自定义您的预测器以返回原始图像，以便您可以使用results中存在的边界框来裁剪图像。然后，您可以将裁剪后的图像传递给decode函数：

import cv2
from ultralytics.yolo.engine.model import YOLO
from pyzbar.pyzbar import decode
    
def on_predict_batch_end(predictor):
    # results -&gt; List[batch_size]
    path, im, im0s, vid_cap, s = predictor.batch
    predictor.results = zip(predictor.results, im0s)
         
model = YOLO(&quot;yolov8n.pt&quot;)
model.add_callback(&quot;on_predict_batch_end&quot;, on_predict_batch_end)
results = model.predict(source=&quot;0&quot;, show=True, stream=True)
for i, (result, im0) in enumerate(results):
    boxes = result.boxes
    for box in boxes:
        xyxy = box.xyxy[0]  # get box coordinates in (top, left, bottom, right) format
        t = int(xyxy[0].item())
        l = int(xyxy[1].item())
        b = int(xyxy[2].item())
        r = int(xyxy[3].item())
        crop_img = im0[l:r, t:b]
        d = decode(crop_img)
        print(d)
        cv2.imshow(&#39;YOLO V8 crop&#39;, crop_img)

这给了我以下输出（我手机屏幕上有一个QR码），我已经对其进行了匿名处理：

0: 480x640 2 persons, 1 cell phone, 9.7ms
[Decoded(data=b&#39;https://wa.me/qr/XXXXXXXXXXXXXX&#39;, type=&#39;QRCODE&#39;, rect=Rect(left=105, top=248, width=90, height=95), polygon=[Point(x=105, y=343), Point(x=193, y=341), Point(x=195, y=251), Point(x=111, y=248)], quality=1, orientation=None)]

英文:

If you read the documentation for Ultralytics' predict you will see that return does not contain any image.

You have to customize your predictor to return the original image so that you can use the bboxes present in results in order to crop the image. Then you can pass the crops to decode:

import cv2
from ultralytics.yolo.engine.model import YOLO
from pyzbar.pyzbar import decode
    
def on_predict_batch_end(predictor):
    # results -&gt; List[batch_size]
    path, im, im0s, vid_cap, s = predictor.batch
    predictor.results = zip(predictor.results, im0s)
         
model = YOLO(&quot;yolov8n.pt&quot;)
model.add_callback(&quot;on_predict_batch_end&quot;, on_predict_batch_end)
results = model.predict(source=&quot;0&quot;, show=True, stream=True)
for i, (result, im0) in enumerate(results):
    boxes = result.boxes
    for box in boxes:
        xyxy = box.xyxy[0]  # get box coordinates in (top, left, bottom, right) format
        t = int(xyxy[0].item())
        l = int(xyxy[1].item())
        b = int(xyxy[2].item())
        r = int(xyxy[3].item())
        crop_img = im0[l:r, t:b]
        d = decode(crop_img)
        print(d)
        cv2.imshow(&#39;YOLO V8 crop&#39;, crop_img)

This gives me the following output (I had a QR code on my phone screen) which I anonymized for obvious reasons:

0: 480x640 2 persons, 1 cell phone, 9.7ms
[Decoded(data=b&#39;https://wa.me/qr/XXXXXXXXXXXXXX&#39;, type=&#39;QRCODE&#39;, rect=Rect(left=105, top=248, width=90, height=95), polygon=[Point(x=105, y=343), Point(x=193, y=341), Point(x=195, y=251), Point(x=111, y=248)], quality=1, orientation=None)]

通过集体智慧和协作来改善编程学习和解决问题的方式。致力于成为全球开发者共同参与的知识库，让每个人都能够通过互相帮助和分享经验来进步。

使用YOLOv8的结果进行pyzbar操作

问题

答案1

根据另一列中的前一行对 Pandas DataFrame 进行排序

尝试使用Python对使用SQLite 3创建的数据库进行详细验证。

Set declined to True if approved is False

抓取隐藏页面，如果搜索结果多于显示的结果。

如何在Playwright视觉比较中屏蔽多个定位器？

在C++中，可以使用可变模板参数来检索类型的内部类型。

selenium.common.exceptions.StaleElementReferenceException: Message: stale element reference: stale element not found

Creating and opening a URL to log in to Website via Basic Auth with Robot Framework/Selenium (Python)

AG Grid 在上下文菜单中以大文本形式打开

What's the correct way to type hint an empty list as a literal in python?

如何在Highcharts Gantt中更改本地化的星期名称

如何在同一个流中使用多个过滤器和映射函数？

如何使用Map/Set来将代码优化到O(n)？

.NET MAUI Android在GitHub Actions上构建失败，错误代码为1。