2023年8月9日 14:58:17go评论136阅读模式

英文:

Real time object detection with Yolo model not working

问题

我已经训练了一个自定义的YOLO模型，用于检测棋盘上的正方形槽，对图像的准确率超过95%。

但是一旦我切换到视频检测，似乎连一个物体都无法检测到。

我使用以下代码进行实时目标检测：

cap = cv2.VideoCapture('../video/1st/output.mp4')
while cap.isOpened():
    ret, frame = cap.read()
    results = model(frame)
    final_img = np.squeeze(results.render())
    cv2.imshow("YOLO", final_img)
    if cv2.waitKey(10) & 0XFF == ord("q"):
        break
cap.release()
cv2.destroyAllWindows()

我使用以下代码加载模型：

model = torch.hub.load("ultralytics/yolov5", "custom", path="yolov5/runs/train/exp36/weights/best.pt", force_reload=True)
model.conf = 0.6

我甚至尝试将可用的视频拆分成JPEG图像，然后在单独的图像上运行模型，保存输出，然后将输出图像合并成一个新的视频文件。

这个方法运行得很完美，所以模型确实能够检测到一些物体。

但一旦切换到视频，似乎什么都检测不到。

英文:

I have trained a custom yolo model to detect square slots on a board, and is working with more than 95 % accuracy on images.

But as soon as I switch to video detection it seems to not detect even a single thing

I am using the following code to run real time object detection

cap = cv2.VideoCapture(&#39;../video/1st/output.mp4&#39;)
while cap.isOpened():
    ret, frame = cap.read()
    results = model(frame)
    final_img = np.squeeze(results.render())
    cv2.imshow(&quot;YOLO&quot;, final_img)
    if cv2.waitKey(10) &amp; 0XFF == ord(&quot;q&quot;):
        break
cap.release()
cv2.destroyAllWindows()

I load the model using this code

model = torch.hub.load(&quot;ultralytics/yolov5&quot;, &quot;custom&quot;, path=&quot;yolov5/runs/train/exp36/weights/best.pt&quot;,force_reload=True)
model.conf = 0.6

I have even tried splitting the available video into jpegs and running the model on individual images, saving the output and then merging the output images into a new video file.

that works perfectly, so the model is detecting something.

but as soon as I switch to video it seems to go back to nothing.

答案1

得分: 0

是的，由于results.renders()返回的对象类型为none，您无法看到任何内容。
您可以像这样更改代码脚本

cap = cv2.VideoCapture('../video/1st/output.mp4')
while cap.isOpened():
    ret, frame = cap.read()
    results = model(frame)
    bboxes = results.xyxy[0].cpu().tolist()
    for bbox in bboxes:
        conf = f'{bbox[4]:.4f}' # 预测的置信度
        bbox = list(map(lambda x: int(x), bbox)) # 将浮点数转换为整数
        class_id = bbox[5] # 类别ID
        bbox = bbox[:4]
        cv2.rectangle(frame, (bbox[0], bbox[1]), (bbox[2], bbox[3]), color=(255, 255, 255), thickness=3)
    cv2.imshow("YOLO", frame)
    if cv2.waitKey(10) & 0XFF == ord("q"):
        break
cap.release()
cv2.destroyAllWindows()

并将帧写入视频。完整的代码应该如下所示

input_video_path = #输入您的视频路径
cap = cv2.VideoCapture(input_video_path)
fps = int(cap.get(cv2.CAP_PROP_FPS))
frame_width = int(cap.get(cv2.CAP_PROP_FRAME_WIDTH))
frame_height = int(cap.get(cv2.CAP_PROP_FRAME_HEIGHT))
output_video_path = 'output_video.mp4' # 输出视频路径
fourcc = cv2.VideoWriter_fourcc(*'XVID')
out = cv2.VideoWriter(output_video_path, fourcc, fps, (frame_width, frame_height))
model = #在此加载您的模型
while True:
    ret, frame = cap.read()
    if not ret:
        break
    results = model(frame)
    bboxes = results.xyxy[0].cpu().tolist()
    for bbox in bboxes:
        conf = f'{bbox[4]:.4f}' # 预测的置信度
        bbox = list(map(lambda x: int(x), bbox)) # 将浮点数转换为整数
        class_id = bbox[5] # 类别ID
        bbox = bbox[:4]
        cv2.rectangle(frame, (bbox[0], bbox[1]), (bbox[2], bbox[3]), color=(255, 255, 255), thickness=3)
    out.write(frame)
    if cv2.waitKey(10) & 0XFF == ord("q"):
        break
cap.release()
out.release()
# 关闭所有OpenCV窗口
cv2.destroyAllWindows()

参考文献：[https://docs.ultralytics.com/][1]，[https://docs.ultralytics.com/yolov5/][2]，[https://docs.ultralytics.com/yolov5/tutorials/pytorch_hub_model_loading/][3]

英文:

Yes, You cannot see anything due to the none type of object return in the results.renders()/
You can change the code script like this

cap = cv2.VideoCapture(&#39;../video/1st/output.mp4&#39;)
while cap.isOpened():
    ret, frame = cap.read()
    results = model(frame)
    bboxes = results.xyxy[0].cpu().tolist()
    for bbox in bboxes:
      conf = f&#39;{bbox[4]:.4f}&#39; #Confidance of that prediction
      bbox = list(map(lambda x: int(x), bbox)) #To convert float to integer
      class_id = bbox[5] #Class_id 
      bbox =bbox[:4] 
      cv2.rectangle(frame,(bbox[0],bbox[1]),(bbox[2],bbox[3]),color=(255,255,255),thickness=3)
    cv2.imshow(&quot;YOLO&quot;, frame)
    if cv2.waitKey(10) &amp; 0XFF == ord(&quot;q&quot;):
        break
cap.release()
cv2.destroyAllWindows()

and write the frames in the video
the full code should look like this

input_video_path = #Enter your video path
cap = cv2.VideoCapture(input_video_path)
fps = int(cap.get(cv2.CAP_PROP_FPS))
frame_width = int(cap.get(cv2.CAP_PROP_FRAME_WIDTH))
frame_height = int(cap.get(cv2.CAP_PROP_FRAME_HEIGHT))
output_video_path = &#39;output_video.mp4&#39; # Output video path
fourcc = cv2.VideoWriter_fourcc(*&#39;XVID&#39;)
out = cv2.VideoWriter(output_video_path, fourcc, fps, (frame_width, frame_height))
model = #Here load your model
while True:
    ret, frame = cap.read()
    if not ret:
      break
    results = model(frame)
    bboxes = results.xyxy[0].cpu().tolist()
    for bbox in bboxes:
      conf = f&#39;{bbox[4]:.4f}&#39; #Confidance of that prediction
      bbox = list(map(lambda x: int(x), bbox)) #To convert float to integer
      class_id = bbox[5] #Class_id 
      bbox =bbox[:4] 
      cv2.rectangle(frame,(bbox[0],bbox[1]),(bbox[2],bbox[3]),color=(255,255,255),thickness=3)
    out.write(frame)
    #cv2_imshow(frame)
    if cv2.waitKey(10) &amp; 0XFF == ord(&quot;q&quot;):
        break
cap.release()
out.release()
# Close all OpenCV windows
cv2.destroyAllWindows()

References:-
[https://docs.ultralytics.com/][1] <br>
[https://docs.ultralytics.com/yolov5/][2] <br>
[https://docs.ultralytics.com/yolov5/tutorials/pytorch_hub_model_loading/][3]<br>

通过集体智慧和协作来改善编程学习和解决问题的方式。致力于成为全球开发者共同参与的知识库，让每个人都能够通过互相帮助和分享经验来进步。

实时目标检测使用Yolo模型不起作用。

问题

答案1

获取DataFrame索引列的值

VScode不会计算抛出异常的测试（0/0个测试通过）

Python控制台脚本可执行文件在复制包后不存在。

将文本文件中较长的句子在逗号（多个逗号）处分成两部分。

如何在Playwright视觉比较中屏蔽多个定位器？

在C++中，可以使用可变模板参数来检索类型的内部类型。

selenium.common.exceptions.StaleElementReferenceException: Message: stale element reference: stale element not found

Creating and opening a URL to log in to Website via Basic Auth with Robot Framework/Selenium (Python)

AG Grid 在上下文菜单中以大文本形式打开

What's the correct way to type hint an empty list as a literal in python?

如何在Highcharts Gantt中更改本地化的星期名称

如何在同一个流中使用多个过滤器和映射函数？

如何使用Map/Set来将代码优化到O(n)？

.NET MAUI Android在GitHub Actions上构建失败，错误代码为1。