2023年7月23日 13:31:35go评论97阅读模式

英文:

Why is the image being partially processed?

问题

我已经找到问题所在。是我读取图像的方式有问题。

应该改成：

img = cv2.imread(img_path, cv2.IMREAD_GRAYSCALE)

现在它可以正常工作了，尽管由于某种原因，它的运行时间是我另一个执行相同操作的脚本的10倍... 嗯...

英文:

It is been hours writing scripts and I think I am tired overlooking something simple.
I have the following pycuda script

import cv2
import numpy as np
import time
import pycuda.autoinit
import pycuda.driver as cuda
from pycuda.compiler import SourceModule
import pycuda.gpuarray as gpuarray
def apply_threshold(img_src,img_width, img_height, img_dest, mythreshold):
    mod = SourceModule(&quot;&quot;&quot;
        __global__ void ThresholdKernel(
            const int src_sizeX,  //&lt; source image size. x: width,
            const unsigned char* src,   //&lt; source image pointer
            const int dst_sizeX,  //&lt; destination image size. x: width, y: height
            const int dst_sizeY,
            unsigned char* dst,         //&lt; destination image pointer
            const int mythreshold) {
                int col = blockIdx.x * blockDim.x + threadIdx.x;
                int row = blockIdx.y * blockDim.y + threadIdx.y;
                if (dst_sizeX &lt;= col || dst_sizeY &lt;= row) return;
                auto src_val = src[row * src_sizeX + col];
                unsigned char dst_val = src_val &gt; mythreshold ? 255 : 0;
                dst[row * dst_sizeX + col] = dst_val;
            }
    &quot;&quot;&quot;)
    block_dim =(32,8,1)
    grid_dim_x = (img_width + block_dim[0] -1) // block_dim[0]
    grid_dim_y = (img_width + block_dim[1] -1) // block_dim[1]
    print(grid_dim_x,grid_dim_y)
    
    thresholdkernel = mod.get_function(&quot;ThresholdKernel&quot;)
    thresholdkernel(np.int32(img_width), img_src, np.int32(img_width),np.int32(img_height), 
                    img_dest,np.int32(mythreshold),
                    block = block_dim , grid = (grid_dim_x,grid_dim_y))
    
mythreshold = 128
img_path = &quot;../images/lena_gray.png&quot;
img = cv2.imread(img_path)
if img is None:
    print(&quot;Image not found&quot;)
    exit()
else:
    height,width,channels = img.shape
    print(&quot;Hegiht, width and channels&quot;,height,width,channels)
    print(type(width))
img_gpu = cuda.mem_alloc(img.nbytes)
cuda.memcpy_htod(img_gpu,img)
dtype=img.dtype
# dest_img=gpuarray.empty_like(img.shape,dtype=dtype)
dest_img = cuda.mem_alloc(img.nbytes)
apply_threshold(img_gpu,width,height,dest_img  ,mythreshold )
image_result= np.empty_like(img)
cuda.memcpy_dtoh(image_result,dest_img )
cv2.imshow(&quot;Original image&quot;,img)
cv2.imshow(&quot;Thresholded&quot;,image_result)
cv2.waitKey(0)
cv2.destroyAllWindows()

When I run it I get a binarized picture but this one

What am I overlooking that makes the kernel only process part of the image? It must be something really simple

EDIT: I found the problem. The way I am reading the image.

It should be

img = cv2.imread(img_path,cv2.IMREAD_GRAYSCALE)

Now it works, although for some reason it takes 10 times the time of a similar script I have that does the same... well...

答案1

得分: 1

我猜测这是因为你在grid_dim_x和grid_dim_y都使用了img_width。但你可能想要将img_height用于grid_dim_y。

试试这个：

grid_dim_x = (img_width + block_dim[0] -1) // block_dim[0]
grid_dim_y = (img_height + block_dim[1] -1) // block_dim[1]

英文:

I assume it's because you are using img_width for both grid_dim_x and grid_dim_y. But you probably meant to use img_height for grid_dim_y.

Give this a shot:

grid_dim_x = (img_width + block_dim[0] -1) // block_dim[0]
grid_dim_y = (img_height + block_dim[1] -1) // block_dim[1]

通过集体智慧和协作来改善编程学习和解决问题的方式。致力于成为全球开发者共同参与的知识库，让每个人都能够通过互相帮助和分享经验来进步。

图像为什么只被部分处理？

问题

答案1

SimpleITK的`sitk.ConnectedThresholdImageFilter()`输出错误。

Pytest-xdist: 所有工作进程完成后的 tearDown

在Django中未通过外键关系获取特定对象。

librosa中的y轴频谱图未正确对齐。

如何在Playwright视觉比较中屏蔽多个定位器？

在C++中，可以使用可变模板参数来检索类型的内部类型。

selenium.common.exceptions.StaleElementReferenceException: Message: stale element reference: stale element not found

Creating and opening a URL to log in to Website via Basic Auth with Robot Framework/Selenium (Python)

AG Grid 在上下文菜单中以大文本形式打开

What's the correct way to type hint an empty list as a literal in python?

如何在Highcharts Gantt中更改本地化的星期名称

如何在同一个流中使用多个过滤器和映射函数？

如何使用Map/Set来将代码优化到O(n)？

.NET MAUI Android在GitHub Actions上构建失败，错误代码为1。