2023年6月22日 04:57:50go评论99阅读模式

英文:

Find an empty space in a binary image that can fit a shape

问题

我有这张图片
我需要找到一个能容纳这个形状的空白区域
以便最终结果类似于这样

英文:

I have this image
 

 
I need to find an empty area that can fit this shape
 

 
so that the end result is something like this

答案1

得分: 2

以下是对该代码部分的翻译：

这是一个简单但天真的解决此问题的方法。正如UnquoteQuote所提到的，它使用了2D卷积。

import random
import numpy as np
import scipy.signal as sig
import matplotlib.pyplot as plt
import cv2
# 载入图像
image = (cv2.imread('image.jpg').mean(axis=2) > 127).astype(np.float32)
shape = (cv2.imread('shape.jpg').mean(axis=2) > 127).astype(np.float32)
# 执行2D卷积
conv = sig.convolve2d(image, shape, mode='valid')
solutions = np.where(conv == 0)
# 绘制一些解决方案
plt.figure(figsize=(16, 4))
for i in range(4):
    r = random.randint(0, solutions[0].shape[0] - 1)
    x, y = solutions[0][r], solutions[1][r]
    solution_plot = np.zeros((*image.shape, 3))
    solution_plot[:, :, 0] = image
    solution_plot[x:x + shape.shape[0], y:y + shape.shape[1], 1] = shape
    plt.subplot(1, 4, i + 1)
    plt.imshow(solution_plot)
plt.show()

示例结果：

此算法找到了所有可能的解决方案。如果您只需要一个解决方案，可以优化它，以获取随机的(x, y)点，并执行形状和裁剪图像区域[x:x+shape_width, y:y+shape_height]的点积，以检查是否有空间，直到找到正确的点。

可以像这样执行：

while True:
    x = random.randint(0, image.shape[0] - shape.shape[0])
    y = random.randint(0, image.shape[1] - shape.shape[1])
    if np.sum(shape*image[x:x + shape.shape[0], y:y + shape.shape[1]]) == 0:
        break
# x, y 是解决方案

与卷积相比，这个方法要快得多（但这取决于解决方案的数量）：

卷积：6.65 秒 ± 21.4 毫秒每次循环（7次运行的平均值 ± 标准偏差，每次循环1次）
随机搜索：1.31 毫秒 ± 31.2 微秒每次循环（7次运行的平均值 ± 标准偏差，每次循环1000次）

英文:

Here is a simple yet naive solution to this problem. It uses 2D convolution as mentioned by UnquoteQuote.

import random
import numpy as np
import scipy.signal as sig
import matplotlib.pyplot as plt
import cv2
# load images
image = (cv2.imread(&#39;image.jpg&#39;).mean(axis=2) &gt; 127).astype(np.float32)
shape = (cv2.imread(&#39;shape.jpg&#39;).mean(axis=2) &gt; 127).astype(np.float32)
# perform 2D convolution
conv = sig.convolve2d(image, shape, mode=&#39;valid&#39;)
solutions = np.where(conv == 0)
# draw some solutions
plt.figure(figsize=(16, 4))
for i in range(4):
    r = random.randint(0, solutions[0].shape[0] - 1)
    x, y = solutions[0][r], solutions[1][r]
    solution_plot = np.zeros((*image.shape, 3))
    solution_plot[:, :, 0] = image
    solution_plot[x:x + shape.shape[0], y:y + shape.shape[1], 1] = shape
    plt.subplot(1, 4, i + 1)
    plt.imshow(solution_plot)
plt.show()

Example results:

This algorithm finds all possible solutions. If you only need one, you can optimize it so that it gets a random (x, y) point and perform dot product of the shape and a cropped image area [x:x+shape_width, y:y+shape_height] to check if there is a space until you find the right point.

This can be done for example like this:

while True:
    x = random.randint(0, image.shape[0] - shape.shape[0])
    y = random.randint(0, image.shape[1] - shape.shape[1])
    if np.sum(shape*image[x:x + shape.shape[0], y:y + shape.shape[1]]) == 0:
        break
# x, y is the solution

Compared to convolution this one is much faster (but it depends on the number of the solutions):

convolution: 6.65 s ± 21.4 ms per loop (mean ± std. dev. of 7 runs, 1 loop each)
random search: 1.31 ms ± 31.2 µs per loop (mean ± std. dev. of 7 runs, 1,000 loops each)

通过集体智慧和协作来改善编程学习和解决问题的方式。致力于成为全球开发者共同参与的知识库，让每个人都能够通过互相帮助和分享经验来进步。

寻找能够容纳形状的二进制图像中的空白空间。

问题

答案1

Python错误：当列表具有多个值时，列表赋值索引超出范围

HMAC在Python3和Golang中产生不同的字符串。

如何在对数刻度下显示所有主要和次要刻度标签

plt.plot(…)没有显示窗口。我错过了什么吗？

如何在Playwright视觉比较中屏蔽多个定位器？

在C++中，可以使用可变模板参数来检索类型的内部类型。

selenium.common.exceptions.StaleElementReferenceException: Message: stale element reference: stale element not found

Creating and opening a URL to log in to Website via Basic Auth with Robot Framework/Selenium (Python)

AG Grid 在上下文菜单中以大文本形式打开

What's the correct way to type hint an empty list as a literal in python?

如何在Highcharts Gantt中更改本地化的星期名称

如何在同一个流中使用多个过滤器和映射函数？

如何使用Map/Set来将代码优化到O(n)？

.NET MAUI Android在GitHub Actions上构建失败，错误代码为1。