2023年6月19日 00:56:44go评论120阅读模式

英文:

Filtering numpy arrays based on the index of certain value

问题

我有一个类似的numpy数组：

array([[ 1, 17, 33, ..., 28,  9, 22],
       [ 3, 11,  1, ..., 25, 45, 14],
       [ 3, 11,  1, ..., 21, 23,  5],
       ...,
       [20,  6, 27, ..., 43, 15, 14],
       [27,  6, 39, ..., 37, 17,  2],
       [ 3, 11,  8, ..., 27, 35, 32]], dtype=int32)

从这里，我想要筛选出在索引10或之前出现值4的行。

例如：
[1, 2, 3, 4, ..., 34, 35] - 筛选，因为值4在索引3之前出现，即在索引10之前。

例如：
[35, 34, 33, 32..., 4, 3, 2, 1] - 保留，因为值4在索引10之后出现。

使用numpy掩码，可以实现这种筛选的方式是什么？

英文:

I have a numpy array like:

array([[ 1, 17, 33, ..., 28,  9, 22],
       [ 3, 11,  1, ..., 25, 45, 14],
       [ 3, 11,  1, ..., 21, 23,  5],
       ...,
       [20,  6, 27, ..., 43, 15, 14],
       [27,  6, 39, ..., 37, 17,  2],
       [ 3, 11,  8, ..., 27, 35, 32]], dtype=int32)

From here, I would like to filter out rows where value 4 is occurring at index of 10 or before.

e.g.
[1, 2, 3, 4, ..., 34, 35] - filter, as value 4 is occurring at index3, which is before index 10

e.g.
[35, 34, 33, 32..., 4, 3, 2, 1] - keep, as value 4 is occurring after index10.

what would be the way to achieve this filtering using numpy masking?

答案1

得分: 2

这是您要翻译的代码部分：

arr = np.array(
    [[1, 2, 3, 4, 34, 35, 2, 1],
     [1, 2, 3, 5, 6, 7, 8, 9],
     [1, 2, 4, 5, 6, 7, 8, 9],
     [4, 2, 3, 5, 6, 7, 8, 9],
     [35, 34, 33, 32, 4, 3, 2, 1]]
)
V, T = 4, 3 # &lt;-- change the threshold to 10
m = np.any(arr[:, :T+1] == V, axis=1)
out = arr[~m]
Output :
print(out)
array([[ 1,  2,  3,  5,  6,  7,  8,  9],
       [35, 34, 33, 32,  4,  3,  2,  1]])

Intermediates :

&gt;&gt;&gt; arr[:, :T+1]
array([[ 1,  2,  3,  4],
       [ 1,  2,  3,  5],
       [ 1,  2,  4,  5],
       [ 4,  2,  3,  5],
       [35, 34, 33, 32]])
    
&gt;&gt;&gt; arr[:, :T+1] == V
array([[False, False, False,  True],
       [False, False, False, False],
       [False, False,  True, False],
       [ True, False, False, False],
       [False, False, False, False]])
    
&gt;&gt;&gt; np.any(arr[:, :T+1] == V, axis=1)
array([ True, False,  True,  True, False])

请注意，代码部分已经被复制到翻译中。

英文:

You can try this :

arr = np.array(
    [[1, 2, 3, 4, 34, 35, 2, 1],
     [1, 2, 3, 5, 6, 7, 8, 9],
     [1, 2, 4, 5, 6, 7, 8, 9],
     [4, 2, 3, 5, 6, 7, 8, 9],
     [35, 34, 33, 32, 4, 3, 2, 1]]
)
V, T = 4, 3 # &lt;-- change the threshold to 10
m = np.any(arr[:, :T+1] == V, axis=1)
out = arr[~m]

Output :

print(out)
array([[ 1,  2,  3,  5,  6,  7,  8,  9],
       [35, 34, 33, 32,  4,  3,  2,  1]])

Intermediates :

&gt;&gt;&gt; arr[:, :T+1]
array([[ 1,  2,  3,  4],
       [ 1,  2,  3,  5],
       [ 1,  2,  4,  5],
       [ 4,  2,  3,  5],
       [35, 34, 33, 32]])
&gt;&gt;&gt; arr[:, :T+1] == V
array([[False, False, False,  True],
       [False, False, False, False],
       [False, False,  True, False],
       [ True, False, False, False],
       [False, False, False, False]])
&gt;&gt;&gt; np.any(arr[:, :T+1] == V, axis=1)
array([ True, False,  True,  True, False])

答案2

得分: -1

以下是翻译后的代码部分：

import numpy as np
np.random.seed(0)
vals = np.random.choice(50, size=(10, 100))
print(vals[:,:10])
print(vals.shape)
vals = vals[[4 not in i for i in vals[:,:10]]]
print(vals.shape)
print(vals[:,:10])

应该输出：

[[44 47  0  3  3 39  9 19 21 36]
 [ 5 41 35  0 31  5 30  0 49 36]
 [24 15 41 18 40 15 11 38 47 29]
 [ 2  5 37 12 44  2 47 27 21 39]
 [42 48 30 16 26 35 49 42  9 44]
 [ 3 34 40 33 28  4 26 32 45  9]
 [41 38 43 18  7 28  1 41  2 28]
 [36 49 24 33 18 33 14 49  7 43]
 [ 6 27 35  6 19 34 38 20 43  0]
 [ 9 20 37 48 17  9 44 15 38 14]]
(10, 100)
(9, 100)
[[44 47  0  3  3 39  9 19 21 36]
 [ 5 41 35  0 31  5 30  0 49 36]
 [24 15 41 18 40 15 11 38 47 29]
 [ 2  5 37 12 44  2 47 27 21 39]
 [42 48 30 16 26 35 49 42  9 44]
 [41 38 43 18  7 28  1 41  2 28]
 [36 49 24 33 18 33 14 49  7 43]
 [ 6 27 35  6 19 34 38 20 43  0]
 [ 9 20 37 48 17  9 44 15 38 14]]

英文:

Here's one way to do it, the key bit is [4 not in i for i in vals[:,:10]] which creates a mask by iterating through a view of the first 10 elements of each row.

import numpy as np
np.random.seed(0)
vals = np.random.choice(50, size=(10, 100))
print(vals[:,:10])
print(vals.shape)
vals = vals[[4 not in i for i in vals[:,:10]]]
print(vals.shape)
print(vals[:,:10])

which should yield:

[[44 47  0  3  3 39  9 19 21 36]
[ 5 41 35  0 31  5 30  0 49 36]
[24 15 41 18 40 15 11 38 47 29]
[ 2  5 37 12 44  2 47 27 21 39]
[42 48 30 16 26 35 49 42  9 44]
[ 3 34 40 33 28  4 26 32 45  9]
[41 38 43 18  7 28  1 41  2 28]
[36 49 24 33 18 33 14 49  7 43]
[ 6 27 35  6 19 34 38 20 43  0]
[ 9 20 37 48 17  9 44 15 38 14]]
(10, 100)
(9, 100)
[[44 47  0  3  3 39  9 19 21 36]
[ 5 41 35  0 31  5 30  0 49 36]
[24 15 41 18 40 15 11 38 47 29]
[ 2  5 37 12 44  2 47 27 21 39]
[42 48 30 16 26 35 49 42  9 44]
[41 38 43 18  7 28  1 41  2 28]
[36 49 24 33 18 33 14 49  7 43]
[ 6 27 35  6 19 34 38 20 43  0]
[ 9 20 37 48 17  9 44 15 38 14]]

通过集体智慧和协作来改善编程学习和解决问题的方式。致力于成为全球开发者共同参与的知识库，让每个人都能够通过互相帮助和分享经验来进步。

根据特定值的索引筛选numpy数组

问题

答案1

答案2

How to redirect User to login Screen when not logged in for a Flask App with Azure authorization?

Django动态表单集更新视图未更新。

PIL.Image.open() 仅提供文件名时搜索图像的位置在哪里？

Download a .csv file using requests.get() in Python

如何在Playwright视觉比较中屏蔽多个定位器？

在C++中，可以使用可变模板参数来检索类型的内部类型。

selenium.common.exceptions.StaleElementReferenceException: Message: stale element reference: stale element not found

Creating and opening a URL to log in to Website via Basic Auth with Robot Framework/Selenium (Python)

AG Grid 在上下文菜单中以大文本形式打开

What's the correct way to type hint an empty list as a literal in python?

如何在Highcharts Gantt中更改本地化的星期名称

如何在同一个流中使用多个过滤器和映射函数？

如何使用Map/Set来将代码优化到O(n)？

.NET MAUI Android在GitHub Actions上构建失败，错误代码为1。