2023年3月4日 05:32:27go评论104阅读模式

英文:

How to drop a row in Pandas if a third letter in a column is W?

问题

我有这样的数据框：

ID	Jan	Feb	Mar
20WAST	2	2	5
20S22	0	0	1
20W1ST	2	2	5
200122	0	0	1

我想要删除所有第一列中第三个字母是 'W' 的行，以获得以下输出：

ID	Jan	Feb	Mar
20S22	0	0	1
200122	0	0	1

这是一个非常大的数据框，我尝试了类似这样的操作：

df[df.ID.str[2] != 'W']

但这只选择了第二行的项。我可能可以遍历数据框，但想看看是否有更好的选项。

英文:

I have a dataframe of this kind:

ID	Jan	Feb	Mar
20WAST	2	2	5
20S22	0	0	1
20W1ST	2	2	5
200122	0	0	1

And I want to drop all the rows where the third letter in the first column is a 'W' to give an output:

ID	Jan	Feb	Mar
20S22	0	0	1
200122	0	0	1

It is a very large dataframe and I tried doing something like this:

df[df.ID[2] != &#39;W&#39;]

But this only selects the item in the second row. I could potentially iterate over the dataframe but wanted to see if there was a better option.

答案1

得分: 4

df = df[df['ID'].str[2].ne('W')]
在进行此选择后，您可能需要重置索引。

英文:

You are almost there. Use:

df= df[df[&#39;ID&#39;].str[2].ne(&#39;W&#39;)]

you might want to reset the index after this selection

答案2

得分: 0

你可以使用正则表达式来查找第3个字符。

out = df[df['ID'].str.contains('^.{2}(?!W)')]
# 或者
out = df[df['ID'].str.match('.{2}(?!W)')]
# 或者
out = df[df['ID'].str.match('.{2}[^W]')]

注意：str.contains 和 str.match 之间的区别是 str.match 从目标字符串的开头匹配。

print(out)
       ID  Jan  Feb  Mar
1   20S22    0    0    1
3  200122    0    0    1

英文:

You can use regex to find the 3rd character

out = df[df[&#39;ID&#39;].str.contains(&#39;^.{2}(?!W)&#39;)]
# or
out = df[df[&#39;ID&#39;].str.match(&#39;.{2}(?!W)&#39;)]
# or
out = df[df[&#39;ID&#39;].str.match(&#39;.{2}[^W]&#39;)]

NOTE: difference between str.contains and str.match is that str.match match the string from beginning of the target.

$ print(out)
       ID  Jan  Feb  Mar
1   20S22    0    0    1
3  200122    0    0    1

通过集体智慧和协作来改善编程学习和解决问题的方式。致力于成为全球开发者共同参与的知识库，让每个人都能够通过互相帮助和分享经验来进步。

在Pandas中如何删除包含第三个字母为W的行？

问题

答案1

答案2

如何使用SQLAlchemy Connection.execute()传递多个参数给INSERT INTO … VALUES？

Updating existing Excel file with Pandas and Openpyxl throws an AttributeError: property 'book' of 'OpenpyxlWriter' object has no setter

如何更改QtDesigner中QDateEdit对象中箭头的外观（例如颜色）？

如何使用Python绘制图表？

如何在Playwright视觉比较中屏蔽多个定位器？

在C++中，可以使用可变模板参数来检索类型的内部类型。

selenium.common.exceptions.StaleElementReferenceException: Message: stale element reference: stale element not found

Creating and opening a URL to log in to Website via Basic Auth with Robot Framework/Selenium (Python)

AG Grid 在上下文菜单中以大文本形式打开

What's the correct way to type hint an empty list as a literal in python?

如何在Highcharts Gantt中更改本地化的星期名称

如何在同一个流中使用多个过滤器和映射函数？

如何使用Map/Set来将代码优化到O(n)？

.NET MAUI Android在GitHub Actions上构建失败，错误代码为1。