2023年6月6日 09:51:02go评论67阅读模式

英文:

how to count failure occurrences in a column using pandas?

问题

我需要使用Python的pandas来以CSV格式整理测试结果。结果可以是"passed"或有时是"failed"。在我导入Python和我的代码之后，代码如下：

import pandas as pd
df = pd.read_csv('myfile.csv')
pass_res = df['Status'].value_counts()['passed']
fail_res = df['Status'].value_counts().get('failed', 0)

这段代码将在有失败的情况下运行。然而，如果没有失败，最后一行代码会导致错误。如何检查是否有失败，然后我将执行最后一行呢？

英文:

I need to use python's pandas to tabulate the test result in a csv format. The result could be "passsed" or sometime "failed". After I

    import python as pd,my code is:
    df = pd.read_csv(&#39;myfile.csv&#39;)
    pass_res =df[&#39;Status&#39;].value_counts()[&#39;passed&#39;]
    fail_res =df[&#39;Status&#39;].value_counts()[&#39;failed&#39;]

this code will work if there IS a case of fail. However, when there is no failure, the last line of code will cause an error. How do check, if there is a failure, then I will execute my last line.

答案1

得分: 2

使用Series.get来提取找到的值，否则返回0。

s = df['Status'].value_counts()

passed = s.get('passed', 0)
failed = s.get('failed', 0)

英文:

Lets use Series.get to yank the value if found otherwise return 0

s = df[&#39;Status&#39;].value_counts()

passed = s.get(&#39;passed&#39;, 0)
failed = s.get(&#39;failed&#39;, 0)

答案2

得分: 2

以下是代码的翻译部分：

# 示例
df = pd.DataFrame({'Status': ['passed']*5 + ['other']*3})

status = pd.CategoricalDtype(['passed', 'failed'], ordered=True)
passed, failed = df['Status'].astype(status).value_counts().sort_index()

输出：

>>> passed
5

>>> failed
0

>>> df['Status'].astype(status).value_counts().sort_index()
Status
passed    5
failed    0
Name: count, dtype: int64

>>> df
   Status
0  passed
1  passed
2  passed
3  passed
4  passed
5   other
6   other
7   other

请注意，上述内容只是代码的翻译，不包括问题的回答。

英文:

You can also add a CategoricalDType as value_counts returns all observed:

# sample
df = pd.DataFrame({&#39;Status&#39;: [&#39;passed&#39;]*5 + [&#39;other&#39;]*3})

status = pd.CategoricalDtype([&#39;passed&#39;, &#39;failed&#39;], ordered=True)
passed, failed = df[&#39;Status&#39;].astype(status).value_counts().sort_index()

Output:

&gt;&gt;&gt; passed
5

&gt;&gt;&gt; failed
0

&gt;&gt;&gt; df[&#39;Status&#39;].astype(status).value_counts().sort_index()
Status
passed    5
failed    0
Name: count, dtype: int64

&gt;&gt;&gt; df
   Status
0  passed
1  passed
2  passed
3  passed
4  passed
5   other
6   other
7   other

答案3

得分: 1

与@Corralien的方法类似，但使用reindex函数：

df['Status'].value_counts(sort=False).reindex(['passed', 'failed'], fill_value=0)

一次性定义变量：

passed, failed = (df['Status'].value_counts(sort=False)
                  .reindex(['passed', 'failed'], fill_value=0)
                 )

英文:

Similar to @Corralien's but with reindex:

df[&#39;Status&#39;].value_counts(sort=False).reindex([&#39;passed&#39;, &#39;failed&#39;], fill_value=0)

Defining the variables in one shot:

passed, failed = (df[&#39;Status&#39;].value_counts(sort=False)
                  .reindex([&#39;passed&#39;, &#39;failed&#39;], fill_value=0)
                 )

通过集体智慧和协作来改善编程学习和解决问题的方式。致力于成为全球开发者共同参与的知识库，让每个人都能够通过互相帮助和分享经验来进步。

如何使用pandas计算列中的失败次数？

问题

答案1

答案2

答案3

如何使用pytz和datetime库将具有区域/城市时区格式的时间字符串转换为UTC？

如何使用Selenium在Chrome中禁用“某个网站想要打开此应用程序”的警告？

为什么调用不同数值的time.sleep会改变与sleep无关的部分的执行时间？

Lambda函数同时包含if和for循环

What's the correct way to type hint an empty list as a literal in python?

如何在Highcharts Gantt中更改本地化的星期名称

如何在同一个流中使用多个过滤器和映射函数？

如何使用Map/Set来将代码优化到O(n)？

.NET MAUI Android在GitHub Actions上构建失败，错误代码为1。

如何在Playwright视觉比较中屏蔽多个定位器？

在C++中，可以使用可变模板参数来检索类型的内部类型。

selenium.common.exceptions.StaleElementReferenceException: Message: stale element reference: stale element not found

Creating and opening a URL to log in to Website via Basic Auth with Robot Framework/Selenium (Python)

AG Grid 在上下文菜单中以大文本形式打开

发表评论