2023年8月10日 15:53:33go评论144阅读模式

英文:

Apply highlight to pivot_table

问题

我通过pivot_table函数获取DataFrame：

df2 = pd.pivot_table(df, values=['labor costs'],
                     index=['Division', 'Performer'],
                     columns=['completed on time'], aggfunc=[np.sum, len], margins=True, fill_value=0)

如何根据条件突出显示行：on schedule == 0 and overdue == 0，就像上面的表格一样？

我这样做：

def apply_colors(df_slice: pd.DataFrame) -> pd.DataFrame:
    styles_df = pd.DataFrame('', index=df_slice.index, columns=df_slice.columns)
    print('df_slice.index', df_slice.index)
    print('df_slice.columns', df_slice.columns)
    styles_df['Performer'] = np.select([
        # 条件 1
        df_slice['overdue'] == 0 & df_slice['on schedule'] == 0
    ], [
        # 条件 1 的颜色
        'background-color: silver',
    ])
    return styles_df
df2.style.apply(apply_colors, axis=None)

然后出现了 KeyError: 'overdue'。

#print('df_slice.columns', df_slice.columns) 
df_slice.columns MultiIndex([('sum', 'labor costs', 'on time'),
                ('sum', 'labor costs', 'on schedule'),
                ('sum', 'labor costs', 'overdue'),
                ('len', 'labor costs', 'on time'),
                ('len', 'labor costs', 'on schedule'),
                ('len', 'labor costs', 'overdue')],
               names=[None, None, 'completed on time'])

英文:

I get DataFrame by function pivot_table:

df2 = pd.pivot_table(df, values=[&#39;labor costs],
                         index=[&#39;Division&#39;, &#39;Perfomer&#39;],
                         columns=[&#39;completed on time&#39;], aggfunc=[np.sum, len], margins=True, fill_value=0)

How can I highlight the row by condition: on schedule == 0 and overdue == 0 like table above?

I do:

def apply_colors(df_slice: pd.DataFrame) -&gt; pd.DataFrame:
        styles_df = pd.DataFrame(&#39;&#39;, index=df_slice.index, columns=df_slice.columns)
        print(&#39;df_slice.index&#39;, df_slice.index)
        print(&#39;df_slice.columns&#39;, df_slice.columns)
        styles_df[&#39;Perfomer&#39;] = np.select([
            # Condition 1
            df_slice[&#39;overdue&#39;] == 0 &amp; df_slice[&#39;on schedule&#39;] == 0
        ], [
            # Color for Condition 1
            &#39;background-color: silver&#39;,
        ])
        return styles_df
df2.style.apply(apply_colors, axis=None)

And get: KeyError: 'overdue'

#print(&#39;df_slice.columns&#39;, df_slice.columns) 
df_slice.columns MultiIndex([(&#39;sum&#39;, &#39;labor costs&#39;, &#39;on time&#39;),
                (&#39;sum&#39;, &#39;labor costs&#39;,       &#39;on schedule&#39;),
                (&#39;sum&#39;, &#39;labor costs&#39;,       &#39;overdue&#39;),
                (&#39;len&#39;, &#39;labor costs&#39;, &#39;on time&#39;),
                (&#39;len&#39;, &#39;labor costs&#39;,       &#39;on schedule&#39;),
                (&#39;len&#39;, &#39;labor costs&#39;,       &#39;overdue&#39;),
               names=[None, None, &#39;completed on time&#39;])

答案1

得分: 1

我认为你需要在 DataFrame.any 函数中添加括号，用于测试是否至少有一个匹配，将 default 参数添加到 numpy.select 函数中，以便在未匹配的情况下添加空格，用 numpy.broadcast_to 函数来重复着色：

def apply_colors(df_slice: pd.DataFrame) -> pd.DataFrame:
    
    arr = np.select([
        # 条件 1
        ((df_slice.xs('overdue', axis=1, level=2) == 0) &
        (df_slice.xs('on schedule', axis=1, level=2) == 0)).any(axis=1)
    
    ], [
        # 条件 1 的颜色
        'background-color: silver',
    
    ], default='')
    
    return pd.DataFrame(np.broadcast_to(arr[:, None], df_slice.shape),
                        index=df_slice.index,
                        columns=df_slice.columns)

英文:

I think you need add parantheses with DataFrame.any for test at least one match, default parameter to numpy.select for space if not matched masks, for repeat coloring is used numpy.broadcast_to:

def apply_colors(df_slice: pd.DataFrame) -&gt; pd.DataFrame:
        # print(&#39;df_slice.index&#39;, df_slice.index)
        # print(&#39;df_slice.columns&#39;, df_slice.columns)
        arr = np.select([
            # Condition 1
            ((df_slice.xs(&#39;overdue&#39;, axis=1, level=2) == 0) &amp; 
            (df_slice.xs(&#39;on schedule&#39;, axis=1, level=2) == 0)).any(axis=1)
        ], [
            # Color for Condition 1
            &#39;background-color: silver&#39;,
        ], default=&#39;&#39;)
        return pd.DataFrame(np.broadcast_to(arr[:, None], df_slice.shape),
                            index=df_slice.index,
                            columns=df_slice.columns)

通过集体智慧和协作来改善编程学习和解决问题的方式。致力于成为全球开发者共同参与的知识库，让每个人都能够通过互相帮助和分享经验来进步。

应用高亮于数据透视表

问题

答案1

将1个输入句子与1个给定句子通过相似性映射。

How to extract date from a specified column containing different types of date formats of a given Pandas DataFrame using Regex

安装Python版本后再降级会导致兼容性问题吗（例如模块未找到）？

WARNING: pip is configured with locations that require TLS/SSL, however the ssl module in Python is not available. Windows 10

如何在Playwright视觉比较中屏蔽多个定位器？

在C++中，可以使用可变模板参数来检索类型的内部类型。

selenium.common.exceptions.StaleElementReferenceException: Message: stale element reference: stale element not found

Creating and opening a URL to log in to Website via Basic Auth with Robot Framework/Selenium (Python)

AG Grid 在上下文菜单中以大文本形式打开

What's the correct way to type hint an empty list as a literal in python?

如何在Highcharts Gantt中更改本地化的星期名称

如何在同一个流中使用多个过滤器和映射函数？

如何使用Map/Set来将代码优化到O(n)？

.NET MAUI Android在GitHub Actions上构建失败，错误代码为1。