2023-07-27 23:27:43

How would I style a column's elements based on another column's elements in a pandas data frame?

Question

Say, for example, I have a pandas data frame like so:

import pandas as pd

# Sample DataFrame
data = {
    'Greek letters': ["Alpha", "Beta", "Gamma", "Omega", "Delta"],
    'English letters': ["A", "B", "C", "D", "E"],
    'Greek Letter score': [5, 10, 15, 20, 25],
    'English Letter score': [3, 11, 12, 18, 25]
}
df = pd.DataFrame(data)

What I want to do is apply specific background colors only to the elements in the columns Greek letters and English letters based on their respective scores (so, based on the elements in the Greek Letter score and English Letter score columns respectively).

def highlight_letter(value):
    # How would I use the letter element to obtain its corresponding score?
    # score = some technique to obtain the letter's score
    if score <= 10:
        return 'background-color: lightgreen'
    elif score <= 20:
        return 'background-color: yellow'
    else:
        return 'background-color: blue'

styled_df = df.style.applymap(highlight_letter, subset=['Greek letters', 'English letters'])

This is what the expected output should look like:

Answer 1

Score: 2

What you need here is not applymap but apply along the rows, so that each call sees a whole row and can look up the score next to the letter. To avoid hardcoding column names, let's assume the columns come in pairs like "Something letters" and "Something Letter score", in any letter case and any column order. With that in mind, I'd suggest this approach:

def highlight_letter(record):
    formatting = record.copy()
    subset = record.index.str.endswith('letters')
    formatting[~subset] = ''
    for name in formatting.index[subset]:
        score = record[name[:-1] + ' score']   # -1 to drop ending s in letters
        formatting[name] = (
            'background-color: lightgreen' if score <= 10 else
            'background-color: yellow' if score <= 20 else
            'background-color: blue'
        )
    return formatting


styled_df = df.rename(columns=str.lower).style.apply(
    highlight_letter,
    axis='columns'
)

The output on your test data looks like this:
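To inspect the result without rendering HTML, the same row-wise function can be run through plain DataFrame.apply, which returns the CSS string for each cell instead of a styled table. This is only a sketch for checking the logic; the name css is introduced here for illustration, and the function is repeated so the block runs on its own:

```python
import pandas as pd

df = pd.DataFrame({
    'Greek letters': ["Alpha", "Beta", "Gamma", "Omega", "Delta"],
    'English letters': ["A", "B", "C", "D", "E"],
    'Greek Letter score': [5, 10, 15, 20, 25],
    'English Letter score': [3, 11, 12, 18, 25],
})


def highlight_letter(record):
    # Same row-wise function as above: one CSS string per cell of the row.
    formatting = record.copy()
    subset = record.index.str.endswith('letters')
    formatting[~subset] = ''          # score cells get no styling
    for name in formatting.index[subset]:
        score = record[name[:-1] + ' score']   # 'greek letters' -> 'greek letter score'
        formatting[name] = (
            'background-color: lightgreen' if score <= 10 else
            'background-color: yellow' if score <= 20 else
            'background-color: blue'
        )
    return formatting


# css is a hypothetical name: a DataFrame of CSS strings, one per cell,
# computed row by row in the same way the Styler would compute them.
css = df.rename(columns=str.lower).apply(highlight_letter, axis='columns')
print(css[['greek letters', 'english letters']])
```

Each letter cell carries the color of its own score, and the score columns hold empty strings, which the Styler treats as "no styling".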

Update

Here's another way, for the case where the columns have a fixed structure: the first half of the columns are letters, and the second half are the corresponding score columns in the same order:

import numpy as np

half = len(df.columns) // 2
score = df.iloc[:, half:]

# Count how many thresholds each score satisfies:
# 2 if score <= 10, 1 if 10 < score <= 20, 0 if score > 20
category_level = np.add(
    score <= 10,
    score <= 20,
    dtype=int     # need this to accumulate integers, not booleans
)

# Indexed by category_level
category_formatting = [
    'background-color: blue',          # level 0: score > 20 (default color)
    'background-color: yellow',        # level 1: 10 < score <= 20
    'background-color: lightgreen',    # level 2: score <= 10
]

formatting = np.choose(category_level, category_formatting)

At this point, formatting is a DataFrame whose column names come from the second half (the score columns). For the styles to land on the letter columns, we have to replace those names with their counterparts from the first half:

formatting.columns = df.columns[:half]
styled_df = df.style.apply(lambda _: formatting, axis=None)
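If you would rather not depend on np.choose handing back a DataFrame, a variant of the same idea can build the style frame explicitly. The sketch below assumes the same fixed layout (letter columns first, score columns second, in matching order). The names build_styles and style_by_score are made up for this example, and np.select stands in for the np.add / np.choose combination:

```python
import numpy as np
import pandas as pd

df = pd.DataFrame({
    'Greek letters': ["Alpha", "Beta", "Gamma", "Omega", "Delta"],
    'English letters': ["A", "B", "C", "D", "E"],
    'Greek Letter score': [5, 10, 15, 20, 25],
    'English Letter score': [3, 11, 12, 18, 25],
})


def build_styles(data):
    """Return a frame of CSS strings with the same index and columns as data.

    Assumes the first half of the columns are letters and the second half
    are their scores, in the same order (hypothetical helper).
    """
    half = len(data.columns) // 2
    scores = data.iloc[:, half:2 * half].to_numpy()

    # Conditions are checked in order, so the first match wins.
    css = np.select(
        [scores <= 10, scores <= 20],
        ['background-color: lightgreen', 'background-color: yellow'],
        default='background-color: blue',
    )

    # Empty strings mean "no styling"; only the letter columns are filled in.
    styles = pd.DataFrame('', index=data.index, columns=data.columns, dtype=object)
    styles.iloc[:, :half] = css
    return styles


def style_by_score(data):
    # With axis=None the Styler passes the whole frame to build_styles.
    return data.style.apply(build_styles, axis=None)
```

In a notebook, style_by_score(df) should render the same coloring as the row-wise version. Because the style frame has the same index and columns as df, no column renaming is needed.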
