问题

Here's the translated code portion you requested:

df['sum-Md', 'sum-Lo', 'sum-Up'] = (
    pd.wide_to_long(
        df, stubnames=["Medium", 'Lower', 'Upper'],
        i=["Date",], j="zone",
        sep="-", suffix='\w+'
    )
    .query("year>=2022", engine="python")
    .groupby("Date")
    .Medium
    .sum()
    .array
)

Please note that the translated code assumes you have the necessary Python libraries and modules imported and that the DataFrame df is defined as per your provided DataFrame structure.

英文:

My problem is: sum of columns which starts with Medium (5+7) not 12 as well the remaining columns which starts Lower and Upper using pd.wide_to_long and only shows the first sum i.e sum-Md.

I have the following dataframe:

   Date  Medium-Ab  Lower-B.c   Upper-Dd  Medium-Fb  Lower-Gc Upper-H.I  year
09/2022          5          3         10          7         4        12  2022
10/2022          8          4         12          9         6        14  2022
11/2022          9          6         14         10         9        16  2022
12/2022         15         14         20          5         4        18  2022
01/2023         17         13         25         13         8        12  2023 
    ...        ...        ...        ...        ...       ...       ...  ...
12/2023         16         11         24         16        12        19  2023
01/2024         27         23         35         33        28        42  2023 
    ...        ...        ...        ...        ...       ...       ...   ...
12/2024         10         11         14         16        12        19  2023
    ...        ...        ...        ...        ...       ...       ...  ...
12/2032        ...        ...        ...        ...       ...       ...  ...

What I want is:

   Date  Medium-Ab  Lower-B.c  Upper-Dd  Medium-Fb  Lower-Gc  Upper-H.I  year  sum-Md sum-Lo  sum-Up
09/2022          5          3        10          7         4         12  2022      12      7      22 
10/2022          8          4        12          9         6         14  2022     ...    ...     ...
11/2022          9          6        14         10         9         16  2022     ...    ...     ...
    ...        ...        ...       ...        ...       ...        ...   ...     ...    ...     ...
11/2022        ...        ...       ...        ...       ...        ...   ...     ...    ...     ...

My try is:

df[&#39;sum-Md&#39;,&#39;sum-Lo&#39;,&#39;sum-Up&#39;] = (
   pd.wide_to_long(
       df, stubnames=[&quot;Medium&quot;, &#39;Lower&#39;, &#39;Upper&#39;],
       i=[&quot;Date&quot;,], j=&quot;zone&quot;,
       sep=&quot;-&quot;, suffix=&#39;\w+&#39;
   )
   .query(&quot;year&gt;=2022&quot;, engine=&quot;python&quot;)
   .groupby(&quot;Date&quot;)
   .Medium
   .sum()
   .array
)

答案1

得分: 0

If I understand correctly, then the most straightforward and explicit way to do that would be:

df['sum-Md'] = df['Medium-Ab'] + df['Medium-Fb']
df['sum-Lo'] = df['Lower-B.c'] + df['Lower-Gc']
df['sum-Up'] = df['Upper-Dd'] + df['Upper-H.I']
# Part in comment
df['Medium-Ab'] = df['Medium-Ab'] / df['sum-Md']
df['Medium-Fb'] = df['Medium-Fb'] / df['sum-Md']
df['Lower-B.c'] = df['Lower-B.c'] / df['sum-Lo']
df['Lower-Gc'] = df['Lower-Gc'] / df['sum-Lo']
df['Upper-Dd'] = df['Upper-Dd'] / df['sum-Up']
df['Upper-H.I'] = df['Upper-H.I'] / df['sum-Up']

If the dataframe has more columns of the same sort, you could try more programmatic approaches like:

for start, short in ('Medium', 'Md'), ('Lower', 'Lo'), ('Upper', 'Up'):
    col, cols = f'sum-{short}', [c for c in df.columns if c.startswith(start)]
    df[col] = df[cols].sum(axis=1)
    # Part in comment
    df[cols] = df[cols].div(df[col].values, axis=0)

df[['sum-Md','sum-Lo','sum-Up']] = (
    df.drop(columns=['Date', 'year'])
    .groupby(lambda c: c.split('-')[0], axis=1, sort=False)
    .sum()
)
# Part in comment
for start, short in ('Medium', 'Md'), ('Lower', 'Lo'), ('Upper', 'Up'):
    base = f'sum-{short}'
    for col in df.filter(regex=f'^{start}').columns:
        df[col] = df[col] / df[base]

英文:

If I understand

> Yes I want sum up Medium-Ab and Medium-Fb in a new column its name sum-Md as well as Lower columns in a new column its name sum-Lo and Upper columns in a new column sum-Up then append the new three columns to the original dataframe.

correctly, then the most straight-forward and explicit way to do that would be:

df[&#39;sum-Md&#39;] = df[&#39;Medium-Ab&#39;] + df[&#39;Medium-Fb&#39;]
df[&#39;sum-Lo&#39;] = df[&#39;Lower-B.c&#39;] + df[&#39;Lower-Gc&#39;]
df[&#39;sum-Up&#39;] = df[&#39;Upper-Dd&#39;] + df[&#39;Upper-H.I&#39;]
# Part in comment
df[&#39;Medium-Ab&#39;] = df[&#39;Medium-Ab&#39;] / df[&#39;sum-Md&#39;]
df[&#39;Medium-Fb&#39;] = df[&#39;Medium-Fb&#39;] / df[&#39;sum-Md&#39;]
df[&#39;Lower-B.c&#39;] = df[&#39;Lower-B.c&#39;] / df[&#39;sum-Lo&#39;]
df[&#39;Lower-Gc&#39;] = df[&#39;Lower-Gc&#39;] / df[&#39;sum-Lo&#39;]
df[&#39;Upper-Dd&#39;] = df[&#39;Upper-Dd&#39;] / df[&#39;sum-Up&#39;]
df[&#39;Upper-H.I&#39;] = df[&#39;Upper-H.I&#39;] / df[&#39;sum-Up&#39;]

If the dataframe has actually more columns of the sort then you could try more programatically approaches like

for start, short in (&#39;Medium&#39;, &#39;Md&#39;), (&#39;Lower&#39;, &#39;Lo&#39;), (&#39;Upper&#39;, &#39;Up&#39;):
    col, cols = f&#39;sum-{short}&#39;, [c for c in df.columns if c.startswith(start)]
    df[col] = df[cols].sum(axis=1)
    # Part in comment
    df[cols] = df[cols].div(df[col].values, axis=0)

df[[&#39;sum-Md&#39;,&#39;sum-Lo&#39;,&#39;sum-Up&#39;]] = (
    df.drop(columns=[&#39;Date&#39;, &#39;year&#39;])
    .groupby(lambda c: c.split(&#39;-&#39;)[0], axis=1, sort=False)
    .sum()
)
# Part in comment
for start, short in (&#39;Medium&#39;, &#39;Md&#39;), (&#39;Lower&#39;, &#39;Lo&#39;), (&#39;Upper&#39;, &#39;Up&#39;):
    base = f&#39;sum-{short}&#39;
    for col in df.filter(regex=f&#39;^{start}&#39;).columns:
        df[col] = df[col] / df[base]

通过集体智慧和协作来改善编程学习和解决问题的方式。致力于成为全球开发者共同参与的知识库，让每个人都能够通过互相帮助和分享经验来进步。

“pd.wide_to_long” 使用后，某些列的总和不正确。

问题

答案1

Python和Django – 无法使用相对导入语句访问模块（已更新）

有没有用于身份验证用户删除的Python函数？

重组一个2D的NumPy数组，基于匹配的列数值。

堆叠条形图，具有不同的x轴

What's the correct way to type hint an empty list as a literal in python?

如何在Highcharts Gantt中更改本地化的星期名称

如何在同一个流中使用多个过滤器和映射函数？

如何使用Map/Set来将代码优化到O(n)？

.NET MAUI Android在GitHub Actions上构建失败，错误代码为1。

如何在Playwright视觉比较中屏蔽多个定位器？

在C++中，可以使用可变模板参数来检索类型的内部类型。

selenium.common.exceptions.StaleElementReferenceException: Message: stale element reference: stale element not found

Creating and opening a URL to log in to Website via Basic Auth with Robot Framework/Selenium (Python)

AG Grid 在上下文菜单中以大文本形式打开

发表评论