February 14, 2023, 22:06:11

Create duplicates of rows based on column values

Question

I'm trying to build a histogram of some data in polars. As part of my histogram code, I need to duplicate some rows. I've got a column of values, where each row also has a weight that says how many times the row should be added to the histogram.

How can I duplicate my value rows according to the weight column?

Here is some example data, with a target series:

import polars as pl

df = pl.DataFrame({"value":[1,2,3], "weight":[2, 2, 1]})

print(df)
# shape: (3, 2)
# ┌───────┬────────┐
# │ value ┆ weight │
# │ ---   ┆ ---    │
# │ i64   ┆ i64    │
# ╞═══════╪════════╡
# │ 1     ┆ 2      │
# │ 2     ┆ 2      │
# │ 3     ┆ 1      │
# └───────┴────────┘

s_target = pl.Series(name="value", values=[1,1,2,2,3])
print(s_target)
# shape: (5,)
# Series: 'value' [i64]
# [
# 	1
# 	1
# 	2
# 	2
# 	3
# ]
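
For reference, the target above can be written in plain Python, independent of Polars. This is only a sketch of the expected semantics, useful for checking a Polars solution against; the helper name `expand_by_weight` is made up for illustration and is not part of any library.

```python
from itertools import chain, repeat


def expand_by_weight(values, weights):
    """Repeat each value `weight` times, preserving the original order."""
    return list(
        chain.from_iterable(repeat(v, w) for v, w in zip(values, weights))
    )


# Same data as the example DataFrame: value=[1, 2, 3], weight=[2, 2, 1]
expanded = expand_by_weight([1, 2, 3], [2, 2, 1])
print(expanded)  # [1, 1, 2, 2, 3]
```

The result has one entry per unit of weight, so its length equals the sum of the weight column.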

Answer 1

Score: 4

How about

(
    df.with_columns(
        pl.col("value").repeat_by(pl.col("weight"))
    )
    .select(pl.col("value").arr.explode())
)

In [11]: df.with_columns(pl.col('value').repeat_by(pl.col('weight'))).select(pl.col('value').arr.explode())
Out[11]:
shape: (5, 1)
┌───────┐
│ value │
│ ---   │
│ i64   │
╞═══════╡
│ 1     │
│ 1     │
│ 2     │
│ 2     │
│ 3     │
└───────┘

I didn't know you could do this so easily; I only learned about it while writing the answer. Polars is so nice.

Answer 2

Score: 2

Turns out [`repeat_by`](https://pola-rs.github.io/polars/py-polars/html/reference/expressions/api/polars.Expr.repeat_by.html) and a subsequent [`explode`](https://pola-rs.github.io/polars/py-polars/html/reference/expressions/api/polars.Expr.arr.explode.html#polars.Expr.arr.explode) are the perfect building blocks for this transformation:

```python
>>> df.select(pl.col('value').repeat_by('weight').arr.explode())
shape: (5, 1)
┌───────┐
│ value │
│ ---   │
│ i64   │
╞═══════╡
│ 1     │
│ 1     │
│ 2     │
│ 2     │
│ 3     │
└───────┘
```
