2023年5月17日 18:45:18go评论46阅读模式

英文:

How do you access a column name in a polars expression?

问题

我在polars中实现了一个sigmoid变换，代码如下：

def sigmoid(c: pl.Expr) -> pl.Expr:
    return 1 / ((-c).exp() + 1)

这个实现很好，但是按照polars的命名规范，生成的列名为'literal'。

你可以通过重写sigmoid函数来保留列名，如下：

def sigmoid(c: pl.Expr) -> pl.Expr:
    return ((c * -1).exp() + 1) ** -1

但是这种方法有两个问题：
A. 这种写法不太直观
B. 我不希望我的代码具有这种“神奇/不可见”的列名跟踪功能

我想要的是在函数的末尾添加一个.alias()来确保列名被保留。下面的伪代码表达了这个想法：

def sigmoid(c: pl.Expr) -> pl.Expr:
    return (1 / ((-c).exp() + 1)).alias(c.name)

然而，polars表达式没有.name属性。那么，我怎么样才能保留列名呢？

你可以尝试以下方法：

df.select(
   pl.col('a').pipe(sigmoid).alias('a'), 
   pl.col('b').pipe(sigmoid).alias('b'), 
   pl.col('c').pipe(sigmoid).alias('c'), 
   ...
)

但这很繁琐，并且在以下情况下效果不佳：

df.select(
   pl.all().pipe(sigmoid)
)

英文:

I implemented a sigmoid transformation in polars as follows:

def sigmoid(c:pl.Expr)-&gt;pl.Expr:
    return 1 / ((-c).exp() + 1)

This works great, except that by polars naming conventions the resulting column is called 'literal'

I could keep the column name by re-writing sigmoid as

def sigmoid(c:pl.Expr)-&gt;pl.Expr:
    return ((c * -1).exp() + 1)**-1

But:
A. That is horrible
B. I don't want my code to have this "magical/invisible" tracking of column names

What I'd like to do is add a .alias() at the end of my function to ensure the column name is preserved.

The following pseudo-code expresses the idea:

def sigmoid(c:pl.Expr)-&gt;pl.Expr:
    return (1 / ((-c).exp() + 1)).alias(c.name)

However, polars expressions do not have a .name attribute.

How else could I keep the column name?

Not that I could do:

df.select(
   pl.col(&#39;a&#39;).pipe(sigmoid).alias(&#39;a&#39;), 
   pl.col(&#39;b&#39;).pipe(sigmoid).alias(&#39;b&#39;), 
   pl.col(&#39;c&#39;).pipe(sigmoid).alias(&#39;c&#39;), 
   ...
)

But that is cumbersome, and would not work well with

df.select(
   pl.all().pipe(sigmoid)
)

答案1

得分: 1

以下是翻译好的内容：

.output_name 方法来自元命名空间

pl.col("a").meta.output_name()

# 'a'

使用条件语句，当 a 等于 1 时选择 b，否则选择 c，然后使用 .output_name() 方法来获取输出名称。

pl.when(pl.col("a") == 1).then(pl.col("b")).otherwise(pl.col("c")).meta.output_name()

# 'b'

使用条件语句，当 a 等于 1 时选择 b，否则选择 c，然后使用 .root_names() 方法来获取根名称列表。

pl.when(pl.col("a") == 1).then(pl.col("b")).otherwise(pl.col("c")).meta.root_names()

# ['b', 'c', 'a']

定义了一个名为 sigmoid 的函数，接受一个参数 c，并返回带有输出名称的表达式。

def sigmoid(c: pl.Expr) -> pl.Expr:
    return (1 / ((-c).exp() + 1)).alias(c.meta.output_name())

pl.DataFrame(dict(a=[1, 2, 3])).select(sigmoid(pl.col("a")))

输出的表格形状为 (3, 1)，包含一列名为 a 的浮点数。

shape: (3, 1)
┌──────────┐
│ a        │
│ ---      │
│ f64      │
╞══════════╡
│ 0.731059 │
│ 0.880797 │
│ 0.952574 │
└──────────┘

英文:

The methods such as .output_name from the meta namespace.

pl.col(&quot;a&quot;).meta.output_name()

# &#39;a&#39;

pl.when(pl.col(&quot;a&quot;) == 1).then(pl.col(&quot;b&quot;)).otherwise(pl.col(&quot;c&quot;)).meta.output_name()

# &#39;b&#39;

pl.when(pl.col(&quot;a&quot;) == 1).then(pl.col(&quot;b&quot;)).otherwise(pl.col(&quot;c&quot;)).meta.root_names()

# [&#39;b&#39;, &#39;c&#39;, &#39;a&#39;]

def sigmoid(c:pl.Expr)-&gt;pl.Expr:
   return (1 / ((-c).exp() + 1)).alias(c.meta.output_name())

pl.DataFrame(dict(a = [1, 2, 3])).select(sigmoid(pl.col(&quot;a&quot;)))

shape: (3, 1)
┌──────────┐
│ a        │
│ ---      │
│ f64      │
╞══════════╡
│ 0.731059 │
│ 0.880797 │
│ 0.952574 │
└──────────┘

通过集体智慧和协作来改善编程学习和解决问题的方式。致力于成为全球开发者共同参与的知识库，让每个人都能够通过互相帮助和分享经验来进步。

如何在 polars 表达式中访问列名？

问题

答案1

在Polars中进行“indexed”查找的最快方法是什么？

创建基于列数值的行的副本

如何以列为单位，在 Polars 数据框中将所有列逐元素除以列特定的标量？

在Python Polars中，如何根据另一列的条件将多列的值更改为null或0。

What's the correct way to type hint an empty list as a literal in python?

如何在Highcharts Gantt中更改本地化的星期名称

如何在同一个流中使用多个过滤器和映射函数？

如何使用Map/Set来将代码优化到O(n)？

.NET MAUI Android在GitHub Actions上构建失败，错误代码为1。

如何在Playwright视觉比较中屏蔽多个定位器？

在C++中，可以使用可变模板参数来检索类型的内部类型。

selenium.common.exceptions.StaleElementReferenceException: Message: stale element reference: stale element not found

Creating and opening a URL to log in to Website via Basic Auth with Robot Framework/Selenium (Python)

AG Grid 在上下文菜单中以大文本形式打开

发表评论