March 9, 2023 12:30:28

Snowflake Snowpark Python: Group By and Concat

Question

I have this table in Snowflake:

id_request	alert_code
100	R70
100	R69
100	R54
101	R24
101	R93

I want to turn it into this:

id_request	alert_all
100	R70,R69,R54
101	R24,R93

I tried writing this, but it seems to be wrong:

df_alerts_3 = df_alerts_2.groupBy('id_request')\
    .agg(concat_ws(lit(','), array_agg('alert_code')).alias('alert_all'))

Thank you very much for any assistance.

Answer 1

Score: 2

CONCAT_WS concatenates strings within a single row. To concatenate strings across multiple rows, you need the aggregate function LISTAGG.
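
To make the row-wise versus aggregate distinction concrete, here is a small plain-Python sketch. It does not use Snowflake at all; `concat_ws` and `listagg_by_key` are hypothetical helpers that only mimic the two behaviours on the sample data from the question.

```python
from collections import defaultdict

# Sample rows from the question: (id_request, alert_code)
rows = [(100, "R70"), (100, "R69"), (100, "R54"), (101, "R24"), (101, "R93")]


def concat_ws(sep, *values):
    """Row-wise: join several values that all belong to ONE row."""
    return sep.join(values)


def listagg_by_key(rows, sep=","):
    """Aggregate: join the values of ALL rows that share the same key."""
    groups = defaultdict(list)
    for key, value in rows:
        groups[key].append(value)
    return {key: sep.join(values) for key, values in groups.items()}


result = listagg_by_key(rows)
print(result)  # {100: 'R70,R69,R54', 101: 'R24,R93'}
```

The first helper never looks beyond the arguments of a single call, which is why a row-wise function cannot collapse five rows into two.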

The Snowpark equivalent is snowflake.snowpark.functions.listagg:

> Returns the concatenated input values, separated by delimiter string
>
> df.group_by(df.col1).agg(listagg(df.col2, ",").within_group(df.col2.asc()))
> df.select(listagg(df["col2"], ",", False))
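
Applied to the table in the question, this might look like the sketch below. The function name `concat_alerts` is hypothetical, and the column names `id_request` and `alert_code` are taken from the question. Ordering by `alert_code` is an assumption: the sample data has no column that records the original row order, so the codes come out sorted rather than in input order.

```python
def concat_alerts(df_alerts):
    """Return one row per id_request with its alert codes joined by commas.

    df_alerts is assumed to be a Snowpark DataFrame with the columns
    id_request and alert_code, as in the question.
    """
    # Imported inside the function so this sketch can be loaded even where
    # the snowflake-snowpark-python package is not installed.
    from snowflake.snowpark.functions import col, listagg

    return df_alerts.group_by("id_request").agg(
        listagg(col("alert_code"), ",")
        # Assumption: sort the codes within each group by their own value.
        .within_group(col("alert_code").asc())
        .alias("alert_all")
    )
```

With the DataFrame from the question this would be called as `df_alerts_3 = concat_alerts(df_alerts_2)`.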
