2023年6月5日 15:31:48go评论55阅读模式

英文:

Redshift SQL: How to only aggregate subsequent rows with a certain value using window functions?

问题

我有以下表格：

A	B	C
1	0	12
2	0	13
3	1	5
4	1	1
5	1	2
6	0	22
7	0	20
8	1	1
9	1	10
10	0	11
11	0	12

我想以一种方式转换数据，即在按A升序排序后，仅在B的值为1时将后续行汇总在一起。每个窗口中的汇总应该取MAX(A)和SUM(C)。将此过程应用于上述表格后的结果如下：

new_A	new_C
1	12
2	13
5	8
6	22
7	20
9	11
10	11
11	12

如果可能的话，我想仅使用SQL来完成这个任务。我尝试了多个窗口函数，但未能实现我想要做的事情。

英文:

I have the following table:

A	B	C
1	0	12
2	0	13
3	1	5
4	1	1
5	1	2
6	0	22
7	0	20
8	1	1
9	1	10
10	0	11
11	0	12

I would like to transform the data in a way that subsequent rows (after ordering by A ascending), are aggregated together only if their value of B is 1. The aggregation in each window should take MAX(A) and SUM(C). The result of this process after applying it to the above table is the following:

new_A	new_C
1	12
2	13
5	8
6	22
7	20
9	11
10	11
11	12

I want to do this via SQL only if possible.

I tried multiple window functions but didn't manage to achieve what I am trying to do.

答案1

得分: 0

我们可以通过创建一个伪序列来实现这一目标，该序列为B列中的每个岛分配相同的值。然后，我们可以按照这一列进行聚合以获得最终结果。

WITH cte AS (
    SELECT *, LAG(B) OVER (ORDER BY A) AS lag_B
    FROM yourTable
),
cte2 AS (
    SELECT *, SUM(CASE WHEN B = 1 AND lag_B = 1 THEN 0 ELSE 1 END)
                  OVER (ORDER BY A) AS seq
    FROM cte
)

SELECT MAX(A) AS new_A, SUM(C) AS new_C
FROM cte2
GROUP BY seq
ORDER BY MAX(A);

英文:

We can achieve this by creating a pseudo sequence which assigns the same value to every island of a values in the B column. Then, we can aggregate by this column to get the final result.

WITH cte AS (
    SELECT *, LAG(B) OVER (ORDER BY A) AS lag_B
    FROM yourTable
),
cte2 AS AS (
    SELECT *, SUM(CASE WHEN B = 1 AND lag_B = 1 THEN 0 ELSE 1 END)
                  OVER (ORDER BY A) AS seq
    FROM cte
)

SELECT MAX(A) AS new_A, SUM(C) AS new_C
FROM cte2
GROUP BY seq
ORDER BY MAX(A);

通过集体智慧和协作来改善编程学习和解决问题的方式。致力于成为全球开发者共同参与的知识库，让每个人都能够通过互相帮助和分享经验来进步。

Redshift SQL：如何使用窗口函数仅聚合具有特定值的后续行？

问题

答案1

golang: http.get() 返回的是什么？

Python – pandas：复制创建变量的函数

VBA宏或条件格式化以识别指定的案件编号。

DB2 – 修改表格添加具有唯一默认值（UUID）的列

What's the correct way to type hint an empty list as a literal in python?

如何在Highcharts Gantt中更改本地化的星期名称

如何在同一个流中使用多个过滤器和映射函数？

如何使用Map/Set来将代码优化到O(n)？

.NET MAUI Android在GitHub Actions上构建失败，错误代码为1。

如何在Playwright视觉比较中屏蔽多个定位器？

在C++中，可以使用可变模板参数来检索类型的内部类型。

selenium.common.exceptions.StaleElementReferenceException: Message: stale element reference: stale element not found

Creating and opening a URL to log in to Website via Basic Auth with Robot Framework/Selenium (Python)

AG Grid 在上下文菜单中以大文本形式打开

发表评论