问题

在我的代码中，我已经注释掉了我认为是多余的排序子句的行。我已经检查了结果（行值），在有注释和没有注释的情况下，结果是相同的。我只是想知道是否存在第二个和第四个CTE中我已经注释掉的两个"order by"子句不是真正多余的情况。

speed_dataset as (
select uc_id, imei, points_geom, time_created, st_distance((points_geom::geography),lag(points_geom::geography) over (partition by imei order by time_created))
 / nullif(( EXTRACT(EPOCH FROM time_created) - EXTRACT(EPOCH FROM LAG(time_created) OVER(PARTITION BY imei ORDER BY time_created)))::FLOAT8,0)  as speed
from orig_dataset 
order by imei,time_created 
)
,
subset_speed as (
select uc_id, ROW_NUMBER() OVER (ORDER BY (time_created)) AS row_id, speed, imei,points_geom ,time_created
from speed_dataset sd 
where speed < 0.1 or speed between 0.75 and 2 
--order by time_created 
)
,
leading_speeds as (
select *,lead (speed) over (partition by imei order by time_created) as lead_speed from subset_speed 
)
,
subset_cr as (
select * from leading_speeds 
where 
(
(speed < 0.1 and lead_speed between 0.75 and 2)
or 
(speed between 0.75 and 2 and lead_speed < 0.1)
)
--order by imei,time_created 
)
,
clustering as(
SELECT uc_id,row_id,imei, speed, points_geom ,time_created, ST_ClusterDBSCAN(st_transform(points_geom,24313),eps := 150, minPoints := 3) 
  OVER(ORDER BY row_id) AS cluster_id FROM subset_cr 
)

希望这有所帮助。

英文:

In my code below, I have commented out the lines which I believe are redundant ordering clauses. I have checked the results (row values) with and without commenting, and the results are the same. I was just wondering if there is ANY scenario where the two order by's that I have commented out in the second and fourth CTE are not really redundant.

speed_dataset as (
select uc_id, imei, points_geom, time_created, st_distance((points_geom::geography),lag(points_geom::geography) over (partition by imei order by time_created))
 / nullif(( EXTRACT(EPOCH FROM time_created) - EXTRACT(EPOCH FROM LAG(time_created) OVER(PARTITION BY imei ORDER BY time_created)))::FLOAT8,0)  as speed
from orig_dataset 
order by imei,time_created 
)
,
subset_speed as (
select uc_id, ROW_NUMBER() OVER (ORDER BY (time_created)) AS row_id, speed, imei,points_geom ,time_created
from speed_dataset sd 
where speed &lt; 0.1 or speed between 0.75 and 2 
--order by time_created 
)
,
leading_speeds as (
select *,lead (speed) over (partition by imei order by time_created) as lead_speed from subset_speed 
)
,
subset_cr as (
select * from leading_speeds 
where 
(
(speed &lt; 0.1 and lead_speed between 0.75 and 2)
or 
(speed between 0.75 and 2 and lead_speed &lt; 0.1)
)
--order by imei,time_created 
)
,
clustering as(
SELECT uc_id,row_id,imei, speed, points_geom ,time_created, ST_ClusterDBSCAN(st_transform(points_geom,24313),eps := 150, minPoints := 3) 
  OVER(ORDER BY row_id) AS cluster_id FROM subset_cr 
)

答案1

得分: 1

你的直觉是正确的。通常情况下，除非你将其与 DISTINCT ON (..) 或 LIMIT/FETCH FIRST ... ROWS ONLY 结合使用，否则在 CTE 或视图定义中 永远不应该 使用 ORDER BY。

英文:

Your intuition is right. As a rule, you should never have an ORDER BY in a CTE or a view definition unless you use it in conjunction with DISTINCT ON (..) or LIMIT/FETCH FIRST ... ROWS ONLY.

通过集体智慧和协作来改善编程学习和解决问题的方式。致力于成为全球开发者共同参与的知识库，让每个人都能够通过互相帮助和分享经验来进步。

识别冗余的订单 by’s

问题

答案1

在使用Golang的PostgreSQL中，需要将日期增加1年。

返回行值的总和和行数

如何在 Golang 中使用数据库连接池来管理连接到多个集群主机的连接？

如何在插入时指定本地表字段和外部表字段？

如何在Playwright视觉比较中屏蔽多个定位器？

在C++中，可以使用可变模板参数来检索类型的内部类型。

selenium.common.exceptions.StaleElementReferenceException: Message: stale element reference: stale element not found

Creating and opening a URL to log in to Website via Basic Auth with Robot Framework/Selenium (Python)

AG Grid 在上下文菜单中以大文本形式打开

What's the correct way to type hint an empty list as a literal in python?

如何在Highcharts Gantt中更改本地化的星期名称

如何在同一个流中使用多个过滤器和映射函数？

如何使用Map/Set来将代码优化到O(n)？

.NET MAUI Android在GitHub Actions上构建失败，错误代码为1。