January 6, 2020, 20:33:25

Subqueries vs Multi Table Join

Question

I have three tables, A, B, and C, and I want to count the rows in their intersection.

Way 1:

select count(a.id) from A a join B b on a.id = b.id join C c on b.id = c.id;

Result Count - X

Way 2:

SELECT count(id) FROM A WHERE id IN (SELECT id FROM B WHERE id IN (SELECT id FROM C));

Result Count - Y

The two queries return different counts. What exactly is wrong?

Answer 1

Score: 2

A JOIN can multiply the number of rows as well as filter rows out.

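If b or c contains duplicate id values, the join returns one row for every matching combination, so the first count is inflated. In this case, the second count should be the correct one because nothing is double-counted, assuming id is unique in a. If it is not, the query needs count(distinct a.id).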
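To make the difference concrete, here is a minimal sketch that runs both queries from the question against an in-memory SQLite database. The sample rows and the function name `demo_counts` are assumptions made up for illustration; the original tables are not shown in the question.

```python
import sqlite3


def demo_counts():
    """Return (join_count, in_count) for sample tables with duplicate ids."""
    conn = sqlite3.connect(":memory:")
    # Hypothetical sample data: id is unique in A, but not in B or C.
    conn.executescript("""
        CREATE TABLE A (id INTEGER);
        CREATE TABLE B (id INTEGER);
        CREATE TABLE C (id INTEGER);
        INSERT INTO A VALUES (1), (2), (3);
        INSERT INTO B VALUES (1), (1), (2);  -- id 1 appears twice
        INSERT INTO C VALUES (1), (2), (2);  -- id 2 appears twice
    """)
    # Way 1: the join yields one row per matching (a, b, c) combination.
    join_count = conn.execute(
        "SELECT count(a.id) FROM A a "
        "JOIN B b ON a.id = b.id "
        "JOIN C c ON b.id = c.id"
    ).fetchone()[0]
    # Way 2: IN only tests membership, so each row of A counts at most once.
    in_count = conn.execute(
        "SELECT count(id) FROM A WHERE id IN "
        "(SELECT id FROM B WHERE id IN (SELECT id FROM C))"
    ).fetchone()[0]
    conn.close()
    return join_count, in_count


print(demo_counts())  # (4, 2)
```

With this data, ids 1 and 2 are the only ones present in all three tables, yet the join reports 4 because each of them matches two row combinations.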

The equivalent using JOIN would use COUNT(DISTINCT):

select count(distinct a.id)
from A a join
     B b
     on a.id = b.id join
     C c
     on b.id = c.id;

I mention this for completeness but do not recommend this approach. Multiplying the number of rows just to remove them using distinct is inefficient.
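As a rough illustration of that cost, the sketch below counts how many rows the join produces before DISTINCT collapses them. It uses an in-memory SQLite database; the sample rows and the function name `joined_vs_distinct` are hypothetical.

```python
import sqlite3


def joined_vs_distinct():
    """Return (rows produced by the join, distinct ids among those rows)."""
    conn = sqlite3.connect(":memory:")
    # Hypothetical sample data: id 1 is repeated 3 times in B and 4 times in C.
    conn.executescript("""
        CREATE TABLE A (id INTEGER);
        CREATE TABLE B (id INTEGER);
        CREATE TABLE C (id INTEGER);
        INSERT INTO A VALUES (1), (2);
        INSERT INTO B VALUES (1), (1), (1);
        INSERT INTO C VALUES (1), (1), (1), (1);
    """)
    join_sql = (
        " FROM A a JOIN B b ON a.id = b.id JOIN C c ON b.id = c.id"
    )
    # Rows the database has to produce for the join: 1 * 3 * 4 combinations.
    joined_rows = conn.execute("SELECT count(*)" + join_sql).fetchone()[0]
    # Rows that survive once duplicates of a.id are removed.
    distinct_ids = conn.execute(
        "SELECT count(DISTINCT a.id)" + join_sql
    ).fetchone()[0]
    conn.close()
    return joined_rows, distinct_ids


print(joined_vs_distinct())  # (12, 1)
```

Twelve intermediate rows are generated only to be reduced to a single id, which is the wasted work the answer warns about. How much this matters in practice depends on the database and its query planner.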

In many databases, the most efficient method might be:

select count(*)
from a
where exists (select 1 from b where b.id = a.id) and
      exists (select 1 from c where c.id = a.id);

Note: This assumes there are indexes on the id columns and that id is unique in a.
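The uniqueness caveat can be checked with a small sketch, again using an in-memory SQLite database. The sample rows, the index names, and the function name `exists_counts` are assumptions for illustration only; whether the indexes help depends on the database in use.

```python
import sqlite3


def exists_counts():
    """Return (count(*), count(distinct id)) for the EXISTS form of the query."""
    conn = sqlite3.connect(":memory:")
    # Hypothetical sample data: id 1 is duplicated in A, breaking the assumption.
    conn.executescript("""
        CREATE TABLE a (id INTEGER);
        CREATE TABLE b (id INTEGER);
        CREATE TABLE c (id INTEGER);
        INSERT INTO a VALUES (1), (1), (2), (3);
        INSERT INTO b VALUES (1), (2);
        INSERT INTO c VALUES (1), (2), (2);
        -- Indexes on the id columns, as the note assumes (names are made up).
        CREATE INDEX idx_b_id ON b (id);
        CREATE INDEX idx_c_id ON c (id);
    """)
    where_sql = (
        " FROM a"
        " WHERE EXISTS (SELECT 1 FROM b WHERE b.id = a.id)"
        " AND EXISTS (SELECT 1 FROM c WHERE c.id = a.id)"
    )
    # Duplicates in b and c no longer matter, but duplicate rows in a still count.
    row_count = conn.execute("SELECT count(*)" + where_sql).fetchone()[0]
    # Counting distinct ids removes the dependence on a.id being unique.
    id_count = conn.execute(
        "SELECT count(DISTINCT a.id)" + where_sql
    ).fetchone()[0]
    conn.close()
    return row_count, id_count


print(exists_counts())  # (3, 2)
```

The duplicate in c does not inflate the result, but the duplicate row in a does, so count(*) is only the intersection count when id is unique in a.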
