Question

I have the following three tables in BigQuery, with the data shown below.

I'd like to write a SQL query that matches on the id and current date columns and returns the following row as the output.

Expected result set:

id            existing_stores                             missing_stores
1003812607    "3640,0130,0131,2306,3638,0127,2789,2305"   "3102,2681,2686,2670,2682,3101,2673,2669,3103,2668"

These are the tables:

item table:

id            date
------------------------
1003812607    2023-08-03
1003812607    2023-08-01
1003812607    2023-07-23
1003812607    2023-06-30

item_change_history:

createdTime                       docType   id            existing_stores
----------------------------------------------------------------------------------------------------
2023-08-03 11:01:10.139617 UTC    Item      1003812607    "3640,0130,0131,2306,3638,0127,2789,2305"
2023-07-01 09:01:10.139617 UTC    Item      1003812607    "3640,0130,0131,2306,3638,0127,2789,2301"

mic_item_discrepancy:

ID            MISSING_STORE
---------------------------
1003812607    3102
1003812607    2681
1003812607    2686
1003812607    2670
1003812607    2682
1003812607    3101
1003812607    2673
1003812607    2669
1003812607    3103
1003812607    2668

I came up with the query below, but it does not work as expected: it returns the wrong data because the mic_item_discrepancy table has multiple rows for the same id.

SELECT 
    item.id, ich.id, ich.existing_stores, mid.missing_stores
FROM 
    `hd-merch-prod.merch_item_cache_validation.item` item 
JOIN
    `hd-merch-prod.merch_item_cache.item_change_history` ich 
         ON item.date = "2023-08-03" 
         AND DATE(ich.createdTime) = item.date 
         AND ich.id = item.id  
JOIN
    `hd-merch-prod.merch_item_cache.mic_item_discrepancy` mid 
         ON item.id = mid.id
GROUP BY
    item.id, ich.id, ich.existing_stores, mid.missing_stores;

Answer 1

Score: 2

Try using a LEFT JOIN for both the item_change_history and mic_item_discrepancy tables, so that every row from the item table is kept in the result, and collapse mic_item_discrepancy to one row per ID in a subquery by adding DISTINCT inside STRING_AGG:

SELECT 
    item.id, ich.id, ich.existing_stores, COALESCE(mid.missing_stores, '') AS missing_stores
FROM 
    `hd-merch-prod.merch_item_cache_validation.item` item 
LEFT JOIN
    (
      SELECT id, existing_stores, MAX(createdTime) as latest_createdTime
      FROM 
        `hd-merch-prod.merch_item_cache.item_change_history`
      WHERE 
        DATE(createdTime) = '2023-08-03'
      GROUP BY 
        id, existing_stores
    ) ich ON item.id = ich.id  
LEFT JOIN
    (
      SELECT ID, STRING_AGG(DISTINCT MISSING_STORE, ',') AS missing_stores 
      FROM 
        `hd-merch-prod.merch_item_cache.mic_item_discrepancy`
      GROUP BY 
        ID
    ) mid ON item.id = mid.ID;
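
The pre-aggregation idea above can be tried without access to the real tables. The following is a minimal sketch, not the definitive query: the sample rows from the question are inlined as CTEs, MISSING_STORE is assumed to be a STRING column (if it is INT64, it would need CAST(missing_store AS STRING) inside STRING_AGG), and the CTE name missing is made up for illustration. It also filters item to the single date of interest, which is an addition to the query above; the item table holds four rows for the same id, so without that filter each of them would produce an output row.

```sql
-- Sample data inlined from the question so the query is self-contained.
WITH item AS (
  SELECT 1003812607 AS id, DATE '2023-08-03' AS date UNION ALL
  SELECT 1003812607, DATE '2023-08-01' UNION ALL
  SELECT 1003812607, DATE '2023-07-23' UNION ALL
  SELECT 1003812607, DATE '2023-06-30'
),
item_change_history AS (
  SELECT TIMESTAMP '2023-08-03 11:01:10.139617+00' AS createdTime,
         'Item' AS docType,
         1003812607 AS id,
         '3640,0130,0131,2306,3638,0127,2789,2305' AS existing_stores
  UNION ALL
  SELECT TIMESTAMP '2023-07-01 09:01:10.139617+00',
         'Item',
         1003812607,
         '3640,0130,0131,2306,3638,0127,2789,2301'
),
-- Assumption: missing_store is stored as STRING.
mic_item_discrepancy AS (
  SELECT 1003812607 AS id, missing_store
  FROM UNNEST(['3102', '2681', '2686', '2670', '2682',
               '3101', '2673', '2669', '3103', '2668']) AS missing_store
),
-- Collapse the discrepancy table to one row per id BEFORE joining,
-- so the join cannot multiply rows.
missing AS (
  SELECT id, STRING_AGG(DISTINCT missing_store, ',') AS missing_stores
  FROM mic_item_discrepancy
  GROUP BY id
)
SELECT
  item.id,
  ich.existing_stores,
  missing.missing_stores
FROM item
JOIN item_change_history AS ich
  ON ich.id = item.id
 AND DATE(ich.createdTime) = item.date   -- DATE() on a TIMESTAMP uses UTC by default
LEFT JOIN missing
  ON missing.id = item.id
WHERE item.date = DATE '2023-08-03';
```

With the sample data this returns a single row for id 1003812607. The order of the values inside missing_stores is not guaranteed; adding ORDER BY missing_store inside STRING_AGG would make it deterministic, although sorted rather than in the order shown in the expected result.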
