Google Analytics 4 streaming export to BigQuery
Question
I find the streaming export documentation very ambiguous. It doesn't go into full detail about what data I will be working with once I start this type of export, so I don't know what to expect.
When the daily export is not an option because of data size, what remains is streaming the data into BigQuery. But streaming, besides the additional cost, has some major limitations: the traffic name, source, and medium data points aren't included with this type of export, and these are crucial data points in Google Analytics.
What I find confusing is this quote from the page linked above:
> User-attribution data for existing users is included but that data requires ~24 hours to fully process, so we recommend not relying on that data from the streaming export and instead getting user-attribution data from the full daily export.
Can anyone who has tried the streaming export confirm whether this means that user-attribution data will not be available in the "intraday" tables, but will be available in the "daily" table? If so, does this mean that the daily table can gather more than 1,000,000 events per day in this case? And do we need to have both the streaming and daily exports turned on to be able to gather this information?
Answer 1
Score: 1
First of all, you can find a sample dataset of GA4 export here.
For most use cases the daily export is enough, but there is a delay between data collection and the data becoming available in BQ. If you need data for the current day, you will find it in the intraday table. The daily table can export more than 1M events per day, but you will need GA4 360 (the paid version). The streaming export (intraday) does not have this limitation.
Usually you do not need the user traffic source/medium. These user dimensions are just the traffic source/medium of the user's first visit. I assume you need the session-level traffic source/medium, which is still available in the intraday (streaming) export. If needed, you can calculate the user's first traffic source/medium from there.
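To illustrate that last point, here is a minimal Python sketch of deriving a first-touch source/medium per user from event rows. It assumes the rows have already been pulled from the intraday table and flattened so that each one is a dict with `user_pseudo_id`, `event_timestamp`, `source`, and `medium` keys. The flattening step and the helper name `first_touch_by_user` are hypothetical, not part of the export itself.

```python
from typing import Dict, Iterable, Optional, Tuple

Attribution = Tuple[Optional[str], Optional[str]]


def first_touch_by_user(events: Iterable[dict]) -> Dict[str, Attribution]:
    """Map each user_pseudo_id to the (source, medium) of its earliest attributed event.

    Assumes each event is a flat dict; the real export nests these values,
    so a flattening step (not shown) is assumed to have run beforehand.
    """
    first: Dict[str, Tuple[int, Optional[str], Optional[str]]] = {}
    for event in events:
        source = event.get("source")
        medium = event.get("medium")
        # Skip events that carry no traffic information at all.
        if source is None and medium is None:
            continue
        uid = event["user_pseudo_id"]
        ts = event["event_timestamp"]
        # Keep only the earliest attributed event seen so far for this user.
        if uid not in first or ts < first[uid][0]:
            first[uid] = (ts, source, medium)
    return {uid: (src, med) for uid, (_, src, med) in first.items()}


# Hypothetical sample rows, for illustration only.
sample_events = [
    {"user_pseudo_id": "u1", "event_timestamp": 200, "source": "newsletter", "medium": "email"},
    {"user_pseudo_id": "u1", "event_timestamp": 100, "source": "google", "medium": "organic"},
    {"user_pseudo_id": "u2", "event_timestamp": 150, "source": None, "medium": None},
    {"user_pseudo_id": "u2", "event_timestamp": 300, "source": "google", "medium": "cpc"},
]

first_touch = first_touch_by_user(sample_events)
print(first_touch)
```

Note that the result only reflects the earliest event present in the rows you pass in, so a user whose real first visit predates the queried tables would be attributed to a later session.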