Question

I am trying to build a Dataflow job that streams change data from Cloud Spanner to a Pub/Sub topic. After I provide the necessary information and click Create job, it fails immediately with the following error:
"Failed to start the VM, launcher-202305251132348291864736406387748, used for launching because of status code: INVALID_ARGUMENT, reason: Invalid Error: Message: Invalid value for field 'resource.networkInterfaces[0].network': 'global/networks/default'. The referenced network resource cannot be found. HTTP Code: 400."

It is unclear whether I need to create a VPC, since the GCP documentation does not list one as a requirement or say what it would be for. If a VPC is that important, why doesn't the GCP documentation mention it?

A second question, if you know the answer: when I created the change stream in Cloud Spanner, I did not create a separate instance or database to store the change stream metadata, because my table and its changes won't amount to thousands of rows. It will be quite small, holding only text values in 8 columns. Is it fine to keep the metadata and the actual data in the same database?

Answer 1

Score: 1

I found the answer: I only need to provide the subnetwork value, and the rest of the network information is derived from it.
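For anyone launching the job from the command line instead of the console, here is a minimal sketch of how the subnetwork value could be passed. It assumes the short `regions/REGION/subnetworks/NAME` form that Dataflow accepts, the `--subnetwork` flag of `gcloud dataflow flex-template run`, and the Google-provided `Spanner_Change_Streams_to_PubSub` template path. The job, project, and subnet names and the parameter values are placeholders I made up, and the template parameter names should be checked against the template documentation before use.

```python
import shlex


def subnetwork_path(region: str, subnetwork: str) -> str:
    """Build the short subnetwork reference: regions/REGION/subnetworks/NAME."""
    return f"regions/{region}/subnetworks/{subnetwork}"


def build_launch_command(job_name, project, region, subnetwork, parameters):
    """Return the gcloud argv for launching the flex template on a given subnetwork.

    The template location below is an assumption about where the
    Google-provided template lives; verify it for your region.
    """
    template = (
        f"gs://dataflow-templates-{region}/latest/flex/"
        "Spanner_Change_Streams_to_PubSub"
    )
    # Template parameters are passed as one comma-separated key=value list.
    params = ",".join(f"{key}={value}" for key, value in parameters.items())
    return [
        "gcloud", "dataflow", "flex-template", "run", job_name,
        f"--project={project}",
        f"--region={region}",
        f"--template-file-gcs-location={template}",
        f"--subnetwork={subnetwork_path(region, subnetwork)}",
        f"--parameters={params}",
    ]


# Placeholder values for illustration only.
command = build_launch_command(
    job_name="spanner-cdc-to-pubsub",
    project="my-project",
    region="us-central1",
    subnetwork="my-subnet",
    parameters={"spannerDatabase": "my-db", "spannerMetadataDatabase": "my-db"},
)
print(shlex.join(command))
```

If the subnetwork lives in a Shared VPC host project, the full URL form (`https://www.googleapis.com/compute/v1/projects/HOST_PROJECT_ID/regions/REGION/subnetworks/NAME`) is needed instead of the short form.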

Regarding the second question, I can use the same database. GCP will create another metadata table, but when configuring the Dataflow job I should still set the database name and the metadata database name to the same database, not to the table GCP created.

Answer 2

Score: 0

It is recommended to use a separate database for the metadata table in the change stream pipeline (https://cloud.google.com/spanner/docs/change-streams/use-dataflow#metadata). If you are using the same database, make sure the change stream is not tracking the metadata tables created for the pipeline, since that will cause unnecessary generation of change records.
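One way to keep the metadata tables out of the stream when sharing a database is to have the change stream watch an explicit list of tables rather than `FOR ALL`, which also picks up tables created later. The sketch below only builds the DDL string; the helper `change_stream_ddl` and the stream and table names are hypothetical, and this is an illustration of the idea rather than something the answer prescribes.

```python
import re

# Conservative identifier check so names can be placed in DDL unquoted.
_IDENTIFIER = re.compile(r"^[A-Za-z][A-Za-z0-9_]*$")


def change_stream_ddl(stream_name, tables):
    """Build a CREATE CHANGE STREAM statement that watches only the given tables."""
    if not tables:
        raise ValueError("list at least one table to watch")
    for name in [stream_name, *tables]:
        if not _IDENTIFIER.match(name):
            raise ValueError(f"unsupported identifier: {name!r}")
    return f"CREATE CHANGE STREAM {stream_name} FOR {', '.join(tables)}"


# Hypothetical names: only the application tables are watched, so metadata
# tables created in the same database are not tracked.
print(change_stream_ddl("OrdersStream", ["Orders", "Customers"]))
```

The resulting statement can be applied like any other schema update, for example through the console or `gcloud spanner databases ddl update`.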

In addition, you should not pass in the metadata table name. The Dataflow pipeline will create the metadata table for you automatically.
