Inserting value stored in a pipeline variable as a new row of sink dataset data
Question
In ADF, I generate pipeline variables such as the pipeline run time, rows copied count, copy status etc. from various activities. I then use Set Variable to create a JSON string consisting of the above values. Now I want to store this string into an existing CosmosDB as a new record.
I cannot use Azure Functions or a Notebook job to insert, as I want it to be possible within ADF without additional dependencies.
I have tried using Data Flow, but it is unable to add a new record from values in a pipeline variable. I cannot use a separate data source to union and add into the CosmosDB as the values I want inserted are generated during runtime.
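The JSON string the question describes can be built directly in a Set Variable activity expression. A minimal sketch of such an activity in the pipeline JSON is below; the preceding activity name `Copy data` and the variable name `runSummary` are assumptions, while `rowsCopied` and `executionDetails[0].status` are standard Copy activity output properties:

```json
{
  "name": "Build run summary",
  "type": "SetVariable",
  "typeProperties": {
    "variableName": "runSummary",
    "value": {
      "value": "@concat('{\"runTime\":\"', pipeline().TriggerTime, '\",\"rowsCopied\":', string(activity('Copy data').output.rowsCopied), ',\"status\":\"', activity('Copy data').output.executionDetails[0].status, '\"}')",
      "type": "Expression"
    }
  }
}
```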
Answer 1
Score: 0
You can use the individual pipeline variables such as `pipeline run time`, `copied rows count`, and `copy status` as they are (without creating a JSON document consisting of all the values) and use a Copy activity in ADF to insert the values into the Cosmos DB container. Below is the approach.
- Three pipeline variables, `pipeline run time`, `copied rows count`, and `copy status`, are taken here and their values are set using Set Variable activities.
- Then a Copy activity is taken with a dummy source dataset that has only one row. In the Source tab of the Copy activity, new columns are added by clicking `+New` under Additional columns. All three columns are set to the values defined by the Set Variable activities.
- Then a sink dataset is taken for Cosmos DB, with `insert` as the write behavior.
- Then, in the Mapping section, the schema is imported by giving sample values for the pipeline variables, and all the additional columns created in the source are mapped to the sink columns.
- When the pipeline is run, the data is inserted into Cosmos DB.
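Put together, the steps above correspond to a Copy activity roughly like the following in the pipeline JSON. This is a sketch, not the exact definition: the dataset names `DummySourceDataset` and `CosmosDbSinkDataset` and the variable names are assumptions, and the source type (`DelimitedTextSource` here) depends on the format of your dummy dataset:

```json
{
  "name": "Insert run metadata",
  "type": "Copy",
  "typeProperties": {
    "source": {
      "type": "DelimitedTextSource",
      "additionalColumns": [
        { "name": "pipelineRunTime", "value": { "value": "@variables('pipelineRunTime')", "type": "Expression" } },
        { "name": "copiedRowsCount", "value": { "value": "@variables('copiedRowsCount')", "type": "Expression" } },
        { "name": "copyStatus", "value": { "value": "@variables('copyStatus')", "type": "Expression" } }
      ]
    },
    "sink": {
      "type": "CosmosDbSqlApiSink",
      "writeBehavior": "insert"
    }
  },
  "inputs": [ { "referenceName": "DummySourceDataset", "type": "DatasetReference" } ],
  "outputs": [ { "referenceName": "CosmosDbSinkDataset", "type": "DatasetReference" } ]
}
```

The additional columns are appended to the single dummy source row at copy time, which is what makes runtime-generated values insertable without a real source for them.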