问题

我是新来的Palantir Foundry。我需要知道如何自动化数据流程？如果我们在数据同步时进行数据摄取并设置计划，是否与在完成所有操作（如编码到数据转换等）并在数据血统处查看数据时进行数据流程调度相同？这两者是否相同？这是否意味着自动化数据流程？请有人能解释一下。谢谢。

英文:

I'm new to Palantir Foundery. I need to know how to automate the data pipeline? Is that same if we do the data ingestion to foundry while setting schedule at data sync? OR after all done (such as coding to data transformation etc.) when we see the data lineage at that point we can do the scheduling to the data pipeline. Is that both same ? Is that the meaning to say automate the data pipeline? Please some one can explain me. Thank you

答案1

得分: 1

在不清楚上下文的情况下，很难理解什么最适合，但通常情况下，“自动化数据管道”确实可以指调度。

关于调度器的文档可以在这里找到。调度器操作的对象是数据集。数据集可以通过从其他系统中摄取数据或通过各种方式定义的转换（如代码存储库、管道构建器等）来填充。

英文:

It is hard to understand what is best suited if the context of the ask is unclear to you in the first place, but usually "automating a data pipeline" can indeed refer to scheduling.

Docs about Schedulers can be found here. Schedulers are operating on Datasets. Datasets can be filled with data ingested from other systems or generated via transforms (defined in many ways: code repositories, pipeline builder, etc.).

通过集体智慧和协作来改善编程学习和解决问题的方式。致力于成为全球开发者共同参与的知识库，让每个人都能够通过互相帮助和分享经验来进步。

如何在Palantir Foundry中自动化数据管道？

问题

答案1

在集群中的n个Web服务器之间进行文件同步

如何在Foundry中保存历史数据集

在多个 goroutine 之间共享的 Golang 结构体中，非共享成员需要互斥保护吗？

在Umbraco 11中以编程方式设置内容计划安排。

What's the correct way to type hint an empty list as a literal in python?

如何在Highcharts Gantt中更改本地化的星期名称

如何在同一个流中使用多个过滤器和映射函数？

如何使用Map/Set来将代码优化到O(n)？

.NET MAUI Android在GitHub Actions上构建失败，错误代码为1。

如何在Playwright视觉比较中屏蔽多个定位器？

在C++中，可以使用可变模板参数来检索类型的内部类型。

selenium.common.exceptions.StaleElementReferenceException: Message: stale element reference: stale element not found

Creating and opening a URL to log in to Website via Basic Auth with Robot Framework/Selenium (Python)

AG Grid 在上下文菜单中以大文本形式打开

发表评论