英文:
Which Azure service for batch processing?
问题
Once a day I want to download multiple CSV files (could be several hundred of them), parse them, do some transforming and write them to another file or database. I want to have an overview which processing failed and which succeeded and prefereably trigger a retry on those that failed. So I would need some kind of batch/job processing in Azure. Which services could be used for that? Is that possible with these services?
- 使用 Azure 功能的存储队列
- 使用 Azure 批处理和点实例虚拟机?
What other services could I look into?
英文:
Once a day I want to download multiple CSV files (could be several hundred of them), parse them, do some transforming and write them to another file or database. I want to have an overview which processing failed and which succeeded and prefereably trigger a retry on those that failed. So I would need some kind of batch/job processing in Azure. Which services could be used for that? Is that possible with these services?
- Storage queue with Azure functions
- Azure batch with spot VMs?
What other services could I look into?
答案1
得分: 1
-
Azure Data Factory - 这项服务专为您的用例而设计,用于ETL文件。
-
Databricks - 这是数据专业人员首选的“大数据”服务,可能对您的情况过于复杂。
-
Logic Apps - 这是一个“低代码”解决方案,可以按照您所描述的方式运行。我担心,根据您将进行的转换,它可能会变得有点笨重且昂贵。通常,对于更复杂的操作,您可能需要在Azure Functions中编写代码,并从Logic Apps调用这些函数。
英文:
There are quite a few Azure Services that could perform this task. Selection would really come down to your existing skills and preference. These are the one's I'd start evaluating (but not an exhaustive list 🤪);
-
Azure Data Factory - This service is designed for precisely your use case, ETL of files.
-
Databricks - This is a "big data" service preferred by data professionals and will likely be overkill for your scenario.
-
Logic Apps - A "low code" solution that can operate as you've described. I do worry that it'll end up being a bit clunky and costly for you, depending on the transformations you'll be doing. Often for more complex activities you'll end up needing to write code in Azure Functions and call those functions from Logic Apps.
通过集体智慧和协作来改善编程学习和解决问题的方式。致力于成为全球开发者共同参与的知识库,让每个人都能够通过互相帮助和分享经验来进步。
评论