问题

I have a Kedro Pipeline Node that on AWS Lambda that accesses s3. It runs if I'm not using torch but fails with "Install s3fs to access S3" if I add torch as a dependency.

I have a Kedro Pipeline I want to deploy on AWS Step Functions.
My requirements look like this:

Python3.9

Pillow==9.5.0
aws_lambda_powertools==2.15.0
fsspec==2023.5.0
kedro==0.18.8
numpy==1.24.3
pandas==2.0.1
pydantic==1.10.7
pytest==7.3.1
rasterio==1.3.6
rawpy==0.18.1
s3fs==2023.5.0

The lambda access some data on s3.
With this setup everything runs fine.

However if I add torch,

torch==2.0.1+cpu -f https://download.pytorch.org/whl/torch_stable.html
torchvision==0.15.2+cpu -f https://download.pytorch.org/whl/torch_stable.html

I get the following error:

{
  "errorMessage": "\nInstall s3fs to access S3.\nFailed to instantiate DataSet 'companies' of type 'kedro.extras.datasets.pandas.csv_dataset.CSVDataSet'.",
  "errorType": "DataSetError",
  "requestId": "3da771f3-af50-49a9-98de-0a6d924018f2",
  "stackTrace": [
    "  File \"/home/app/lambda_handler.py\", line 18, in handler\n    session.run(node_names=[node_to_run])\n",
    "  File \"/home/app/kedro/framework/session/session.py\", line 413, in run\n    catalog = context._get_catalog(\n",
    "  File \"/home/app/kedro/framework/context/context.py\", line 287, in _get_catalog\n    catalog = settings.DATA_CATALOG_CLASS.from_config(\n",
    "  File \"/home/app/kedro/io/data_catalog.py\", line 277, in from_config\n    data_sets[ds_name] = AbstractDataSet.from_config(\n",
    "  File \"/home/app/kedro/io/core.py\", line 162, in from_config\n    raise DataSetError(\n"
  ]
}

This error is also just appearing in the lambda. if I install all those requirements locally on my linux it runs fine.

英文:

I have a Kedro Pipeline Node that on AWS Lambda that accesses s3. It runs if I'm not using torch but fails with Install s3fs to access S3 if I add torch as a dependency.

I have a Kedro Pipeline I want to deploy on AWS Step Functions.
My requirements look like this:

Python3.9

Pillow==9.5.0
aws_lambda_powertools==2.15.0
fsspec==2023.5.0
kedro==0.18.8
numpy==1.24.3
pandas==2.0.1
pydantic==1.10.7
pytest==7.3.1
rasterio==1.3.6
rawpy==0.18.1
s3fs==2023.5.0

The lambda access some data on s3.
With this setup everything runs fine.

However if I add torch,

torch==2.0.1+cpu -f https://download.pytorch.org/whl/torch_stable.html
torchvision==0.15.2+cpu -f https://download.pytorch.org/whl/torch_stable.html

I get the following error:

{
  &quot;errorMessage&quot;: &quot;\nInstall s3fs to access S3.\nFailed to instantiate DataSet &#39;companies&#39; of type &#39;kedro.extras.datasets.pandas.csv_dataset.CSVDataSet&#39;.&quot;,
  &quot;errorType&quot;: &quot;DataSetError&quot;,
  &quot;requestId&quot;: &quot;3da771f3-af50-49a9-98de-0a6d924018f2&quot;,
  &quot;stackTrace&quot;: [
    &quot;  File \&quot;/home/app/lambda_handler.py\&quot;, line 18, in handler\n    session.run(node_names=[node_to_run])\n&quot;,
    &quot;  File \&quot;/home/app/kedro/framework/session/session.py\&quot;, line 413, in run\n    catalog = context._get_catalog(\n&quot;,
    &quot;  File \&quot;/home/app/kedro/framework/context/context.py\&quot;, line 287, in _get_catalog\n    catalog = settings.DATA_CATALOG_CLASS.from_config(\n&quot;,
    &quot;  File \&quot;/home/app/kedro/io/data_catalog.py\&quot;, line 277, in from_config\n    data_sets[ds_name] = AbstractDataSet.from_config(\n&quot;,
    &quot;  File \&quot;/home/app/kedro/io/core.py\&quot;, line 162, in from_config\n    raise DataSetError(\n&quot;
  ]
}

This error is also just appearing in the lambda. if I install all those requirements locally on my linux it runs fine.

答案1

得分: 1

我成功解决了这个问题。Torch安装了一个与s3fs不兼容的urllib3版本。所以我不得不使用"urllib3<2"来安装torch，然后它就可以工作了。

英文:

I was able to solve this issue. Torch did install a version of urllib3 that was incompatible with s3fs. So what I had to do was install torch with "urllib3<2" and it worked.

通过集体智慧和协作来改善编程学习和解决问题的方式。致力于成为全球开发者共同参与的知识库，让每个人都能够通过互相帮助和分享经验来进步。

“S3FS在使用torch时在lambda中未被识别。”

问题

答案1

如何纠正我对子情节的误解

Amazon Bedrock类在通过Lambda函数调用时无法加载我的凭据。

将BigQuery的输出从Python保存为JSON。

需要输入搜索行的Excel

What's the correct way to type hint an empty list as a literal in python?

如何在Highcharts Gantt中更改本地化的星期名称

如何在同一个流中使用多个过滤器和映射函数？

如何使用Map/Set来将代码优化到O(n)？

.NET MAUI Android在GitHub Actions上构建失败，错误代码为1。

如何在Playwright视觉比较中屏蔽多个定位器？

在C++中，可以使用可变模板参数来检索类型的内部类型。

selenium.common.exceptions.StaleElementReferenceException: Message: stale element reference: stale element not found

Creating and opening a URL to log in to Website via Basic Auth with Robot Framework/Selenium (Python)

AG Grid 在上下文菜单中以大文本形式打开

发表评论