AWS EMR网络连接

huangapple go评论46阅读模式
英文:

AWS EMR network connection

问题

我正在尝试将一个包安装到EMR集群中。每次我都收到以下错误:

警告:在连接中断后进行重试(Retry(total=4, connect=None, read=None, redirect=None, status=None))后,连接失败 'NewConnectionError('<pip._vendor.urllib3.connection.HTTPSConnection object at 0x7f5af4e9dad0>: 无法建立新连接:[Errno 101] 网络不可达')':/simple/fastparquet/

我应该怎么做?

我尝试通过ping google.com,但问题仍然存在。

英文:

I am trying to install a package into EMR cluster. Every time I get the following error

WARNING: Retrying (Retry(total=4, connect=None, read=None, redirect=None, status=None)) after connection broken by 'NewConnectionError('<pip._vendor.urllib3.connection.HTTPSConnection object at 0x7f5af4e9dad0>: Failed to establish a new connection: [Errno 101] Network is unreachable')': /simple/fastparquet/

What should i do?

I tried to ping into google.com but same problem persisits.

答案1

得分: 1

在Jupyter笔记本环境中使用PySpark进行交互时,请考虑使用EMR Studio(而不是EMR笔记本)。它需要一些基础的设置工作

之后,您可以在笔记本级别安装库。

列出当前的库:

sc.list_packages()

安装Celery(例如)库:

sc.install_pypi_package("celery")

参考链接:https://docs.aws.amazon.com/emr/latest/ManagementGuide/emr-managed-notebooks-installing-libraries-and-kernels.html#emr-managed-notebooks-work-with-libraries

英文:

To interact using PySpark in a Jupyter notebook environment, consider using EMR Studio (not EMR notebook). It requires some foundation setup work.

Afterwards, you can install libraries on the notebook level.

Listing current libraries:

sc.list_packages()

Installing the Celery (for example) library

sc.install_pypi_package(&quot;celery&quot;)

Ref: https://docs.aws.amazon.com/emr/latest/ManagementGuide/emr-managed-notebooks-installing-libraries-and-kernels.html#emr-managed-notebooks-work-with-libraries

huangapple
  • 本文由 发表于 2023年5月11日 16:55:36
  • 转载请务必保留本文链接:https://go.coder-hub.com/76225816.html
匿名

发表评论

匿名网友

:?: :razz: :sad: :evil: :!: :smile: :oops: :grin: :eek: :shock: :???: :cool: :lol: :mad: :twisted: :roll: :wink: :idea: :arrow: :neutral: :cry: :mrgreen:

确定