2023年3月9日 20:15:45go评论97阅读模式

英文:

Retry task on a Windows node if unreachable

问题

有办法在Windows节点临时不可达时重试任务吗？

例如，我尝试过

- name: Hello
  ansible.windows.win_powershell:
    script: | 
      Write-Host &quot;hello&quot;
  register: _status
  until: _status is not unreachable
  retries: 3
  delay: 200

但是，30秒后，我得到了

fatal: [mylocalwin]: UNREACHABLE! =&gt; changed=false 
  msg: &#39;certificate: HTTPSConnectionPool(host=&#39;&#39;xxx.xxx.xxx.xxx&#39;&#39;, port=5986): Max retries exceeded with url: /wsman (Caused by ConnectTimeoutError(&lt;urllib3.connection.HTTPSConnection object at 0x7f4160b63eb0&gt;, &#39;&#39;Connection to xxx.xxx.xxx.xxx timed out. (connect timeout=30)&#39;&#39;))&#39;
  unreachable: true

我想在失败之前重试三次。

英文:

Is there a way to retry a task if the Windows node is temporarily unreachable?

For example, I tried

- name: Hello
  ansible.windows.win_powershell:
    script: | 
      Write-Host &quot;hello&quot;
  register: _status
  until: _status is not unreachable
  retries: 3
  delay: 200

But, after 30 seconds, I got

fatal: [mylocalwin]: UNREACHABLE! =&gt; changed=false 
  msg: &#39;certificate: HTTPSConnectionPool(host=&#39;&#39;xxx.xxx.xxx.xxx&#39;&#39;, port=5986): Max retries exceeded with url: /wsman (Caused by ConnectTimeoutError(&lt;urllib3.connection.HTTPSConnection object at 0x7f4160b63eb0&gt;, &#39;&#39;Connection to xxx.xxx.xxx.xxx timed out. (connect timeout=30)&#39;&#39;))&#39;
  unreachable: true

I would like to retry three times before failing.

答案1

得分: 1

以下是您要翻译的内容：

Here there is my solution based on https://github.com/ansible/ansible/issues/25532#issuecomment-428386816

Modify

/lib/python3.10/site-packages/winrm/protocol.py

class Protocol(object):
def init(
...
reconnection_retries=0,
reconnection_backoff_factor=2.0
):
...

self.transport = Transport(
...
reconnection_retries=reconnection_retries,
reconnection_backoff_factor=reconnection_backoff_factor
)

/lib/python3.10/site-packages/winrm/transport.py

class Transport(object):
def init(
...
reconnection_retries=0,
reconnection_backoff_factor=2.0):

...
self.reconnection_retries = reconnection_retries
self.reconnection_backoff_factor = reconnection_backoff_factor
...

def build_session(self):
...

Merge proxy environment variables

settings = session.merge_environment_settings(url=self.endpoint,
proxies=proxies, stream=None, verify=None, cert=None)

ADD

Retry on connection errors, with a backoff factor

retries = requests.packages.urllib3.util.retry.Retry(total=self.reconnection_retries,
connect=self.reconnection_retries,
status=self.reconnection_retries,
read=0,
backoff_factor=self.reconnection_backoff_factor,
status_forcelist=(413, 425, 429, 503))

ADD

session.mount('http://', requests.adapters.HTTPAdapter(max_retries=retries))
session.mount('https://', requests.adapters.HTTPAdapter(max_retries=retries))
...

Now it is possible to control the retry when the node is unreachable

name: Test
hosts: mylocalwin
gather_facts: false
vars:
ansible_winrm_reconnection_backoff_factor: 2.0
ansible_winrm_reconnection_retries: 4

tasks:

name: Hello
ansible.windows.win_powershell:
script: |
Write-Host "hello";

I checked the solution with tcpdump and I can confirm then the TCP SYN groups are re-sent for reconnection_retries times.

Here there is a small recap about performaces

TYPE ERROR DETECTION (sec) NUM OF TCP SYN SENT
RETRY_0_BACKOFF_2 30 5
RETRY_1_BACKOFF_2 60 10
RETRY_2_BACKOFF_2 94 15
RETRY_3_BACKOFF_2 133 20
RETRY_4_BACKOFF_2 179 25
RETRY_5_BACKOFF_2 240 30
NO_RETRY_MECHANISM 30 5

英文:

Here there is my solution based on https://github.com/ansible/ansible/issues/25532#issuecomment-428386816

Modify

/lib/python3.10/site-packages/winrm/protocol.py

class Protocol(object):
    def __init__(
            ...
            reconnection_retries=0,
            reconnection_backoff_factor=2.0
        ):
        ...
        
        self.transport = Transport(
            ...      
            reconnection_retries=reconnection_retries,
            reconnection_backoff_factor=reconnection_backoff_factor
        )

/lib/python3.10/site-packages/winrm/transport.py

class Transport(object):
    def __init__(
        ...
        reconnection_retries=0,
        reconnection_backoff_factor=2.0):
        
        ...
        self.reconnection_retries = reconnection_retries
        self.reconnection_backoff_factor = reconnection_backoff_factor
        ...
        
    def build_session(self):
        ...
        
        # Merge proxy environment variables
        settings = session.merge_environment_settings(url=self.endpoint,
                      proxies=proxies, stream=None, verify=None, cert=None)
        # ADD
        # Retry on connection errors, with a backoff factor
        retries = requests.packages.urllib3.util.retry.Retry(total=self.reconnection_retries,
                                                             connect=self.reconnection_retries,
                                                             status=self.reconnection_retries,
                                                             read=0,
                                                             backoff_factor=self.reconnection_backoff_factor,
                                                             status_forcelist=(413, 425, 429, 503))
        # ADD
        session.mount(&#39;http://&#39;, requests.adapters.HTTPAdapter(max_retries=retries))
        session.mount(&#39;https://&#39;, requests.adapters.HTTPAdapter(max_retries=retries))  
        ...

Now it is possible to control the retry when the node is unreachable

- name: Test
  hosts: mylocalwin
  gather_facts: false
  vars:
    ansible_winrm_reconnection_backoff_factor: 2.0
    ansible_winrm_reconnection_retries: 4

  tasks:
    - name: Hello
      ansible.windows.win_powershell:
        script: | 
          Write-Host &quot;hello&quot;

I checked the solution with tcpdump and I can confirm then the TCP SYN groups are re-sent for reconnection_retries times.

Here there is a small recap about performaces

TYPE	            ERROR DETECTION (sec)	NUM OF TCP SYN SENT
RETRY_0_BACKOFF_2	30	                    5
RETRY_1_BACKOFF_2	60	                    10
RETRY_2_BACKOFF_2	94	                    15
RETRY_3_BACKOFF_2	133	                    20
RETRY_4_BACKOFF_2	179	                    25
RETRY_5_BACKOFF_2	240	                    30
NO_RETRY_MECHANISM	30	                    5

通过集体智慧和协作来改善编程学习和解决问题的方式。致力于成为全球开发者共同参与的知识库，让每个人都能够通过互相帮助和分享经验来进步。

在 Windows 节点上如果无法访问，重试任务。

问题

答案1

Merge proxy environment variables

ADD

Retry on connection errors, with a backoff factor

ADD

ansible –ask-become-pass hangs, seeming waiting for the password

在指定的群组上运行playbook？

在CentOS 7上通过Ansible的yum安装Xfce。

如何在使用Ansible在Kubernetes集群上安装服务时修改清单文件？

What's the correct way to type hint an empty list as a literal in python?

如何在Highcharts Gantt中更改本地化的星期名称

如何在同一个流中使用多个过滤器和映射函数？

如何使用Map/Set来将代码优化到O(n)？

.NET MAUI Android在GitHub Actions上构建失败，错误代码为1。

如何在Playwright视觉比较中屏蔽多个定位器？

在C++中，可以使用可变模板参数来检索类型的内部类型。

selenium.common.exceptions.StaleElementReferenceException: Message: stale element reference: stale element not found

Creating and opening a URL to log in to Website via Basic Auth with Robot Framework/Selenium (Python)

AG Grid 在上下文菜单中以大文本形式打开

发表评论