2023年1月9日 08:53:09go评论107阅读模式

英文:

Setting the proxy using Selenium and Docker

问题

我在使用代理进行网络爬虫时遇到了问题。我使用了容器化的Python代码和selenium/standalone-chrome镜像。我尝试了类似以下的代码来传递参数，但Chrome实例似乎忽略了它。我有一个示例爬虫，从ident.me网页上抓取IP地址，但它返回了我的机器IP。

def get_chrome_driver(proxy):
    proxy = str(proxy)
    chrome_options = webdriver.ChromeOptions()
    chrome_options.add_argument('--proxy=%s' % proxy)
    chrome_options.add_argument("--no-sandbox")
    chrome_options.add_argument("--headless")
    chrome_options.add_argument("--disable-gpu")
    driver = webdriver.Remote(
        command_executor='http://chrome:4444/wd/hub',
        options=chrome_options
    )
    return driver

英文:

I have a trouble during using proxy for scraping. I use dockerized Python code and

selenium/standalone-chrome

image.
I tried something like this

def get_chrome_driver(proxy):
    proxy = str(proxy)
    chrome_options = webdriver.ChromeOptions()
    chrome_options.add_argument(&#39;--proxy=%s&#39; % proxy)
    chrome_options.add_argument(&quot;--no-sandbox&quot;)
    chrome_options.add_argument(&quot;--headless&quot;)
    chrome_options.add_argument(&quot;--disable-gpu&quot;)
    driver = webdriver.Remote(
        command_executor=&#39;http://chrome:4444/wd/hub&#39;,
        options=webdriver.ChromeOptions()
        )
    return driver

to pass the parameters but the Chrome instance seems to ignore it. I have example scraper scraping IP address from ident.me webpage and it returns my machine's IP.

答案1

得分: 1

你正在使用这行代码为驱动程序实例保存默认选项

options=webdriver.ChromeOptions()

你需要设置你创建的选项

options=chrome_options

英文:

you are saving default options with this line for the driver instance

options=webdriver.ChromeOptions()

you need to set your created options

options=chrome_options

通过集体智慧和协作来改善编程学习和解决问题的方式。致力于成为全球开发者共同参与的知识库，让每个人都能够通过互相帮助和分享经验来进步。

使用Selenium和Docker设置代理。

问题

答案1

“Arrows Directions, 两个单独的箭头，双向但不是双向箭头，通过一个箭头表示”

删除包含 None 的逗号之间的字符串。

运行一个带有额外参数的Python程序。

Scipy平滑样条的偏导数不起作用。

如何在Playwright视觉比较中屏蔽多个定位器？

在C++中，可以使用可变模板参数来检索类型的内部类型。

selenium.common.exceptions.StaleElementReferenceException: Message: stale element reference: stale element not found

Creating and opening a URL to log in to Website via Basic Auth with Robot Framework/Selenium (Python)

AG Grid 在上下文菜单中以大文本形式打开

What's the correct way to type hint an empty list as a literal in python?

如何在Highcharts Gantt中更改本地化的星期名称

如何在同一个流中使用多个过滤器和映射函数？

如何使用Map/Set来将代码优化到O(n)？

.NET MAUI Android在GitHub Actions上构建失败，错误代码为1。