requests.exceptions.InvalidSchema: no connection adapters were found. I'm trying to iterate through a list

Question

I'm trying to iterate through a list of URLs and use requests and BeautifulSoup to extract the title of each URL.

But I keep getting this error:

> requests.exceptions.InvalidSchema: No connection adapters were found for "['https://reddit.com/?feed=home', 'https://reddit.com/chunkCSS/CollectionCommentsPage~CommentsPage~CountryPage~Frontpage~GovernanceReleaseNotesModal~ModListing~Mod~e3d63e32.74eb929a3827c754ba25_.css', 'https://reddit.com/chunkCSS/CountryPage~Frontpage~ModListing~Multireddit~ProfileComments~ProfileOverview~ProfilePosts~Subreddit.e72fce90a7f3165091b9_.css', 'https://reddit.com/chunkCSS/Frontpage.85a25b7700617eafa94b_.css', 'https://reddit.com/?feed=home', 'https://reddit.com/r/popular/']"

The Code:

    pages = []
    for admin_login_pages in domains:
        with open("urls.txt", "w") as f:
            f.write(admin_login_pages)
        if "admin" in admin_login_pages:
            if "login" in admin_login_pages:
                pages.append(admin_login_pages)
        with open("urls.txt", "r") as fread:
            url_list = [x.strip() for x in fread.readlines()]
            r = requests.get(str(url_list))
            soup = BeautifulSoup(r.content, 'html.parser')
            for title in soup.find_all('title'):
                print(f"{admin_login_pages} - {title.get_text()}")
    if not pages:
        print(f"{Fore.RED} No admin or login pages Found")
    else:
        for page_list in pages:
            print(f"{Fore.GREEN} {page_list}")

Answer 1

Score: 0

As I stated in the comment, you are feeding the string representation of the entire list to requests as a single URL. That isn't going to work. Iterate over url_list instead and make a request to each URL separately.
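
The failure is easy to reproduce in isolation. Here is a minimal sketch (the two URLs are just stand-ins taken from the error message):

    # A hypothetical url_list, like the one read from urls.txt.
    url_list = ["https://reddit.com/?feed=home", "https://reddit.com/r/popular/"]

    # str() turns the whole list into one string, brackets and quotes included:
    print(str(url_list))
    # ['https://reddit.com/?feed=home', 'https://reddit.com/r/popular/']

    # requests looks for a scheme such as "https://" at the start of the URL.
    # It finds "['https..." instead, so no connection adapter matches and it
    # raises requests.exceptions.InvalidSchema.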

Here is a slightly refactored version of the code as an example:

    import requests
    from bs4 import BeautifulSoup
    from colorama import Fore

    pages = []

    # Read the saved URLs back, one per line.
    with open("urls.txt", "r") as fread:
        url_list = [x.strip() for x in fread.readlines()]

    with open("urls.txt", "w") as f:
        # "domains" comes from the surrounding code in the question.
        for admin_login_pages in domains:
            # Add a newline so the file stays one URL per line.
            f.write(admin_login_pages + "\n")

            if "admin" in admin_login_pages and "login" in admin_login_pages:
                pages.append(admin_login_pages)

            for url in url_list:
                # Request each URL individually instead of str(url_list).
                r = requests.get(url)
                soup = BeautifulSoup(r.content, "html.parser")

                title = soup.find("title")
                if title:  # Guard against pages without a <title> tag.
                    print(f"{admin_login_pages} - {title.get_text()}")

    if not pages:
        print(f"{Fore.RED} No admin or login pages found")
    else:
        for page_list in pages:
            print(f"{Fore.GREEN} {page_list}")
