2023年7月10日 17:47:58go评论96阅读模式

英文:

Printing first line then assign to a variable

问题

for a in soup.find_all('a', href=True):
    if "/xxxxxx/xxxxxxxxxxxxx-" in a['href']:
        x = ("https://www.unkown.com" + a['href'])
        print(x[0])
    else:
        pass
        
my output:
https://www.unkown.com/xxxxx/xxxxxxxxxx
https://www.unkown.com/xxxxx/1xxxxxxxxx
https://www.unkown.com/xxxxx/2xxxxxxxxx
https://www.unkown.com/xxxxx/3xxxxxxxxx

英文:

for a in soup.find_all(&#39;a&#39;, href=True):
    if &quot;/xxxxxx/xxxxxxxxxxxxx-&quot; in a[&#39;href&#39;]:
        x = (&quot;https://www.unkown.com&quot; + a[&#39;href&#39;])
        print(x[0])
    else:
        pass
        
my output:
https://www.unkown.com/xxxxx/xxxxxxxxxx
https://www.unkown.com/xxxxx/1xxxxxxxxx
https://www.unkown.com/xxxxx/2xxxxxxxxx
https://www.unkown.com/xxxxx/3xxxxxxxxx

I want to print the first line by doing print(x[0]) I just print "h" the output is bunch of url's

答案1

得分: 1

如果您的输出 x 是：

https://www.unkown.com/xxxxx/xxxxxxxxxx
https://www.unkown.com/xxxxx/1xxxxxxxxx
https://www.unkown.com/xxxxx/2xxxxxxxxx
https://www.unkown.com/xxxxx/3xxxxxxxxx

您可以通过调用 x.split('\n')[0] 来访问第一行。

英文:

If your output x is :

https://www.unkown.com/xxxxx/xxxxxxxxxx
https://www.unkown.com/xxxxx/1xxxxxxxxx
https://www.unkown.com/xxxxx/2xxxxxxxxx
https://www.unkown.com/xxxxx/3xxxxxxxxx

You can access the first line by calling x.split('\n')[0].

答案2

得分: 0

在Python中，字符串索引（x[0]）返回该索引处的字符，而不是列表中的元素。在您的情况下，x是一个字符串，所以x[0]将返回'h'，即您的URL的第一个字符。

要仅打印第一个匹配的URL，您可以使用一个布尔标志。以下是一个示例：

first_match_found = False
for a in soup.find_all('a', href=True):
    if "/xxxxxx/xxxxxxxxxxxxx-" in a['href'] and not first_match_found:
        print("https://www.unkown.com" + a['href'])
        first_match_found = True

此脚本将在找到第一个匹配后停止打印URL。

英文:

In Python, string indexing (x[0]) returns the character at that index, not an element from a list. In your case, x is a string, so x[0] will return 'h', the first character of your URL.

To print only the first matching URL, you can use a boolean flag. Here's an example:

first_match_found = False
for a in soup.find_all(&#39;a&#39;, href=True):
    if &quot;/xxxxxx/xxxxxxxxxxxxx-&quot; in a[&#39;href&#39;] and not first_match_found:
        print(&quot;https://www.unkown.com&quot; + a[&#39;href&#39;])
        first_match_found = True

This script will stop printing URLs after the first match.

通过集体智慧和协作来改善编程学习和解决问题的方式。致力于成为全球开发者共同参与的知识库，让每个人都能够通过互相帮助和分享经验来进步。

打印第一行，然后赋值给一个变量。

问题

答案1

答案2

我无法使langchain代理模块实际执行我的提示。

作为函数参数的 Python3 中的作用域：列表。

自然语言处理句子分割相较于Python算法的好处是什么？

使用BeautifulSoup如何抓取元素的相关类别？

如何在Playwright视觉比较中屏蔽多个定位器？

在C++中，可以使用可变模板参数来检索类型的内部类型。

selenium.common.exceptions.StaleElementReferenceException: Message: stale element reference: stale element not found

Creating and opening a URL to log in to Website via Basic Auth with Robot Framework/Selenium (Python)

AG Grid 在上下文菜单中以大文本形式打开

What's the correct way to type hint an empty list as a literal in python?

如何在Highcharts Gantt中更改本地化的星期名称

如何在同一个流中使用多个过滤器和映射函数？

如何使用Map/Set来将代码优化到O(n)？

.NET MAUI Android在GitHub Actions上构建失败，错误代码为1。