2023年2月19日 18:49:20go评论91阅读模式

英文:

Scrapy selector doesn't "see" an element that is present on the webpage

问题

我想解析以下网页：
https://mafiaworldtour.com/tournaments/2653
我需要找到以下元素：
```//html/body/div[1]/div/section[2]/div/div/div/div[1]/div[1]/div/div[2]/div/div[1]/div[2]/span/text()```
当我在网页上通过检查查找它时，它明显存在，但
```city = response.xpath(&#39;//html/body/div[1]/div/section[2]/div/div/div/div[1]/div[1]/div/div[2]/div/div[1]/div[2]/span/text()&#39;).extract_first()``` 返回 None。
这是为什么呢？
我期望通过xpath获得比赛的城市 `Хайфа, Израиль`。

英文:

I want to parse the following webpage:
https://mafiaworldtour.com/tournaments/2653

And I need to find the following element:

//html/body/div[1]/div/section[2]/div/div/div/div[1]/div[1]/div/div[2]/div/div[1]/div[2]/span/text()

When I search it on the webpage via inspect, it is clearly present, but
city = response.xpath('//html/body/div[1]/div/section[2]/div/div/div/div[1]/div[1]/div/div[2]/div/div[1]/div[2]/span/text()').extract_first() returns None.

What is the reason for this?

I expect to get the city Хайфа, Израиль of the tournament via xpath.

答案1

得分: 0

使用我的项目retrieveCssOrXpathSelectorFromTextOrNode来获取完整的[tag:xpath]查询：

x('Хайфа, Израиль');
//body/div[@class=&quot;site-wrapper&quot;]/div[@class=&quot;main&quot;][@role=&quot;main&quot;]/section[@class=&quot;page-content&quot;]/div[@class=&quot;container&quot;]/div[@class=&quot;tabs&quot]/div[@class=&quot;tab-content&quot]/div[@class=&quot;tab-pane fade in active &quot;][@id=&quot;general&quot]/div[@class=&quot;row&quot]/div[@class=&quot;col-md-12&quot]/div[@class=&quot;table-responsive&quot]/div[@class=&quot;responsive-info-table&quot]/div[@class=&quot;row with-top-border&quot]/div[@class=&quot;col-md-6&quot]/span[@class=&quot;small_content&quot;]

总是比使用相对路径的chrome dev tools自动生成的XPath查询更好：

//html/body/div[1]/div/section[2]/div/div......

但是你可以删除无用的部分，应该是这样的：

(从chrome dev tools或firefox控制台)：

$x('//span[@class=&quot;small_content&quot;]')[0].innerText

或者在你的情况下：

response.xpath('//span[@class=&quot;small_content&quot;]/text()').extract_first()

输出：

" Хайфа, Израиль"

英文:

Using my own project retrieveCssOrXpathSelectorFromTextOrNode to fetch the full [tag:xpath] query:

x(&#39;Хайфа, Израиль&#39;);
//body/div[@class=&quot;site-wrapper&quot;]/div[@class=&quot;main&quot;][@role=&quot;main&quot;]/section[@class=&quot;page-content&quot;]/div[@class=&quot;container&quot;]/div[@class=&quot;tabs&quot;]/div[@class=&quot;tab-content&quot;]/div[@class=&quot;tab-pane fade in active &quot;][@id=&quot;general&quot;]/div[@class=&quot;row&quot;]/div[@class=&quot;col-md-12&quot;]/div[@class=&quot;table-responsive&quot;]/div[@class=&quot;responsive-info-table&quot;]/div[@class=&quot;row with-top-border&quot;]/div[@class=&quot;col-md-6&quot;]/span[@class=&quot;small_content&quot;]

It's always better to have these specific XPath query's than the one with relative path like auto-generated by chrome dev tools:

//html/body/div[1]/div/section[2]/div/div......

But you can remove the useless part, should be like:

(From chrome dev tools, or firefox console):

$x(&#39;//span[@class=&quot;small_content&quot;]&#39;)[0].innerText

or in your case:

response.xpath(&#39;//span[@class=&quot;small_content&quot;]/text()&#39;).extract_first()

Output

&quot; Хайфа, Израиль&quot;

答案2

得分: 0

CSS选择器

response.css('.small_content::text').get()

XPATH

response.xpath('//span[@class="small_content"]/text()').get()

英文:

you can use both CSS selector orXPATH

CSS selector

response.css(&#39;.small_content::text&#39;).get()

XPATH

response.xpath(&#39;//span[@class=&quot;small_content&quot;]/text()&#39;).get()

通过集体智慧和协作来改善编程学习和解决问题的方式。致力于成为全球开发者共同参与的知识库，让每个人都能够通过互相帮助和分享经验来进步。

Scrapy选择器没有“看到”网页上存在的元素

问题

答案1

输出：

Output

答案2

HTML5闭合标签

如何在CSS中对齐旋转的文本和图标

HTML 给元素名称分配变量

如何在表单验证结束后跳转到另一个创建的页面？

如何在Playwright视觉比较中屏蔽多个定位器？

在C++中，可以使用可变模板参数来检索类型的内部类型。

selenium.common.exceptions.StaleElementReferenceException: Message: stale element reference: stale element not found

Creating and opening a URL to log in to Website via Basic Auth with Robot Framework/Selenium (Python)

AG Grid 在上下文菜单中以大文本形式打开

What's the correct way to type hint an empty list as a literal in python?

如何在Highcharts Gantt中更改本地化的星期名称

如何在同一个流中使用多个过滤器和映射函数？

如何使用Map/Set来将代码优化到O(n)？

.NET MAUI Android在GitHub Actions上构建失败，错误代码为1。