2023年6月1日 18:20:58go评论66阅读模式

英文:

What is the best way to find text content under relatively indistinguishable tags by Selenium-webdriver?

问题

作为新手，我更喜欢使用绝对 XPath 来查找文本定位的 WebElements。我尝试过：

List<WebElement> elements = web.findElements(By.xpath("/html[1]/body[1]/div[2]/div[2]/dl[1]/dd[2]/div[2]/div/div[1]/ol[5]/li[1]/div[2]/div/p"));

但是我无法捕获标签下带有轻微更改的文本。目标 XPath 如下：

/html[1]/body[1]/div[2]/div[2]/dl[1]/dd[2]/div[2]/div[6]/div[1]/ol[5]/li[1]/div[2]/div[4]/div[1]/div[1]/p[1]
/html[1]/body[1]/div[2]/div[2]/dl[1]/dd[2]/div[2]/div[6]/div[1]/ol[5]/li[1]/div[2]/div[8]/p
/html[1]/body[1]/div[2]/div[2]/dl[1]/dd[2]/div[2]/div[2]/div[1]/ol[5]/li[1]/div[2]/div[3]/h3[1]
/html[1]/body[1]/div[2]/div[2]/dl[1]/dd[2]/div[2]/div[2]/div[1]/ol[5]/li[1]/div[2]/div[2]/div[1]/p
/html[1]/body[1]/div[2]/div[2]/dl[1]/dd[2]/div[2]/div[5]/div[1]/ol[5]/li[1]/div[2]/div[1]/div[1]/p[1]/strong[1]

在上述提到的 XPath 中获取所有文本内容的正确公式或方法是什么？

英文:

As newbie I prefer use Abs XPath to get find WebElemnts where text is positioned.<br>
I tried:

List&lt;WebElement&gt; elements = web.findElements(By.xpath(&quot;/html[1]/body[1]/div[2]/div[2]/dl[1]/dd[2]/div[2]/div/div[1]/ol[5]/li[1]/div[2]/div/p&quot;));

But i failed to catch text under tags with minor changes<br>
Target xpaths:

/html[1]/body[1]/div[2]/div[2]/dl[1]/dd[2]/div[2]/div[6]/div[1]/ol[5]/li[1]/div[2]/div[4]/div[1]/div[1]/p[1]
/html[1]/body[1]/div[2]/div[2]/dl[1]/dd[2]/div[2]/div[6]/div[1]/ol[5]/li[1]/div[2]/div[8]/p
/html[1]/body[1]/div[2]/div[2]/dl[1]/dd[2]/div[2]/div[2]/div[1]/ol[5]/li[1]/div[2]/div[3]/h3[1]
/html[1]/body[1]/div[2]/div[2]/dl[1]/dd[2]/div[2]/div[2]/div[1]/ol[5]/li[1]/div[2]/div[2]/div[1]/p
/html[1]/body[1]/div[2]/div[2]/dl[1]/dd[2]/div[2]/div[5]/div[1]/ol[5]/li[1]/div[2]/div[1]/div[1]/p[1]/strong[1]

What is correct formula or way to get all text content in the above mentioned xpaths ?

答案1

得分: 1

如果你想要获取包含直接文本的所有元素，你可以使用以下XPath表达式：

/html/body[1]//*[text()[normalize-space()]]

这将返回所有包含直接文本节点的元素，经过过滤不必要的空白字符后，保留字符数据。

XPath部分的含义如下：

// = 任何后代元素；参见这里有关轴的信息
* = 任何元素
[某个过滤条件] = 用于筛选直接前一个节点的谓词
[#number] = 在其同级元素中的位置。body[1]可能看起来多余，但可以帮助XPath引擎不再查找其他body元素
text() = 文本类型的节点
normalize-space() = 根据这些规则删除空白字符

英文:

Not very clear what you want: If you want all elements that contain direct text you could use:

/html/body[1]//*[text()[normalize-space()]]

this will return all elements with direct text()-nodes that after filtering unnecessary whitespace, have character-data.

meaning XPath-parts:

// = any descendant; see this info on axes

* = any element

[some filter] = predicate to filter on direct previous node.

[#number] = the position within its siblings. body[1] maybe seems redundant, but can help the XPath-engine not to look any further for other body elements

text() = node of type text

normalize-space() = strips white-space according this rules

通过集体智慧和协作来改善编程学习和解决问题的方式。致力于成为全球开发者共同参与的知识库，让每个人都能够通过互相帮助和分享经验来进步。

什么是通过Selenium-webdriver找到相对难以区分标签下的文本内容的最佳方法？

问题

答案1

Android ExoPlayer 2 – 如何在渲染多个视频时提高帧率？

Is there a good way to add FutureLocal.java to custom Future.java extending CompletableFuture? (example code below)

如何使 log4j2 每次日志调用记录到一个文件中？

Java从其他方法中使用用户输入

What's the correct way to type hint an empty list as a literal in python?

如何在Highcharts Gantt中更改本地化的星期名称

如何在同一个流中使用多个过滤器和映射函数？

如何使用Map/Set来将代码优化到O(n)？

.NET MAUI Android在GitHub Actions上构建失败，错误代码为1。

如何在Playwright视觉比较中屏蔽多个定位器？

在C++中，可以使用可变模板参数来检索类型的内部类型。

selenium.common.exceptions.StaleElementReferenceException: Message: stale element reference: stale element not found

Creating and opening a URL to log in to Website via Basic Auth with Robot Framework/Selenium (Python)

AG Grid 在上下文菜单中以大文本形式打开

发表评论