问题

我试图在某个网页上找到对Google Sheets的单元格值的所有提及。单元格值可以用任何语言编写。网页也可以是任何语言。

现在我有这个公式：

=COUNTA(IFERROR(IMPORTXML(A2;"//*[contains(translate(text(),'ABCDEFGHJIKLMNOPQRSTUVWXYZАБВГДЕЁЖЗИКЛМНОПРСТУФХЦЧШЩЭЮЯ', 'abcdefghjiklmnopqrstuvwxyzабвгдеёжзиклмнопрстуфхцчшщэюя'),'"&LOWER(B2)&"')]")))

但它工作不正确。它无法找到拉丁字母和西里尔字母中的所有单词，也没有考虑葡萄牙语、德语和其他语言的字符。如何使这个公式通用？或者如何编写适用于Google Sheets的适当脚本？

我的公式工作不正确。我想要使用Google Sheets的公式或脚本在网页上找到任何语言的任何文本。

英文:

I'm trying to find all mentions of a Google Sheets' cell value on some website page. The cell value can be written in any language. The website page can also be in any language.

Now I have this formula:

But it works incorrectly. It's not possible to find all words in Latin and Cyrillic, characters from Portuguese, German and other languages are not taken into account. How to make the formula universal? Or how can I write the appropriate script for Google Sheets?

My formula works incorrectly. I want to find any text in any language on website page with Google Sheets' formula or script.

答案1

得分: 0

我们可以更加优雅地解决这个问题，如果Google Docs支持XPath 2.0，并且我们可以使用fn:lower-case()。在XPath 1.0中，我们被限制使用translate()，需要自己提供大写和小写的转换。然而，这种方式很难涵盖所有可能的（Unicode）字符和所有语言。1 我已经将拉丁字母的重音符号添加到您的公式中，这应该涵盖大多数西方语言。

=COUNTA(IFERROR(IMPORTXML(A2;"//*[contains(translate(text(),
'ABCDEFGHJIKLMNOPQRSTUVWXYZÀÁÂÃÄÅÆÇÈÉÊËÌÍÎÏÐÑÒÓÔÕÖØÙÚÛÜÝÞŸŽŠŒАБВГДЕЁЖЗИКЛМНОПРСТУФХЦЧШЩЭЮЯ',
'abcdefghjiklmnopqrstuvwxyzàáâãäåæçèéêëìíîïðñòóôõöøùúûüýþÿžšœабвгдеёжзиклмнопрстуфхцчшщэюя'))&LOWER(B2)&"')]"))

<hr>

1: 此类转换对于所有语言都不完美，例如，德语的 'ß'（小写）在大写时可能写作 'SS'，这使得查找所有匹配变得困难。

英文:

We could solve this more elegantly if Google Docs would support Xpath 2.0 and we could use the fn:lower-case(). In Xpath 1.0 we are stuck with translate() and need to provide the upper-case and lower-case translation ourselves. However, it is difficult to cover all possible (Unicode) characters in all languages this way.1 I have added the diacritics from the Latin alphabet to your formula, with should cover most Western languages.

=COUNTA(IFERROR(IMPORTXML(A2;&quot;//*[contains(translate(text(),
&#39;ABCDEFGHJIKLMNOPQRSTUVWXYZ&#192;&#193;&#194;&#195;&#196;&#197;&#198;&#199;&#200;&#201;&#202;&#203;&#204;&#205;&#206;&#207;&#208;&#209;&#210;&#211;&#212;&#213;&#214;&#216;&#217;&#218;&#219;&#220;&#221;&#222;ŸŽŠŒАБВГДЕЁЖЗИКЛМНОПРСТУФХЦЧШЩЭЮЯ&#39;, 
&#39;abcdefghjiklmnopqrstuvwxyz&#224;&#225;&#226;&#227;&#228;&#229;&#230;&#231;&#232;&#233;&#234;&#235;&#236;&#237;&#238;&#239;&#240;&#241;&#242;&#243;&#244;&#245;&#246;&#248;&#249;&#250;&#251;&#252;&#253;&#254;&#255;žšœабвгдеёжзиклмнопрстуфхцчшщэюя&#39;),&#39;&quot;&amp;LOWER(B2)&amp;&quot;&#39;)]&quot;)))

<hr>

1: Furthermore, such a transformation does not work perfectly for all languages, e.g., the German 'ß' (lowercase) may be written as 'SS' when in uppercase which makes it difficult to find all matches.

通过集体智慧和协作来改善编程学习和解决问题的方式。致力于成为全球开发者共同参与的知识库，让每个人都能够通过互相帮助和分享经验来进步。

XPATH内的IMPORTXML：如何查找所有语言的文本？

问题

答案1

如果表中存在该名称，则从另一列或行返回值。

Sure, here’s the translation: “java selenium xpath relative”

如何从用户那里读取Excel文件并生成Google表格文件。

谷歌表格仪表板比较图表功能

What's the correct way to type hint an empty list as a literal in python?

如何在Highcharts Gantt中更改本地化的星期名称

如何在同一个流中使用多个过滤器和映射函数？

如何使用Map/Set来将代码优化到O(n)？

.NET MAUI Android在GitHub Actions上构建失败，错误代码为1。

如何在Playwright视觉比较中屏蔽多个定位器？

在C++中，可以使用可变模板参数来检索类型的内部类型。

selenium.common.exceptions.StaleElementReferenceException: Message: stale element reference: stale element not found

Creating and opening a URL to log in to Website via Basic Auth with Robot Framework/Selenium (Python)

AG Grid 在上下文菜单中以大文本形式打开

发表评论