2023年2月16日 03:18:20go评论62阅读模式

英文:

Removing the same element across all the nodes of an XML tree

问题

以下是要翻译的内容：

如何编写代码以便我可以删除每个国家节点中的一个节点元素（即年份或描述）。例如，在以下代码中：

# 要删除
# for country in root.findall('country'):
    # year = int(country.find('year').text)
    # if year > 2010:
        # root.remove(country)
# tree.write('sample.xml')

我可以删除元素年份属性大于2010的任何国家节点。但这将删除整个节点，而不仅仅是年份元素。我知道可以使用以下代码删除节点的单个元素：

# for country in root.findall('country'):
    # description_node = country.find('description')
    # if description_node.text == "Singapore has a lot of street markets.":
        # country.remove(description_node)
# tree.write('sample.xml')

但现在我想创建一个条件，其中我可以删除所有国家节点中的描述元素或年份元素或邻居元素。

英文:

For example sake, this is the xml file that I'm working with:

&lt;?xml version=&quot;1.0&quot;?&gt;
&lt;data&gt;
    &lt;country name=&quot;Liechtenstein&quot;&gt;
        &lt;rank&gt;1&lt;/rank&gt;
        &lt;year&gt;2008&lt;/year&gt;
        &lt;gdppc&gt;141100&lt;/gdppc&gt;
        &lt;neighbor name=&quot;Austria&quot; direction=&quot;E&quot;/&gt;
        &lt;neighbor name=&quot;Switzerland&quot; direction=&quot;W&quot;/&gt;
        &lt;description&gt;Liechtenstein has a lot of flowers.&lt;/description&gt;
    &lt;/country&gt;
    &lt;country name=&quot;Singapore&quot;&gt;
        &lt;rank&gt;4&lt;/rank&gt;
        &lt;year&gt;2011&lt;/year&gt;
        &lt;gdppc&gt;59900&lt;/gdppc&gt;
        &lt;neighbor name=&quot;Malaysia&quot; direction=&quot;N&quot;/&gt;
        &lt;description&gt;Singapore has a lot of street markets.&lt;/description&gt;
    &lt;/country&gt;
    &lt;country name=&quot;Panama&quot;&gt;
        &lt;rank&gt;68&lt;/rank&gt;
        &lt;year&gt;2011&lt;/year&gt;
        &lt;gdppc&gt;13600&lt;/gdppc&gt;
        &lt;neighbor name=&quot;Costa Rica&quot; direction=&quot;W&quot;/&gt;
        &lt;neighbor name=&quot;Colombia&quot; direction=&quot;E&quot;/&gt;
        &lt;description&gt;Panama has a lot of great food.&lt;/description&gt;
    &lt;/country&gt;
&lt;/data&gt;

How would I write the code such that I could delete one node element (i.e. year or description) across each of the country nodes. For example, in the following code:

# To remove 
# for country in root.findall(&#39;country&#39;):
	# year = int(country.find(&#39;year&#39;).text)
	# if year &gt; 2010:
		# root.remove(country)
# tree.write(&#39;sample.xml&#39;)

I can remove any country nodes whose attribute of the element year is greater than 2010. But that removes the entire node, not just the year element. I know that I can remove a single element of a node with the following:

# for country in root.findall(&#39;country&#39;):
	# description_node = country.find(&#39;description&#39;)
	# if description_node.text == &quot;Singapore has a lot of street markets.&quot;:
		# country.remove(description_node)
# tree.write(&#39;sample.xml&#39;)

But now I want to create a condition where I delete the description element or the year element or the neighbor element throughout all of the country nodes present.

答案1

得分: 0

以下是代码部分的翻译：

import xml.etree.ElementTree as ET

file = 'source.xml'
data = ET.parse(file)

for country in data.findall('country'):
    for neighbor in country.findall('neighbor'):
        country.remove(neighbor)
    for year in country.findall('year'):
        country.remove(year)
    for description in country.findall('description'):
        country.remove(description)

ET.dump(data)

输出部分不需要翻译。

英文:

One option might be the following that uses .findall and .remove:

import xml.etree.ElementTree as ET

file = &#39;source.xml&#39;
data = ET.parse(file)

for country in data.findall(&#39;country&#39;):
    for neighbor in country.findall(&#39;neighbor&#39;):
        country.remove(neighbor)
    for year in country.findall(&#39;year&#39;):
        country.remove(year)
    for description in country.findall(&#39;description&#39;):
        country.remove(description)

ET.dump(data)

Output:

python yourscript.py 
&lt;data&gt;
    &lt;country name=&quot;Liechtenstein&quot;&gt;
        &lt;rank&gt;1&lt;/rank&gt;
        &lt;gdppc&gt;141100&lt;/gdppc&gt;
        &lt;/country&gt;
    &lt;country name=&quot;Singapore&quot;&gt;
        &lt;rank&gt;4&lt;/rank&gt;
        &lt;gdppc&gt;59900&lt;/gdppc&gt;
        &lt;/country&gt;
    &lt;country name=&quot;Panama&quot;&gt;
        &lt;rank&gt;68&lt;/rank&gt;
        &lt;gdppc&gt;13600&lt;/gdppc&gt;
        &lt;/country&gt;
&lt;/data&gt;

答案2

得分: 0

在XSLT 3.0中，例如，您可以这样做：

<xsl:transform xmlns:xsl="http://www.w3.org/1999/XSL/Transform"
   version="3.0">
  <xsl:mode on-no-match="shallow-copy"/>
  <xsl:template match="year[. > 2000]"/>
</xsl:transform>

空模板规则会导致与谓词匹配的元素被移除；xsl:mode指令会导致其他一切保留下来。

英文:

In XSLT 3.0 you can do, for example:

&lt;xsl:transform xmlns:xsl=&quot;http://www.w3.org/1999/XSL/Transform&quot;
   version=&quot;3.0&quot;&gt;
  &lt;xsl:mode on-no-match=&quot;shallow-copy&quot;/&gt;
  &lt;xsl:template match=&quot;year[. &gt; 2000]&quot;/&gt;
&lt;/xsl:transform&gt;

The empty template rule causes elements that match the predicate to be removed; the xsl:mode instruction causes everything else to be retained.

通过集体智慧和协作来改善编程学习和解决问题的方式。致力于成为全球开发者共同参与的知识库，让每个人都能够通过互相帮助和分享经验来进步。

移除XML树的所有节点中相同的元素

问题

答案1

答案2

Selenium/chrome: Why do I first need to manually run chrome before I can succeed to do the same using selenium?

无法在WSL中安装或升级Python 3.10.8。

面对在XML解析中出现的org.xml.sax.SAXParseException异常

applying Singleton to Spotipy throws error

What's the correct way to type hint an empty list as a literal in python?

如何在Highcharts Gantt中更改本地化的星期名称

如何在同一个流中使用多个过滤器和映射函数？

如何使用Map/Set来将代码优化到O(n)？

.NET MAUI Android在GitHub Actions上构建失败，错误代码为1。

如何在Playwright视觉比较中屏蔽多个定位器？

在C++中，可以使用可变模板参数来检索类型的内部类型。

selenium.common.exceptions.StaleElementReferenceException: Message: stale element reference: stale element not found

Creating and opening a URL to log in to Website via Basic Auth with Robot Framework/Selenium (Python)

AG Grid 在上下文菜单中以大文本形式打开

发表评论