2014年4月11日 12:26:49go评论115阅读模式

英文:

Unmarshal with global namespace

问题

我有以下的XML：

<rss version="2.0">
  <channel>
    ...
    <item>
      <link>http://stackoverflow.com</link>
      <atom:link xmlns:atom="http://www.w3.org/2005/Atom" href="http://stackoverflow.com"/>
      ...
    </item>
  </channel>
</rss>

我想提取link属性，我有以下的结构体：

type Item struct {
  Link string `xml:"http://www.w3.org/2005/Atom link"`
}

我知道，我需要一个前缀来获取Link，但是因为没有给出命名空间（以xmls属性的形式），我不知道该怎么做。

当然，我可以将所有的:*link属性保存到一个切片中，但我相信有更好的解决方案。

提前谢谢！

英文:

I have the following XML:

&lt;rss version=&quot;2.0&quot;&gt;
  &lt;channel&gt;
    ...
    &lt;item&gt;
      &lt;link&gt;http://stackoverflow.com&lt;/link&gt;
      &lt;atom:link xmlns:atom=&quot;http://www.w3.org/2005/Atom&quot; href=&quot;http://stackoverflow.com&quot;/&gt;
      ...
    &lt;/item&gt;
  &lt;/channel&gt;
&lt;/rss&gt;

I want to extract the link attribute, I have the following struct:

type Item struct {
  Link string `xml:&quot;http://www.w3.org/2005/Atom link&quot;`
}

I know, that I need a prefix to get the Link, but because there is no namespace given (in form of an xmls-Attribute, but I don't know, how.

I could, of course, save all :*link-Attributes to a slice, but I'm sure there is a better solution.

Thanks in advance!

答案1

得分: 1

标准库encoding/xml包中的命名空间处理似乎是一个大杂糅，并且具有相同名称的不同命名空间中的元素似乎是一个触发器。

理想情况下，您应该能够将给定的XML解码为以下结构：

type Rss struct {
    Items []Item `xml:"channel>item"`
}
type Item struct {
    Link     string   `xml:"link"`
    AtomLink AtomLink `xml:"http://www.w3.org/2005/Atom link"`
}
type AtomLink struct {
    Href string `xml:"href,attr"`
}

但是这会导致错误main.Item字段"Link"的标签"link"与字段"AtomLink"的标签"http://www.w3.org/2005/Atom link"冲突（如http://play.golang.org/p/LgW-vm4euL中所示）。

然而，如果我们决定忽略<atom:link>元素，即将Item.AtomLink字段注释掉，我们最终解码得到一个空字符串，因为xml:"link"匹配任何命名空间中的<link>元素，而不仅仅是空命名空间。最后的<atom:link>元素是空的，所以不返回任何内容。

一些可能的解决方法包括：

仅尝试解码<atom:link>元素，因为它可以唯一选择。如果您还要处理不带Atom命名空间元素的RSS源，这可能不太有用。
通过修改Item结构来收集所有<link>元素的内容：

Links []string `xml:"link"`

然后丢弃切片中的任何空字符串。

归根结底，该包需要一种引用空命名空间的方式。为了保持现有程序的功能，这可能需要新的语法。

英文:

The namespace handling in the standard library encoding/xml package seems to be a big ad-hoc, and having elements in different namespaces with the same name seems to be a trigger.

Ideally you'd be able to decode the given XML into the following structures:

type Rss struct {
	Items []Item `xml:&quot;channel&gt;item&quot;`
}
type Item struct {
	Link     string   `xml:&quot;link&quot;`
	AtomLink AtomLink `xml:&quot;http://www.w3.org/2005/Atom link&quot;`
}
type AtomLink struct {
	Href string `xml:&quot;href,attr&quot;`
}

But this results in the error main.Item field "Link" with tag "link" conflicts with field "AtomLink" with tag "http://www.w3.org/2005/Atom link" (as seen in http://play.golang.org/p/LgW-vm4euL).

However, if we decide that we want to ignore the <atom:link> element by commenting out the Item.AtomLink field, we end up decoding an empty string, since xml:"link" matches <link> elements in any namespace rather than just the blank namespace. The final <atom:link> element is empty, so doesn't return anything.

A couple of possible work arounds include:

Only try to decode the <atom:link> element, since it can be selected uniquely. This may not be useful if you're also processing RSS feeds without Atom namespace elements.
Collect the contents of all <link> elements by modifying the Item struct to use:
```
Links []string `xml:&quot;link&quot;`
```
And then discard any empty strings in the slice.

At the end of the day, the package will need some way to refer to the blank namespace. That may require new syntax in order to keep existing programs functioning though.

通过集体智慧和协作来改善编程学习和解决问题的方式。致力于成为全球开发者共同参与的知识库，让每个人都能够通过互相帮助和分享经验来进步。

使用全局命名空间进行解组

问题

答案1

为什么不能为结构体和它的指针同时定义一个方法？

Golang：解析组的XML元素值和属性

防止golang的http.NewRequest在POST请求的正文中添加大括号。

当工作完成时，goroutine是否会退出？

如何在Playwright视觉比较中屏蔽多个定位器？

在C++中，可以使用可变模板参数来检索类型的内部类型。

selenium.common.exceptions.StaleElementReferenceException: Message: stale element reference: stale element not found

Creating and opening a URL to log in to Website via Basic Auth with Robot Framework/Selenium (Python)

AG Grid 在上下文菜单中以大文本形式打开

What's the correct way to type hint an empty list as a literal in python?

如何在Highcharts Gantt中更改本地化的星期名称

如何在同一个流中使用多个过滤器和映射函数？

如何使用Map/Set来将代码优化到O(n)？

.NET MAUI Android在GitHub Actions上构建失败，错误代码为1。