英文:
Handling nested JSON structs with custom unmarshaller
问题
我正在处理一个返回带有嵌套结构和一些可选字段(以及随机顺序)的JSON的旧系统。类似于这样:
type A struct {
/* 简单的结构体,可以正常解组 */
AF1 string `json:"AF1"`
}
type B struct {
/* 简单的结构体,可以正常解组 */
BF1 string `json:"BF1"`
}
type X struct {
Things []A `json:"things"` /* 必需的 */
Thangs []B `json:"thangs"` /* 必需的 */
/* 一些单独的字符串值可能会出现或不出现,例如:
Item1 string
Item2 string
*/
}
如果Item1和Item2出现,我想将它们存储在一个映射或类似的数据结构中。
有没有一种优雅的方式来解组X?是否有一种方法可以为X编写自定义的UnmarshalJSON函数(以处理可选的字符串字段),然后将其传递给默认的JSON解组器来处理A和B?
英文:
I'm dealing with a legacy system that's returning JSON with nested structs and some optional fields (and in random order). Something like this:
type A struct {
/* simple struct, can be unmarshalled normally */
AF1 string `json:"AF1"`
}
type B struct {
/* simple struct, can be unmarshalled normally */
BF1 string `json:"BF1"`
}
type X struct {
Things []A `json:"things"` /* mandatory */
Thangs []B `json:"thangs"` /* mandatory */
/* some individual string values may or may not appear, eg:
Item1 string
Item2 string
*/
}
If Item[12] do appear, I want to stash them in a map or similar.
Is there any elegant way to unmarshal X? Is there some way to write a custom UnmarshalJSON func for X (to handle option string fields), and then hand off to the default JSON unmarshaller for A and B?
答案1
得分: 1
如果我从你的附加评论中正确理解了问题,那么输入可能包含任意未知名称(和类型?)的额外字段,你想要/需要访问这些字段。如果只是为了以后重新编组,那么json.RawMessage
类型可能会有用。
理想情况下,encoding/json
会有一个特殊的标签(类似于encoding/xml
标签中的",any"
),可以自动将任何额外/未引用的JSON项收集到map[string]interface{}
或map[string]json.RawMessage
字段中。然而,我没有找到任何这样的功能,也没有找到明显的方法来使用匿名结构体模拟它(但我没有努力尝试)。
编辑:Go项目中有一个关于这个功能的未解决问题。显然,在Go 1.2左右提交了一项更改,并进行了部分审查,但最终没有被接受。
如果无法实现上述功能,你可以通过几种方式来实现你所建议的功能,即为X创建自定义的(un)marshaller,并回调到json包来处理[]A
和[]B
。
这里是一个快速组合的示例,可能有更好/更清晰/更安全的方法来实现这一点。(在整个示例中,A和B可以是任意复杂的,可能包含自定义(un)marshalling方法的类型。)
package main
import (
"encoding/json"
"fmt"
)
type A struct {
AF1 string
}
type B struct {
BF1 string
}
type X struct {
Things []A
Thangs []B
// Or perhaps json.RawMessage if you just
// want to pass them through.
// Or map of string/int/etc if the value type is fixed.
Extra map[string]interface{}
}
// Marshal Way 1: call unmarshal twice on whole input
type xsub struct {
Things []A `json:"things"`
Thangs []B `json:"thangs"`
}
func (x *X) _UnmarshalJSON(b []byte) error {
// First unmarshall the known keys part:
var tmp xsub
if err := json.Unmarshal(b, &tmp); err != nil {
return err
}
// Then unmarshall the whole thing again:
var vals map[string]interface{}
if err := json.Unmarshal(b, &vals); err != nil {
return err
}
// Everything worked, chuck the map entries for
// "known" fields and store results.
delete(vals, "things")
delete(vals, "thangs")
x.Things = tmp.Things
x.Thangs = tmp.Thangs
x.Extra = vals
return nil
}
// Way 2:
func (x *X) UnmarshalJSON(b []byte) error {
// Only partially decode:
var tmp map[string]json.RawMessage
if err := json.Unmarshal(b, &tmp); err != nil {
return err
}
// Now handle the known fields:
var things []A
if err := json.Unmarshal(tmp["things"], &things); err != nil {
return err
}
var thangs []B
if err := json.Unmarshal(tmp["thangs"], &thangs); err != nil {
return err
}
// And the unknown fields.
var extra map[string]interface{}
// Either:
if true {
// this has more calls to Unmarshal, but may be more desirable
// as it completely skips over the already handled things/thangs.
delete(tmp, "things")
delete(tmp, "thangs")
// If you only needed to store the json.RawMessage for use
// in MarshalJSON then you'd just store "tmp" and stop here.
extra = make(map[string]interface{}, len(tmp))
for k, raw := range tmp {
var v interface{}
if err := json.Unmarshal(raw, &v); err != nil {
return err
}
extra[k] = v
}
} else { // Or:
// just one more call to Unmarshal, but it will waste
// time with things/thangs again.
if err := json.Unmarshal(b, &extra); err != nil {
return err
}
delete(extra, "things")
delete(extra, "thangs")
}
// no error, we can store the results
x.Things = things
x.Thangs = thangs
x.Extra = extra
return nil
}
func (x X) MarshalJSON() ([]byte, error) {
// abusing/reusing x.Extra, could copy map instead
x.Extra["things"] = x.Things
x.Extra["thangs"] = x.Thangs
result, err := json.Marshal(x.Extra)
delete(x.Extra, "things")
delete(x.Extra, "thangs")
return result, err
}
func main() {
inputs := []string{
`{"things": [], "thangs": []}`,
`
{
"things": [
{
"AF1": "foo"
},
{
"AF1": "bar"
}
],
"thangs": [
{
"BF1": "string value"
}
],
"xRandomKey": "not known ahead of time",
"xAreValueTypesKnown": 172
}`,
}
for _, in := range inputs {
fmt.Printf("\nUnmarshal(%q):\n", in)
var x X
err := json.Unmarshal([]byte(in), &x)
if err != nil {
fmt.Println("unmarshal:", err)
} else {
fmt.Printf("\tas X: %+v\n", x)
fmt.Printf("\twith map: %v\n", x.Extra)
out, err := json.Marshal(x)
if err != nil {
fmt.Println("marshal:", err)
continue
}
fmt.Printf("\tRemarshals to: %s\n", out)
}
}
}
英文:
If I understand the problem correctly from your additional comment,
then input might contain any arbitrary extra fields with unknown names (and types?)
and you want/need access to these.
If it's just for later re-marshalling then the json.RawMessage
type would be of interest.
Ideally encoding/json
would have a special tag
(like the ",any"
encoding/xml
tag)
that would automatically collect any extra/unreferenced JSON items into either a
map[string]interface{}
or a map[string]json.RawMessage
field.
However I couldn't find any such feature nor figure out an obvious way to emulate it with anonymous structs (but I didn't try very hard).
Edit: There is an open issue in the Go project for this feature. Apparently a change was submitted and partially reviewed around Go 1.2 but ended up not getting accepted.
Failing that, there are a couple of ways you can do exactly what you suggest,
make custom (un)marshaller for X and call back into the json package to handle []A
and []B
.
Here is an example quickly thrown together,
there may be better/clearer/safer ways to do this.
(Throughout this example, A and B can be arbitrarily complex, perhaps containing types that themselves have custom (un)marshalling methods.)
package main
import (
"encoding/json"
"fmt"
)
type A struct {
AF1 string
}
type B struct {
BF1 string
}
type X struct {
Things []A
Thangs []B
// Or perhaps json.RawMessage if you just
// want to pass them through.
// Or map of string/int/etc if the value type is fixed.
Extra map[string]interface{}
}
// Marshal Way 1: call unmarshal twice on whole input
type xsub struct {
Things []A `json:"things"`
Thangs []B `json:"thangs"`
}
func (x *X) _UnmarshalJSON(b []byte) error {
// First unmarshall the known keys part:
var tmp xsub
if err := json.Unmarshal(b, &tmp); err != nil {
return err
}
// Then unmarshall the whole thing again:
var vals map[string]interface{}
if err := json.Unmarshal(b, &vals); err != nil {
return err
}
// Everything worked, chuck the map entries for
// "known" fields and store results.
delete(vals, "things")
delete(vals, "thangs")
x.Things = tmp.Things
x.Thangs = tmp.Thangs
x.Extra = vals
return nil
}
// Way 2:
func (x *X) UnmarshalJSON(b []byte) error {
// Only partially decode:
var tmp map[string]json.RawMessage
if err := json.Unmarshal(b, &tmp); err != nil {
return err
}
// Now handle the known fields:
var things []A
if err := json.Unmarshal(tmp["things"], &things); err != nil {
return err
}
var thangs []B
if err := json.Unmarshal(tmp["thangs"], &thangs); err != nil {
return err
}
// And the unknown fields.
var extra map[string]interface{}
// Either:
if true {
// this has more calls to Unmarshal, but may be more desirable
// as it completely skips over the already handled things/thangs.
delete(tmp, "things")
delete(tmp, "thangs")
// If you only needed to store the json.RawMessage for use
// in MarshalJSON then you'd just store "tmp" and stop here.
extra = make(map[string]interface{}, len(tmp))
for k, raw := range tmp {
var v interface{}
if err := json.Unmarshal(raw, &v); err != nil {
return err
}
extra[k] = v
}
} else { // Or:
// just one more call to Unmarshal, but it will waste
// time with things/thangs again.
if err := json.Unmarshal(b, &extra); err != nil {
return err
}
delete(extra, "things")
delete(extra, "thangs")
}
// no error, we can store the results
x.Things = things
x.Thangs = thangs
x.Extra = extra
return nil
}
func (x X) MarshalJSON() ([]byte, error) {
// abusing/reusing x.Extra, could copy map instead
x.Extra["things"] = x.Things
x.Extra["thangs"] = x.Thangs
result, err := json.Marshal(x.Extra)
delete(x.Extra, "things")
delete(x.Extra, "thangs")
return result, err
}
func main() {
inputs := []string{
`{"things": [], "thangs": []}`,
`
{
"things": [
{
"AF1": "foo"
},
{
"AF1": "bar"
}
],
"thangs": [
{
"BF1": "string value"
}
],
"xRandomKey": "not known ahead of time",
"xAreValueTypesKnown": 172
}`,
}
for _, in := range inputs {
fmt.Printf("\nUnmarshal(%q):\n", in)
var x X
err := json.Unmarshal([]byte(in), &x)
if err != nil {
fmt.Println("unmarshal:", err)
} else {
fmt.Printf("\tas X: %+v\n", x)
fmt.Printf("\twith map: %v\n", x.Extra)
out, err := json.Marshal(x)
if err != nil {
fmt.Println("marshal:", err)
continue
}
fmt.Printf("\tRemarshals to: %s\n", out)
}
}
}
答案2
得分: 0
作为Dace C的答案的补充回答。
我想实现与你相同的功能,但我想重用函数而不是硬编码值。
这是我做的:
type DynamicFieldsUnmarshaller interface {
WithExtraFields(map[string]interface{})
Unmarshal([]byte) error
}
type TestObject struct {
Name string `json:"name"`
CustomFields map[string]interface{} `json:"-"`
}
func (o *TestObject) Unmarshal(data []byte) error {
return UnmarshalCustomJSON(data, o)
}
func (o *TestObject) WithExtraFields(f map[string]interface{}) {
o.CustomFields = f
}
func UnmarshalCustomJSON(b []byte, o DynamicFieldsUnmarshaller) error {
if err := json.Unmarshal(b, &o); err != nil {
return err
}
// unmarshal everything to a map
var vals map[string]interface{}
if err := json.Unmarshal(b, &vals); err != nil {
return err
}
if len(vals) == 0 {
return nil
}
fields := reflect.TypeOf(o).Elem()
num := fields.NumField()
for i := 0; i < num; i++ {
field := fields.Field(i)
jsonTag := field.Tag.Get("json")
if jsonTag != "" && jsonTag != "-" {
delete(vals, jsonTag)
}
}
o.WithExtraFields(vals)
return nil
}
这将只将不在结构体中的值添加到map[string]interface{}
字段中。
例如:
body := []byte(`
{
"name":"kilise",
"age": 40
}
`)
var dto TestObject
err := dto.Unmarshal(body)
if err != nil {
panic(err)
}
只会将"age"添加到dto.CustomFields映射中。
请注意,这种解决方案可能并不总是最佳的,因为它没有实现json.Unmarshaler
英文:
As an additional answer for Dace C's answer.
I wanted to achieve the same thing as you, however I wanted to reuse the function and not hardcode the values.
This is what I made:
type DynamicFieldsUnmarshaller interface {
WithExtraFields(map[string]interface{})
Unmarshal([]byte) error
}
type TestObject struct {
Name string `json:"name"`
CustomFields map[string]interface{} `json:"-"`
}
func (o *TestObject) Unmarshal(data []byte) error {
return UnmarshalCustomJSON(data,o)
}
func (o *TestObject) WithExtraFields(f map[string]interface{}) {
o.CustomFields = f
}
func UnmarshalCustomJSON(b []byte, o DynamicFieldsUnmarshaller) error {
if err := json.Unmarshal(b, &o); err != nil {
return err
}
// unmarshal everything to a map
var vals map[string]interface{}
if err := json.Unmarshal(b, &vals); err != nil {
return err
}
if len(vals)== 0 {
return nil
}
fields := reflect.TypeOf(o).Elem()
num := fields.NumField()
for i := 0; i < num; i++ {
field := fields.Field(i)
jsonTag := field.Tag.Get("json")
if jsonTag != "" && jsonTag != "-" {
delete(vals, jsonTag)
}
}
o.WithExtraFields(vals)
return nil
}
This should only add values not in the struct to the map[string]interface{}
field.
For instance:
body := []byte(`
{
"name":"kilise",
"age": 40
}
`)
var dto TestObject
err := dto.Unmarshal(body)
if err != nil {
panic(err)
}
Would only add "age" to dto.CustomFields map.
Note this solution might not always be the best, since it's not implementing json.Unmarshaler
通过集体智慧和协作来改善编程学习和解决问题的方式。致力于成为全球开发者共同参与的知识库,让每个人都能够通过互相帮助和分享经验来进步。
评论