如何在MongoDB的lookup中使用正则表达式?

huangapple go评论59阅读模式
英文:

How to use regex in foreignkey in mogodb lookup

问题

我应该如何在外键字段上使用通配符进行$lookup?

任务是从Transaction中检索所有记录,其中desc子字符串在Keyword集合中找到。

Transaction中的desc可能以abc<keyword>def的形式出现,与Keyword集合交叉引用,如果找到,则应匹配。

我考虑使用否定匹配的正则表达式.*XXX.*来完成任务。

不确定如何在$lookup中放置正则表达式子句。

Transaction集合:

transactions=[
{
  "_id": {
    "$oid": "6480267ab9fe78e82131b737"
  },
  "date": {
    "$date": "2020-06-22T00:00:00.000Z"
  },
  "desc": "abcKey1def",
},
{
  "_id": {
    "$oid": "6480267ab9fe78e82131b738"
  },
  "date": {
    "$date": "2020-06-23T00:00:00.000Z"
  },
  "desc": "abcdef",
}
]

Keyword集合:

keyword=[
{
  "_id": {
    "$oid": "64816f3828372d84a93cd4ad"
  },
  "code": 123,
  "desc": "Key1"
},
{
  "_id": {
    "$oid": "648174bf28372d84a93cd4b5"
  },
  "code": 456,
  "desc": "Key2",
}
]

我知道我可能需要使用letpipeline...但不确定如何将它们组合在一起。

$lookup:{
  from: "keyword",
  let: {
    desc:'$desc'
  },
  pipeline: [
     ....?
  ]
  
  as: "result"
}
英文:

How can I do a $lookup using wildcard on the foreign key field?

The task is to retrieve all records from Transaction where the the desc substring is not found in the Keyword collection.

The desc in Transaction could come in the form of abc&lt;keyword&gt;def, and cross referencing to Keyword collection, this should be a match if <keyword> is found.

I thought of using a negative match for regex .*XXX.* to achieve the task.

Not sure how to put a regex clause in a $lookup.

Transaction collection:

transactions=[
{
  &quot;_id&quot;: {
    &quot;$oid&quot;: &quot;6480267ab9fe78e82131b737&quot;
  },
  &quot;date&quot;: {
    &quot;$date&quot;: &quot;2020-06-22T00:00:00.000Z&quot;
  },
  &quot;desc&quot;: &quot;abcKey1def&quot;,
},
{
  &quot;_id&quot;: {
    &quot;$oid&quot;: &quot;6480267ab9fe78e82131b738&quot;
  },
  &quot;date&quot;: {
    &quot;$date&quot;: &quot;2020-06-23T00:00:00.000Z&quot;
  },
  &quot;desc&quot;: &quot;abcdef&quot;,
}
]

Keyword collection:

keyword=[
{
  &quot;_id&quot;: {
    &quot;$oid&quot;: &quot;64816f3828372d84a93cd4ad&quot;
  },
  &quot;code&quot;: 123,
  &quot;desc&quot;: &quot;Key1&quot;
},
{
  &quot;_id&quot;: {
    &quot;$oid&quot;: &quot;648174bf28372d84a93cd4b5&quot;
  },
  &quot;code&quot;: 456,
  &quot;desc&quot;: &quot;Key2&quot;,
}
]

I know I probably need to use let and pipeline... but not sure how to put them together.

$lookup:{
  from: &quot;keyword&quot;,
  let: {
    desc:&#39;$desc&#39;
  },
  pipeline: [
     ....?
  ]
  
  as: &quot;result&quot;
}

答案1

得分: 1

一种方法是在 $lookup 中使用 $regexMatch。像这样:

db.transactions.aggregate([
  {
    "$lookup": {
      "from": "keyword",
      "let": {
        desc: "$desc"
      },
      "pipeline": [
        {
          $match: {
            $expr: {
              "$regexMatch": {
                "input": "$$desc",
                "regex": {
                  "$concat": [
                    ".*",
                    "$desc",
                    ".*"
                  ]
                }
              }
            }
          }
        }
      ],
      "as": "matchingDocs"
    }
  },
  {
    "$match": {
      matchingDocs: []
    }
  }
])

Playground 链接。

英文:

One way is to use $regexMatch in $lookup. Like this:

db.transactions.aggregate([
  {
    &quot;$lookup&quot;: {
      &quot;from&quot;: &quot;keyword&quot;,
      &quot;let&quot;: {
        desc: &quot;$desc&quot;
      },
      &quot;pipeline&quot;: [
        {
          $match: {
            $expr: {
              &quot;$regexMatch&quot;: {
                &quot;input&quot;: &quot;$$desc&quot;,
                &quot;regex&quot;: {
                  &quot;$concat&quot;: [
                    &quot;.*&quot;,
                    &quot;$desc&quot;,
                    &quot;.*&quot;
                  ]
                }
              }
            }
          }
        }
      ],
      &quot;as&quot;: &quot;matchingDocs&quot;
    }
  },
  {
    &quot;$match&quot;: {
      matchingDocs: []
    }
  }
])

Playground link.

答案2

得分: 0

你可以将关键字作为变量获取,然后使用该关键字连接关键字。我在这里提取关键字,假设它将始终是abc<key>def。最后,它会过滤掉没有匹配关键字的记录。

https://mongoplayground.net/p/B0SIHY68gz0

db.transactions.aggregate([
  {
    $lookup: {
      from: "keywords",
      let: {
        keywordStr: {
          $substr: [
            "$desc",
            3,
            {
              $subtract: [
                {
                  $strLenCP: "$desc"
                },
                6
              ]
            }
          ]
        }
      },
      pipeline: [
        {
          $match: {
            $expr: {
              $eq: [
                "$desc",
                "$$keywordStr"
              ]
            }
          }
        }
      ],
      as: "keywords"
    }
  },
  {
    $match: {
      keywords: {
        $ne: []
      }
    }
  }
])
英文:

You can get the key as a variable and then join the keywords using the key.
I am extracting the key here, under the assumption that it will always be abc<key>def. At the end it filter out records which does not have matching keywords.

https://mongoplayground.net/p/B0SIHY68gz0

db.transactions.aggregate([
  {
    $lookup: {
      from: &quot;keywords&quot;,
      let: {
        keywordStr: {
          $substr: [
            &quot;$desc&quot;,
            3,
            {
              $subtract: [
                {
                  $strLenCP: &quot;$desc&quot;
                },
                6
              ]
            }
          ]
        }
      },
      pipeline: [
        {
          $match: {
            $expr: {
              $eq: [
                &quot;$desc&quot;,
                &quot;$$keywordStr&quot;
              ]
            }
          }
        }
      ],
      as: &quot;keywords&quot;
    }
  },
  {
    $match: {
      keywords: {
        $ne: []
      }
    }
  }
])

答案3

得分: 0

db.Transaction.aggregate([
  {
    $lookup: {
      from: "Keyword",
      let: { keywordSubstring: { $regexFind: { input: "$desc", regex: "<keyword>", options: "i" } } },
      pipeline: [
        {
          $match: {
            $expr: {
              $not: {
                $regexMatch: {
                  input: "$$keywordSubstring.match",
                  regex: { $concat: [".*", { $escapeRegex: "$$keywordSubstring.match" }, ".*"] },
                  options: "i"
                }
              }
            }
          }
        }
      ],
      as: "keywordMatches"
    }
  }
])
英文:
db.Transaction.aggregate([
  {
    $lookup: {
      from: &quot;Keyword&quot;,
      let: { keywordSubstring: { $regexFind: { input: &quot;$desc&quot;, regex: &quot;&lt;keyword&gt;&quot;, options: &quot;i&quot; } } },
      pipeline: [
        {
          $match: {
            $expr: {
              $not: {
                $regexMatch: {
                  input: &quot;$$keywordSubstring.match&quot;,
                  regex: { $concat: [&quot;.*&quot;, { $escapeRegex: &quot;$$keywordSubstring.match&quot; }, &quot;.*&quot;] },
                  options: &quot;i&quot;
                }
              }
            }
          }
        }
      ],
      as: &quot;keywordMatches&quot;
    }
  }
])

huangapple
  • 本文由 发表于 2023年6月8日 15:21:18
  • 转载请务必保留本文链接:https://go.coder-hub.com/76429487.html
匿名

发表评论

匿名网友

:?: :razz: :sad: :evil: :!: :smile: :oops: :grin: :eek: :shock: :???: :cool: :lol: :mad: :twisted: :roll: :wink: :idea: :arrow: :neutral: :cry: :mrgreen:

确定