将Liquid模板渲染为Python字典。

huangapple go评论147阅读模式
英文:

Rendered Liquid template to Python dict

问题

我可以帮你翻译你提供的Liquid模板以及期望的字典格式,但不会回答关于翻译的问题。以下是翻译好的部分:

  1. {
  2. "articleHeadline": "This is the article of the headline",
  3. "intro": "\nIntroduction to the document\n",
  4. "articleHighlights": "\nStep 1: Get up\n\nStep 2: Go to kitchen\n\nStep 3: Get coffee\n",
  5. "articleSummary": "\nSummary of the document\n"
  6. }

如果你有其他需要,请随时告诉我。

英文:

I have the following Liquid template:

  1. {%- capture vars -%}
  2. {%- capture articleHeadline -%}This is the article of the headline{%- endcapture -%}
  3. {%- capture intro -%}
  4. Introduction to the document
  5. {%- endcapture -%}
  6. {%- capture articleHighlights -%}
  7. Step 1: Get up
  8. Step 2: Go to kitchen
  9. Step 3: Get coffee
  10. {%- endcapture -%}
  11. {%- capture articleSummary -%}
  12. Summary of the document
  13. {%- endcapture -%}
  14. {%- endcapture -%}

I'd like to parse this text and end up with a dict of the form:

  1. {
  2. "articleHeadline": "This is the article of the headline",
  3. "intro": "\nIntroduction to the document\n"
  4. "articleHighlights": "\nStep 1: Get up\n\nStep 2: Go to kitchen\n\nStep 3: Get coffee\n",
  5. "articleSummary": "\nSummary of the document\n"
  6. }

I tried Python Liquid but I'm confident it does not reverse engineer templates like this. I also checked out BeautifulSoup for this purpose but it will only parse HTML tags.

Is there a library or method that I can use to parse this (hopefully avoiding regex)?

答案1

得分: 1

以下是您要翻译的部分:

  1. 我从未使用过这个库但它似乎公开了一个返回模板的抽象语法树AST的解析器API从那里开始只需要递归函数来查找所有`capture`块并返回它们的内容
  2. ```python
  3. from pprint import pprint
  4. from liquid.builtin.tags.capture_tag import CaptureNode
  5. from liquid.environment import Environment
  6. template = """
  7. {%- capture vars -%}
  8. {%- capture articleHeadline -%}This is the article of the headline{%- endcapture -%}
  9. {%- capture intro -%}
  10. Introduction to the document
  11. {%- endcapture -%}
  12. {%- capture articleHighlights -%}
  13. Step 1: Get up
  14. Step 2: Go to kitchen
  15. Step 3: Get coffee
  16. {%- endcapture -%}
  17. {%- capture articleSummary -%}
  18. Summary of the document
  19. {%- endcapture -%}
  20. {%- endcapture -%}
  21. """
  22. def get_capture_contents(node):
  23. if isinstance(node, CaptureNode):
  24. yield (node.name, node.block)
  25. if hasattr(node, "children"):
  26. for child in node.children():
  27. yield from get_capture_contents(child.node)
  28. env = Environment()
  29. ast = env.parse(template)
  30. blocks = {}
  31. for stmt in ast.statements:
  32. for cap_name, content in get_capture_contents(stmt):
  33. blocks[cap_name] = str(content)
  34. pprint(blocks)
英文:

How are you that confident that Python Liquid doesn't do this? 将Liquid模板渲染为Python字典。

I've never used the library before, but it seems to expose a parser API that returns the AST (abstract syntax tree) of the template; from there on out it's just a recursive function away to find all capture blocks and return their contents.

  1. from pprint import pprint
  2. from liquid.builtin.tags.capture_tag import CaptureNode
  3. from liquid.environment import Environment
  4. template = """
  5. {%- capture vars -%}
  6. {%- capture articleHeadline -%}This is the article of the headline{%- endcapture -%}
  7. {%- capture intro -%}
  8. Introduction to the document
  9. {%- endcapture -%}
  10. {%- capture articleHighlights -%}
  11. Step 1: Get up
  12. Step 2: Go to kitchen
  13. Step 3: Get coffee
  14. {%- endcapture -%}
  15. {%- capture articleSummary -%}
  16. Summary of the document
  17. {%- endcapture -%}
  18. {%- endcapture -%}
  19. """
  20. def get_capture_contents(node):
  21. if isinstance(node, CaptureNode):
  22. yield (node.name, node.block)
  23. if hasattr(node, "children"):
  24. for child in node.children():
  25. yield from get_capture_contents(child.node)
  26. env = Environment()
  27. ast = env.parse(template)
  28. blocks = {}
  29. for stmt in ast.statements:
  30. for cap_name, content in get_capture_contents(stmt):
  31. blocks[cap_name] = str(content)
  32. pprint(blocks)

prints out

  1. {'articleHeadline': 'This is the article of the headline',
  2. 'articleHighlights': 'Step 1: Get up\n'
  3. '\n'
  4. 'Step 2: Go to kitchen\n'
  5. '\n'
  6. 'Step 3: Get coffee',
  7. 'articleSummary': 'Summary of the document',
  8. 'intro': 'Introduction to the document',
  9. 'vars': 'var articleHeadline = { This is the article of the headline }var '
  10. 'intro = { Introduction to the document }var articleHighlights = { '
  11. 'Step 1: Get up\n'
  12. '\n'
  13. 'Step 2: Go to kitchen\n'
  14. '\n'
  15. 'Step 3: Get coffee }var articleSummary = { Summary of the document }'}

huangapple
  • 本文由 发表于 2023年5月25日 20:44:08
  • 转载请务必保留本文链接:https://go.coder-hub.com/76332407.html
匿名

发表评论

匿名网友

:?: :razz: :sad: :evil: :!: :smile: :oops: :grin: :eek: :shock: :???: :cool: :lol: :mad: :twisted: :roll: :wink: :idea: :arrow: :neutral: :cry: :mrgreen:

确定