2023年6月5日 23:11:34go评论198阅读模式

英文:

define an output schema for a nested json in langchain

问题

以下是您要翻译的内容：

"it works but I want to know if theres a better way to go about this."

英文:

Whats the recommended way to define an output schema for a nested json, the method I use doesn't feel ideal.

# adding to planner -&gt; from langchain.experimental.plan_and_execute import load_chat_planner
refinement_response_schemas = [
        ResponseSchema(name=&quot;plan&quot;, description=&quot;&quot;&quot;{&#39;1&#39;: {&#39;step&#39;: &#39;&#39;,&#39;tools&#39;: [],&#39;data_sources&#39;: [],&#39;sub_steps_needed&#39;: bool},
 &#39;2&#39;: {&#39;step&#39;: &#39;&#39;,&#39;tools&#39;: [&lt;empty list&gt;],&#39;data_sources&#39;: [&lt;&gt;], &#39;sub_steps_needed&#39;: bool},}&quot;&quot;&quot;),] #define json schema in description, works but doesn&#39;t feel proper
    
refinement_output_parser = StructuredOutputParser.from_response_schemas(refinement_response_schemas)
refinement_format_instructions = refinement_output_parser.get_format_instructions()
refinement_output_parser.parse(output)

gives:

{&#39;plan&#39;: {&#39;1&#39;: {&#39;step&#39;: &#39;Identify the top 5 strikers in La Liga&#39;,
   &#39;tools&#39;: [],
   &#39;data_sources&#39;: [&#39;sports websites&#39;, &#39;official league statistics&#39;],
   &#39;sub_steps_needed&#39;: False},
  &#39;2&#39;: {&#39;step&#39;: &#39;Identify the top 5 strikers in the Premier League&#39;,
   &#39;tools&#39;: [],
   &#39;data_sources&#39;: [&#39;sports websites&#39;, &#39;official league statistics&#39;],
   &#39;sub_steps_needed&#39;: False},
    ...
  &#39;6&#39;: {&#39;step&#39;: &#39;Given the above steps taken, please respond to the users original question&#39;,
   &#39;tools&#39;: [],
   &#39;data_sources&#39;: [],
   &#39;sub_steps_needed&#39;: False}}}

it works but I want to know if theres a better way to go about this.

答案1

得分: 5

从我所看到的情况来看，建议的方法是使用Pydantic输出解析器，而不是结构化输出解析器... python.langchain.com/docs/modules/model_io/output_parsers/...（有关嵌套处理的说明在这里... youtube.com/watch?v=yD_oDTeObJY）。

例如：

from langchain.output_parsers import PydanticOutputParser
from pydantic import BaseModel, Field, validator
from typing import List, Optional
...
class PlanItem(BaseModel):
    step: str
    tools: Optional[str] = []
    data_sources: Optional[str] = []
    sub_steps_needed: str
class Plan(BaseModel):
    plan: List[PlanItem]
parser = PydanticOutputParser(pydantic_object=Plan)
parser.get_format_instructions()

英文:

From what I can see the recommended approach is to use the pydantic output parser as opposed to the structured output parser... python.langchain.com/docs/modules/model_io/output_parsers/… (and dealing with nesting explained here... youtube.com/watch?v=yD_oDTeObJY).

e.g.

from langchain.output_parsers import PydanticOutputParser
from pydantic import BaseModel, Field, validator
from typing import List, Optional
...
class PlanItem(BaseModel):
    step: str
    tools: Optional[str] = []
    data_sources: Optional[str] = []
    sub_steps_needed: str
class Plan(BaseModel):
    plan: List[PlanItem]
parser = PydanticOutputParser(pydantic_object=Plan)
parser.get_format_instructions()

通过集体智慧和协作来改善编程学习和解决问题的方式。致力于成为全球开发者共同参与的知识库，让每个人都能够通过互相帮助和分享经验来进步。

在Langchain中为嵌套的JSON定义一个输出模式。

问题

答案1

在Linux上安装tensorflow-decision-forests的问题

如何在%%cython中指定-march=native

不支持在Hydra中使用环境变量的插值类型

Dask/pandas应用函数并返回多行

如何在Playwright视觉比较中屏蔽多个定位器？

在C++中，可以使用可变模板参数来检索类型的内部类型。

selenium.common.exceptions.StaleElementReferenceException: Message: stale element reference: stale element not found

Creating and opening a URL to log in to Website via Basic Auth with Robot Framework/Selenium (Python)

AG Grid 在上下文菜单中以大文本形式打开

What's the correct way to type hint an empty list as a literal in python?

如何在Highcharts Gantt中更改本地化的星期名称

如何在同一个流中使用多个过滤器和映射函数？

如何使用Map/Set来将代码优化到O(n)？

.NET MAUI Android在GitHub Actions上构建失败，错误代码为1。