2023年7月3日 22:22:01go评论63阅读模式

英文:

Set processOptions for doc ai ocr api request

问题

我们建议在我们的OCR处理器上激活一个标志以获得更好的结果，因为我们面临一些问题（比如 "I" 被解析为 "1"）。为了测试这个标志，我想使用Postman，但是在我的请求中添加选项会导致错误。

我的请求：

{
  "skipHumanReview": true,
  "processOptions": {
    ...
  },
  "rawDocument": {
    "mimeType": "application/pdf",
    "content": "..."
  },
  "fieldMask": "text,pages.blocks,pages.paragraphs,pages.formFields"
}

在没有 processOptions 部分的情况下，请求运行正常。但是 processOptions 未被识别。结果如下：

{
  ...
  "message": "Invalid JSON payload received. Unknown name \"processOptions\": Cannot find field."
  ...
}

所使用的处理器版本是 pretrained-ocr-v1.2-2022-11-10。我不知道是否使用了错误的版本（我看到的是最新的版本），还是我的请求有问题（我没有找到带有此标志的示例）。在我们的应用程序中，我们不使用提供的包，而是使用自定义代码（APEX）构建请求，因此我使用那种方式也会遇到同样的问题。

更新：
我找到了一些示例，我确信我的请求是正确的。那么我猜这必须是版本的问题，但 pretrained-ocr-v1.2-2022-11-10 是我可以获取到的最新版本。

有人能解释为什么吗？

英文:

We are recommended to activate a flag on our OCR processor for better results because we face some problems (like "I" parsed as "1". To test the flag I want to use Postman, but adding the options to my requests results in an error.
My Request:

{

 &quot;skipHumanReview&quot;: true,
 &quot;processOptions&quot;: {
    ...
  },

  &quot;rawDocument&quot;: {
    &quot;mimeType&quot;: &quot;application/pdf&quot;,
    &quot;content&quot;: &quot;...&quot;
  },

  &quot;fieldMask&quot;: &quot;text,pages.blocks,pages.paragraphs,pages.formFields&quot;

}

Without the processOptions part, the requests works fine. ProcessOptions is not recognized. The Result is:

{
...
"message": "Invalid JSON payload received. Unknown name "processOptions": Cannot find field."
...
}

The processor version used is pretrained-ocr-v1.2-2022-11-10
I don't know if I have a wrong version (the one used is the newest I see) or if my request is wrong (I didn't find an example with this flag).
In our application we don't use the provided packages, but build the requests with custom code (APEX), so I will have the same problem using that.

Update:
I found some examples and I am sure, that my request is correct. Then I guess it has to be the version, but pretrained-ocr-v1.2-2022-11-10 is the newset one I get offered

Can somebody explain why?

答案1

得分: 0

目前，只有在使用 v1beta3 终端时才能指定 ProcessOptions。 https://documentai.googleapis.com/v1beta3

此文档显示如何将该字段添加到 processRequest。

此 Python 示例显示如何向 ocr_config 添加一些选项，一旦这些选项添加到 v1 终端中，官方示例将会更新以包括此字段。

https://github.com/GoogleCloudPlatform/document-ai-samples/tree/main/pdf-embedded-text

关于此问题的先前答案：

英文:

Currently, ProcessOptions can only be specified when using the v1beta3 endpoint. https://documentai.googleapis.com/v1beta3

This documentation shows how to add the field to the processRequest

This Python sample shows how to add some options to ocr_config the official samples will be updated to include this field once the options are added to the v1 endpoint.

https://github.com/GoogleCloudPlatform/document-ai-samples/tree/main/pdf-embedded-text

Previous Answers about this:

通过集体智慧和协作来改善编程学习和解决问题的方式。致力于成为全球开发者共同参与的知识库，让每个人都能够通过互相帮助和分享经验来进步。

为文档 AI OCR API 请求设置 processOptions

问题

答案1

Document AI批处理操作返回不同的有效载荷

如何使用Document Ai提取PDF的不同部分

在Google Document AI API中指定文档语言。

如何从扩展虚拟类的触发器处理程序中调用静态方法在Apex中？

What's the correct way to type hint an empty list as a literal in python?

如何在Highcharts Gantt中更改本地化的星期名称

如何在同一个流中使用多个过滤器和映射函数？

如何使用Map/Set来将代码优化到O(n)？

.NET MAUI Android在GitHub Actions上构建失败，错误代码为1。

如何在Playwright视觉比较中屏蔽多个定位器？

在C++中，可以使用可变模板参数来检索类型的内部类型。

selenium.common.exceptions.StaleElementReferenceException: Message: stale element reference: stale element not found

Creating and opening a URL to log in to Website via Basic Auth with Robot Framework/Selenium (Python)

AG Grid 在上下文菜单中以大文本形式打开

发表评论