问题

我想要能够显示PDF文件的不同部分的列表，就像上显示的那样。我是通过Flutter Web通过REST API调用处理器的。

我尝试使用fieldMask从API响应中获取实体，但对于图片中的文档，我什么都没有得到，不确定应该使用哪些字段来获得所需的响应。

英文:

I want to be able to show a list of different sections of the pdf file like what is shown on the. I'm calling the processor through REST api via Flutter Web.

I tried getiing the entities from the api response using fieldMask but got nothing for the document in the picture, not sure what fields should be used to get the desired response.

答案1

得分: 2

1 文档 OCR 处理器以 Document JSON 格式返回文本和布局信息。UI 中突出显示的每个部分都是 Block 或 Paragraph，您需要解析 JSON 响应以获取每个部分的数据，包括边界框。

您可以参考文档中的处理响应 > 文本、布局和质量分数部分，了解输出的结构以及解析它的代码示例。

您还可以参考这些开源示例 Web 应用程序，展示了与您所要求的类似用例：

英文:

The Document OCR Processor returns text and layout information in the Document JSON format. Each of those sections highlighted in the UI is a Block or a Paragraph, you will need to parse the JSON response to get the data for each section including the bounding boxes.

You can refer to Handle the processing response > Text, layout, and quality scores in the documentation for explanations of how the output is structured and code samples for parsing it.

You can also refer to these open source sample web applications that show use cases similar to what you are asking:

通过集体智慧和协作来改善编程学习和解决问题的方式。致力于成为全球开发者共同参与的知识库，让每个人都能够通过互相帮助和分享经验来进步。

如何使用Document Ai提取PDF的不同部分

问题

答案1

如何使用 API 列出 GCP 项目中的所有图像 URL？

DefaultCredentialsError在尝试收集Google存储的Google默认凭据时发生。

Firestore文件系统优化

在gcloud存储cp中增长清单文件的影响。

What's the correct way to type hint an empty list as a literal in python?

如何在Highcharts Gantt中更改本地化的星期名称

如何在同一个流中使用多个过滤器和映射函数？

如何使用Map/Set来将代码优化到O(n)？

.NET MAUI Android在GitHub Actions上构建失败，错误代码为1。

如何在Playwright视觉比较中屏蔽多个定位器？

在C++中，可以使用可变模板参数来检索类型的内部类型。

selenium.common.exceptions.StaleElementReferenceException: Message: stale element reference: stale element not found

Creating and opening a URL to log in to Website via Basic Auth with Robot Framework/Selenium (Python)

AG Grid 在上下文菜单中以大文本形式打开

发表评论