2020年5月29日 12:59:43go评论94阅读模式

英文:

Java Apache poi: Word - Unable to extract specific texts from document along with numbering and tables

问题

无法从文档中提取特定文本，包括编号和表格。

有关如何解决这个问题的任何想法吗？

英文:

Unable to extract specific texts from document along with numbering and tables.

Any ideas on how to solve this?

答案1

得分: 1

你需要设置位置，仅替换具有以下格式的文本：

r.setText(text, 0);

对于表格，你需要按照以下方式查找：

for (XWPFTable tbl : doc.getTables()) {
    for (XWPFTableRow row : tbl.getRows()) {
        for (XWPFTableCell cell : row.getTableCells()) {
            for (XWPFParagraph p : cell.getParagraphs()) {
                for (XWPFRun r : p.getRuns()) {
                    // ...
                }
            }
            // 替换具有嵌套表格的值
            for (XWPFTable tbl2 : cell.getTables()) {
                for (XWPFTableRow row2 : tbl2.getRows()) {
                    for (XWPFTableCell cell : row.getTableCells()) {
                        for (XWPFParagraph p : cell.getParagraphs()) {
                            for (XWPFRun r : p.getRuns()) {
                                // ...
                            }
                        }
                    }
                }
            }
        }
    }
}

英文:

You need to set with the position to replace only the text with format

r.setText(text, 0);

For Table u need to find this way

    for (XWPFTableRow row : tbl.getRows()) {
	 for (XWPFTableCell cell : row.getTableCells()) {
      for (XWPFParagraph p : cell.getParagraphs()) {
		for (XWPFRun r : p.getRuns()) {
         .....
        }}
        // Replace values with nested table 
        for (XWPFTable tbl2 : cell.getTables()) {
		 for (XWPFTableRow row2 : tbl2.getRows()){
          for (XWPFTableCell cell : row.getTableCells()) {
           for (XWPFParagraph p : cell.getParagraphs()) {
		    for (XWPFRun r : p.getRuns()) {
            ...
          }}
        }}}

通过集体智慧和协作来改善编程学习和解决问题的方式。致力于成为全球开发者共同参与的知识库，让每个人都能够通过互相帮助和分享经验来进步。

Java Apache poi: Word – 无法提取文档中带有编号和表格的特定文本

问题

答案1

如何使用RestTemplate获取结果并将响应放入List中？

如何在子类中将受保护的静态字段改为公共字段？

更简便的方式将索引（0-7）翻译为字母（A-H）。

Android应用程序在运行一段时间后会自动被终止。

如何在Playwright视觉比较中屏蔽多个定位器？

在C++中，可以使用可变模板参数来检索类型的内部类型。

selenium.common.exceptions.StaleElementReferenceException: Message: stale element reference: stale element not found

Creating and opening a URL to log in to Website via Basic Auth with Robot Framework/Selenium (Python)

AG Grid 在上下文菜单中以大文本形式打开

What's the correct way to type hint an empty list as a literal in python?

如何在Highcharts Gantt中更改本地化的星期名称

如何在同一个流中使用多个过滤器和映射函数？

如何使用Map/Set来将代码优化到O(n)？

.NET MAUI Android在GitHub Actions上构建失败，错误代码为1。