2020年10月27日 22:29:07go评论97阅读模式

英文:

pdfbox - performance tuning

问题

如何优化以下代码以减少运行时间，以下代码在加载PDF后未应用任何业务逻辑时耗时为384毫秒。

有什么建议吗？

MultipartFile file= ...;
byte[] pdfByte = file.getBytes();
PDDocument pdfDoc = PDDocument.load(new ByteArrayInputStream(pdfByte));
List<PDSignature> signatures = pdfDoc.getSignatureDictionaries();
pdfDoc.close();

英文:

How to enhance the below code to take less time, the below code takes 384 ms without applying any business logic after load the PDF.

Any suggestions ?

MultipartFile file= ...;
byte[] pdfByte = file.getBytes();
PDDocument pdfDoc = PDDocument.load(new ByteArrayInputStream(pdfByte));
List&lt;PDSignature&gt; signatures = pdfDoc.getSignatureDictionaries();
pdfDoc.close();

答案1

得分: 1

从评论中可以看出，实际问题是如何加快获取签名字节的速度。使用以下代码可避免再次读取文件：

COSString contents = (COSString) signature.getCOSObject().getDictionaryObject(COSName.CONTENTS);
byte [] signatureBytes = contents.getBytes();

在 PDFBox 2.0.22 中将会有一个新的方法 PDSignature.getContents()，无需参数，它不会再次读取 PDF。

另一个加快速度的方法是这样加载 PDF：

PDDocument pdfDoc = PDDocument.load(pdfByte);

因为从 InputStream 加载会创建另一个缓冲副本。

英文:

From the comments it turns out that the real question is how to speed up getting the signature bytes. Use this code to prevent reading the file a second time:

COSString contents = (COSString) signature.getCOSObject().getDictionaryObject(COSName.CONTENTS);
byte [] signatureBytes = contents.getBytes();

In PDFBox 2.0.22 there will be a new method PDSignature.getContents() without parameters which doesn't read the PDF a second time.

Another thing to speed up is to load the PDF like this:

PDDocument pdfDoc = PDDocument.load(pdfByte);

because loading from an InputStream would create another buffered copy.

通过集体智慧和协作来改善编程学习和解决问题的方式。致力于成为全球开发者共同参与的知识库，让每个人都能够通过互相帮助和分享经验来进步。

pdfbox – 性能优化

问题

答案1

有没有方法将SQL参数的默认值设置为null？

在单个方法中实现BeforeEach。

使用双指针方法背后的直觉是：

解析 Spring 中微服务实例的 IP 地址 [jhipster]

如何在Playwright视觉比较中屏蔽多个定位器？

在C++中，可以使用可变模板参数来检索类型的内部类型。

selenium.common.exceptions.StaleElementReferenceException: Message: stale element reference: stale element not found

Creating and opening a URL to log in to Website via Basic Auth with Robot Framework/Selenium (Python)

AG Grid 在上下文菜单中以大文本形式打开

What's the correct way to type hint an empty list as a literal in python?

如何在Highcharts Gantt中更改本地化的星期名称

如何在同一个流中使用多个过滤器和映射函数？

如何使用Map/Set来将代码优化到O(n)？

.NET MAUI Android在GitHub Actions上构建失败，错误代码为1。