2023年3月20日 23:19:26go评论71阅读模式

英文:

Working with Vision text recognition (Swift) on IOS 15.6 and memory grows ~25MB per recognition. Anyway to free up memory?

问题

我需要对数百张图像逐一进行文本识别，但每次内存会增加约25MB+。我在互联网上搜索过，但找不到是什么导致内存保留或如何释放它。通过插入断点，我可以看到每次调用imageRequestHandler.perform()时大小会增加。以下是相关代码。有什么建议吗？

英文:

I need to do text recognition on hundreds of images one at a time, but each time, memory grows by ~25mb+. I've searched the internet but can't find what is causing the memory retention or how to release it. By putting in breaks, I can see that the jump in size occurs with each call to imageRequestHandler.perform(). Below is the relevant code. Any suggestions?

    func textRecognition(image:CGImage) {
                
        let textRecognitionRequest = VNRecognizeTextRequest(
            completionHandler: self.handleDetectedText)
        textRecognitionRequest.recognitionLevel = .accurate
        textRecognitionRequest.recognitionLanguages = [&quot;en_US&quot;]
        textRecognitionRequest.usesLanguageCorrection = false
        
        // request handler
        let textRequest = [textRecognitionRequest]
        let imageRequestHandler = VNImageRequestHandler(cgImage: image, orientation: .up, options: [:])
        DispatchQueue.global(qos: .userInitiated).async {
            do {
                // perform request
                try imageRequestHandler.perform(textRequest)
            } catch let error {
                print(&quot;Error \(error)&quot;)
            }
        }
    }
    
    func handleDetectedText(request: VNRequest?, error:Error?){
        if let error = error {  print(&quot;ERROR: \(error)&quot;); return  }
        guard let results = request?.results, results.count &gt; 0 else {
            DispatchQueue.main.async {
                self.result_field.isEnabled=false
                self.result_field.text = &quot;Scan failed - Retry&quot;
                let desc = NSMutableAttributedString(string: &quot;Retake Photo&quot;, attributes: [NSAttributedString.Key.font: UIFont.systemFont(ofSize: 22, weight: .regular)])
                self.take_button.setAttributedTitle(desc, for: UIControl.State.normal)
                self.take_button.isHidden = false
                self.take_button.isEnabled = true
            }
            return // code to process the text replaced by the &#39;return&#39; statement
        }}}

答案1

得分: 1

我建议进行分析以查看占用内存的原因。然而，一般来说，我看到有两个可能需要探索以进行潜在改进的领域：

图像可能导致内存使用较高。我们（开发者）倾向于捕获“最佳”图像。然而，文本识别可能不需要高质量大图像（对较小的图像产生相同的结果）。因此，尝试在捕获和/或处理时减小图像的尺寸：
- 尝试使用 AVCapturePhotoSettings 进行实验，特别是 photoQualityPrioritization 和 photo format，
- 尝试减小 CGImage 的大小/可能将其转为黑白。然而，这些操作也不是没有成本的。或尝试将 CVPixelBuffer 直接传递给 VNImageRequestHandler（而不转换为 CGImage - 如果你从相机中获取图像的话）
查看 autoreleasepool 是否有助于内存使用。最明显的位置是 imageRequestHandler.perform：尝试在专用队列上执行它，并为该队列设置 autoreleaseFrequency: .workItem：

private static let performQueue = DispatchQueue(
        label: "your label",
        qos: .userInitiated,
        autoreleaseFrequency: .workItem, // <-- 从默认值 .never 切换
        target: .global(qos: .userInitiated) // <-- 与你现有的优先级相同
    )

// ...

Self.performQueue.async {
    try imageRequestHandler.perform(textRequest)
}

它的作用是：

该队列在执行块之前配置自动释放池，并在块执行完成后释放该池中的对象。

由于识别请求可能具有许多临时对象，你可能会从这个设置中受益，因为这些对象将在请求完成后立即释放（而不是“最终释放”）。不能保证它会有帮助，但值得一试。

再次强调，这些只是建议，但如果减少内存使用对你很重要，确实需要进行性能评估。尽管我认为 25MB 是相当合理的。

英文:

I would recommend profiling to see what takes the memory. However in general I see 2 areas that could be explored for potential improvements:

Image may be causing higher memory usage. Our (developers) tendency is to capture "the best" image. However text recognition may not need a huge high quality image (and give the same results for smaller image). So try to make your image smaller on capturing and/or processing:
- experiment with AVCapturePhotoSettings, namely photoQualityPrioritization and photo format,
- try to reduce CGImage size / maybe make it B&W. Those operations are not without the cost as well though. Or try to pass CVPixelBuffer directly to VNImageRequestHandler (without converting to CGImage - that's if you are taking image from camera)
See if autoreleasepool benefits your memory usage. The most obvious location is imageRequestHandler.perform: try to perform it on a dedicated queue, and set autoreleaseFrequency: .workItem for that queue:

private static let performQueue = DispatchQueue(
        label: &quot;your label&quot;,
        qos: .userInitiated,
        autoreleaseFrequency: .workItem, // &lt;-- switching from default .never
        target: .global(qos: .userInitiated) &lt;-- same priority as the one you have
    )

// ...

Self.performQueue.async {
    try imageRequestHandler.perform(textRequest)
}

What does it do:

> The queue configures an autorelease pool before the execution of a block, and releases the objects in that pool after the block finishes executing.

Since the recognition request may have a lot of temporary objects, you may benefit from this setting, since the objects will be immediately released after the request completes (instead of "eventually"). There's no guarantee that it will help, but worth a try.

Again, those are suggestions, but really performance evaluation is needed if reducing memory is that important for you. Although I think 25MB is quite reasonable, to be honest.

通过集体智慧和协作来改善编程学习和解决问题的方式。致力于成为全球开发者共同参与的知识库，让每个人都能够通过互相帮助和分享经验来进步。

Working with Vision text recognition (Swift) on IOS 15.6 and memory grows ~25MB per recognition. Anyway to free up memory?

问题

答案1

Swift Package Manager支持正则表达式字面量吗？

SwiftUI：如何在List的editMode中使Spacer不可选择？

如何在SceneKit中切换物理体类型

在线程池中始终以最大值搜索特定节点

What's the correct way to type hint an empty list as a literal in python?

如何在Highcharts Gantt中更改本地化的星期名称

如何在同一个流中使用多个过滤器和映射函数？

如何使用Map/Set来将代码优化到O(n)？

.NET MAUI Android在GitHub Actions上构建失败，错误代码为1。

如何在Playwright视觉比较中屏蔽多个定位器？

在C++中，可以使用可变模板参数来检索类型的内部类型。

selenium.common.exceptions.StaleElementReferenceException: Message: stale element reference: stale element not found

Creating and opening a URL to log in to Website via Basic Auth with Robot Framework/Selenium (Python)

AG Grid 在上下文菜单中以大文本形式打开

发表评论