2023年3月8日 15:51:43go评论115阅读模式

英文:

Realtime conversion of MP3 to PCM using minimp3 produces noise/gaps at beginning of frame

问题

I'm using minimp3 (https://github.com/lieff/minimp3) to implement an audio pipeline where I stream MP3 data to an Unreal Engine 5 client using Web Sockets. Once received, I add the data to a TArray (acting as a buffer), and then decode as much as I can to PCM. Once decoded I add the new samples to an audio source. I then repeat this with all MP3 frames in the data stream.

我正在使用minimp3 (https://github.com/lieff/minimp3) 来实现一个音频流程，通过Web Sockets将MP3数据传送到虚幻引擎5客户端。一旦接收到数据，我将数据添加到一个TArray中（作为缓冲区），然后尽可能多地解码成PCM格式。解码后，我将新的样本添加到音频源。然后我会在数据流中的所有MP3帧上重复这个过程。

I've confirmed that the MP3 is being transmitted correctly, however, the converted PCM has gaps/noise at the beginning of chunks of MP3 frames that distort the audio during playback.

我已经确认MP3正在正确传输，但是转换后的PCM在MP3帧的开始处有间隙/噪音，在播放过程中扭曲了音频。

The noise/gaps seem to occur at the beginning of new chunks. My initial theory was that it's due to the byte reservoir system of MP3 (given that the gap is roughly 3 frames long every time), but I thought minimp3 is supposed to account for that? It's the same instance of mp3d between chunks.

噪音/间隙似乎出现在新块的开头。我的最初理论是这是由于MP3的字节储存系统引起的（考虑到每次间隙大约有3帧长），但我以为minimp3应该会处理这个问题？每个块之间都是同一个mp3d实例。

What do I need to do to fix the decoding issues at the beginning of the chunks?

我需要做什么来解决块的开头解码问题？

英文:

I've confirmed that the MP3 is being transmitted correctly, however, the converted PCM has gaps/noise at the beginning of chunks of MP3 frames that distort the audio during playback.

Attached Code:

https://gist.github.com/DeveloperHarris/c3bd30d6cc689f22ec9e8fc839c80655

The top audio in Audacity is a test MP3 comprised of a 100Hz sine wave received by the Unreal Engine client.
The bottom track is the converted PCM stream.

What do I need to do to fix the decoding issues at the beginning of the chunks?

答案1

得分: 1

Turns out, I needed to read the library docs more carefully:

The decoder will analyze the input buffer to properly sync with the MP3 stream, and will skip ID3 data, as well as any data which is not valid. Short buffers may cause false sync and can produce 'squealing' artefacts. The bigger the size of the input buffer, the more reliable the sync procedure. We recommend having as many as 10 consecutive MP3 frames (~16KB) in the input buffer at a time.

Adding a check at the beginning of my decode function that checks if it has 10 consecutive frames was enough to fix the issue.

英文:

Turns out, I needed to read the library docs more carefully:

Adding a check at the beginning of my decode function that checks if it has 10 consecutive frames was enough to fix the issue.

通过集体智慧和协作来改善编程学习和解决问题的方式。致力于成为全球开发者共同参与的知识库，让每个人都能够通过互相帮助和分享经验来进步。

MP3转PCM的实时转换使用minimp3在帧开始处产生噪音/间隙。

问题

答案1

Is Internet Explorer group policy setting "EnableGlobalWindowListInIEMode" available on Windows Server 2016 / 2019?

golang: call C++ code in cross platform

寻找大型 switch-case 和 if-else 结构的最有效方法

Problems in implementing adaptive thresholding using CUDA

What's the correct way to type hint an empty list as a literal in python?

如何在Highcharts Gantt中更改本地化的星期名称

如何在同一个流中使用多个过滤器和映射函数？

如何使用Map/Set来将代码优化到O(n)？

.NET MAUI Android在GitHub Actions上构建失败，错误代码为1。

如何在Playwright视觉比较中屏蔽多个定位器？

在C++中，可以使用可变模板参数来检索类型的内部类型。

selenium.common.exceptions.StaleElementReferenceException: Message: stale element reference: stale element not found

Creating and opening a URL to log in to Website via Basic Auth with Robot Framework/Selenium (Python)

AG Grid 在上下文菜单中以大文本形式打开

发表评论