扬声器模式视频在Azure媒体服务中

huangapple go评论85阅读模式
英文:

Speaker Mode Video in Azure Media Services

问题

我有两个高分辨率本地视频录制文件,来自一次播客采访。

我想将它们合并成一个输出文件,始终显示说话者。

所以我们需要分析音轨,看谁在说话(嘉宾有优先权),然后创建说话者的时间戳数组。

使用类似于我所描述的ffmpeg进行音量分析的示例

然后,我想使用AMS根据时间戳合并视频文件(例如,host.mp4 源文件播放 20 秒,然后 guest.mp4 播放 30 秒,依此类推)。

我该如何操作?

英文:

I have two hi-res local video recording files from a podcast interview.

I would like to merge them into one output file with the speaker showing at all times.

So we'd need to analyse the audio track and see who is speaking (guest has priority) and then create an array of timestamps of the speaker.

Volume analysis example using ffmpeg similar to what I'm describing

Then I'd like to use AMS to merge the video files based on the timestamps (eg. host.mp4 source for 20 seconds then guest.mp4 for 30 seconds, etc)

How would I go about this?

答案1

得分: 2

这听起来像是Azure Video Indexer中的扬声器枚举功能。您可以在此链接中了解更多信息:https://learn.microsoft.com/en-us/azure/azure-video-indexer/video-indexer-overview#videoaudio-ai-features。

英文:

This sounds like the speaker enumeration feature in Azure Video Indexer https://learn.microsoft.com/en-us/azure/azure-video-indexer/video-indexer-overview#videoaudio-ai-features.

huangapple
  • 本文由 发表于 2023年2月6日 22:02:08
  • 转载请务必保留本文链接:https://go.coder-hub.com/75362320.html
  • azure-media-services
匿名

发表评论

匿名网友

:?: :razz: :sad: :evil: :!: :smile: :oops: :grin: :eek: :shock: :???: :cool: :lol: :mad: :twisted: :roll: :wink: :idea: :arrow: :neutral: :cry: :mrgreen:

确定