问题

&lt;!-- begin snippet: js hide: false console: true babel: false --&gt;

&lt;!-- language: lang-html --&gt;

import openai


openai.organization = &quot;org-xxxxxx&quot;
openai.api_key = &quot;sk-xxxxx&quot;

audio_file_path =  &quot;/Users/tejaksha/Downloads/dhoni.mp4&quot;

# 注意：要使下面的代码工作，您需要使用 OpenAI Python v0.27.0

audio_file= open(audio_file_path, &quot;rb&quot;)
transcript = openai.Audio.transcribe(&quot;whisper-1&quot;, audio_file)

&lt;!-- end snippet --&gt;

在上面的代码中，我能够获得以下输出

&lt;!-- begin snippet: js hide: false console: true babel: false --&gt;

&lt;!-- language: lang-html --&gt;

{
    &quot;text&quot;: &quot;Flat back, just got a little tight to him, he was wagging for it, set up for the slower ball and punished it. The one&#39;s going straight down the ground. And MS Daini just taking control.&quot;
}

&lt;!-- end snippet --&gt;

但我想要的格式如下，带有时间戳，如何使用 OPENAI 转录获得？

我需要的实际格式是

&lt;!-- begin snippet: js hide: false console: true babel: false --&gt;

&lt;!-- language: lang-js --&gt;

{
  &quot;transcript&quot;: [
    {
      &quot;text&quot;: &quot;[Music]&quot;,
      &quot;start&quot;: 7.39,
      &quot;duration&quot;: 4.1
    },
    {
      &quot;text&quot;: &quot;once upon a time&quot;,
      &quot;start&quot;: 16.48,
      &quot;duration&quot;: 4.4
    },
    {
      &quot;text&quot;: &quot;in ancient china there lived three&quot;,
      &quot;start&quot;: 17.6,
      &quot;duration&quot;: 6.64
    },
    {
      &quot;text&quot;: &quot;old monks their names are not remembered&quot;,
      &quot;start&quot;: 20.88,
      &quot;duration&quot;: 6.559
    }
  ]
}

&lt;!-- end snippet --&gt;

英文:

import openai


openai.organization = &quot;org-xxxxxx&quot;
openai.api_key = &quot;sk-xxxxx&quot;

audio_file_path =  &quot;/Users/tejaksha/Downloads/dhoni.mp4&quot;

# Note: you need to be using OpenAI Python v0.27.0 for the code below to work

audio_file= open(audio_file_path, &quot;rb&quot;)
transcript = openai.Audio.transcribe(&quot;whisper-1&quot;, audio_file)

In the above code i as able to get the output

{
    &quot;text&quot;: &quot;Flat back, just got a little tight to him, he was wagging for it, set up for the slower ball and punished it. The one&#39;s going straight down the ground. And MS Daini just taking control.&quot;
}

But i want as the following format with timestamp how to get using OPENAI transcription?

Acutual format that is required for me is

{
  &quot;transcript&quot;: [
    {
      &quot;text&quot;: &quot;[Music]&quot;,
      &quot;start&quot;: 7.39,
      &quot;duration&quot;: 4.1
    },
    {
      &quot;text&quot;: &quot;once upon a time&quot;,
      &quot;start&quot;: 16.48,
      &quot;duration&quot;: 4.4
    },
    {
      &quot;text&quot;: &quot;in ancient china there lived three&quot;,
      &quot;start&quot;: 17.6,
      &quot;duration&quot;: 6.64
    },
    {
      &quot;text&quot;: &quot;old monks their names are not remembered&quot;,
      &quot;start&quot;: 20.88,
      &quot;duration&quot;: 6.559
    }
  ]
}

答案1

得分: 1

我相信 OpenAI API 不支持这样的功能。然而，你可以使用 whisper 库并返回时间戳。

import whisper
model = whisper.load_model("base")
audio = whisper.load_audio(ASRPage.output_file_path)
result = model.transcribe(audio)
print(result["segments"])

这意味着你需要拥有自己的 GPU 或个人电脑来运行推断。

英文:

I believe that the OpenAI API does not support such feature. However, you can use the whisper library and return the timestamps.

import whisper
model = whisper.load_model(&quot;base&quot;)
audio = whisper.load_audio(ASRPage.output_file_path)
result = model.transcribe(audio)
print(result[&quot;segments&quot;])

This does mean that you need to your own GPU or pc to run the inference.

通过集体智慧和协作来改善编程学习和解决问题的方式。致力于成为全球开发者共同参与的知识库，让每个人都能够通过互相帮助和分享经验来进步。

如何在Python中以JSON格式格式化OpenAI转录并包含时间戳？

问题

答案1

在CSV文件中为列添加前导零。

Canvas由于方法问题未正确绘制线条。

“KeyError: ‘cut’ not found in axis”

字典转换为带有列表作为值的数据框

What's the correct way to type hint an empty list as a literal in python?

如何在Highcharts Gantt中更改本地化的星期名称

如何在同一个流中使用多个过滤器和映射函数？

如何使用Map/Set来将代码优化到O(n)？

.NET MAUI Android在GitHub Actions上构建失败，错误代码为1。

如何在Playwright视觉比较中屏蔽多个定位器？

在C++中，可以使用可变模板参数来检索类型的内部类型。

selenium.common.exceptions.StaleElementReferenceException: Message: stale element reference: stale element not found

Creating and opening a URL to log in to Website via Basic Auth with Robot Framework/Selenium (Python)

AG Grid 在上下文菜单中以大文本形式打开

发表评论