Home | 简体中文 | 繁体中文 | 杂文 | Github | 知乎专栏 | Facebook | Linkedin | Youtube | 打赏(Donations) | About
知乎专栏

15.5. Automatic Speech Recognition

15.5.1. kaldi

         
docker run -it kaldiasr/kaldi:latest bash
docker run -it --runtime=nvidia kaldiasr/kaldi:gpu-latest bash
        
        
         
docker run -it kaldiasr/kaldi:latest bash        
        
        

15.5.2. OpenAI Whisper

https://github.com/openai/whisper

         
import openai
audio_file= open("/path/to/file/audio.mp3", "rb")
transcript = openai.Audio.transcribe("whisper-1", audio_file)