Skip to content
#

asr

Here are 299 public repositories matching this topic...

pytorch-kaldi

pytorch-kaldi is a project for developing state-of-the-art DNN/RNN hybrid speech recognition systems. The DNN part is managed by pytorch, while feature extraction, label computation, and decoding are performed with the kaldi toolkit.

  • Updated Jun 11, 2020
  • Python
SenYan1999
SenYan1999 commented Aug 18, 2020

I am a newcomer to the audio field. I have some questions when use this project to generate the audio embedding for my multimodality model (text and audio)

I want to use Mockingjay, and run `python preprocess_any.py --feature_type=mel' but get 80 dim features, I just simply change num_mel in utility/audio.py from 80 to 160(I see this model need 160dim mel features in README), is it right?

Th

Improve this page

Add a description, image, and links to the asr topic page so that developers can more easily learn about it.

Curate this topic

Add this topic to your repo

To associate your repository with the asr topic, visit your repo's landing page and select "manage topics."

Learn more

You can’t perform that action at this time.