Skip to content
#

asr

Here are 509 public repositories matching this topic...

david-95
david-95 commented Aug 8, 2022

English text: The girls stared at their father. Mrs. Bennet said only, "Nonsense, nonsense!"

remove ! then the error disappeared.

Error Log:

Traceback (most recent call last):
File "readText_Paddle.py", line 1175, in
englishTxt2WavbyPaddle(mp3filepath,textforSpeak)
File "readText_Paddle.py", line 147, in englishTxt2WavbyPaddle
wav_file = tts_executor(
File "

profthecopyright
profthecopyright commented May 21, 2022

1. Recorder.SampleData()
(1) 命名建议:这个方法的功能相当于是重采样,而且本质上只会降低采样率,建议改名为ResampleData/DownsampleData更合适。
(2) 现在的的降低采样率实现方式只是按比例舍弃采样(decimation),但是从信号处理的角度,单纯这样做的话会有aliasing effect(混叠),引起严重失真(注意,此现象并非由于采样点减少本身导致的音质变差)。标准的降低采样率方式应当在处理之前加入低通滤波过程。如采样比非整数,则应先提高采样率再降低采样率。参见https://zh.wikipedia.org/zh-my/%E9%99%8D%E9%87%87%E6%A0%B7

2. pcmAbsSum
目前音量显示用信号绝对值之和的平均值,此方法据我所知并非标准做法,而且不反映能量/功率(power)。常

good first issue
pytorch-kaldi

pytorch-kaldi is a project for developing state-of-the-art DNN/RNN hybrid speech recognition systems. The DNN part is managed by pytorch, while feature extraction, label computation, and decoding are performed with the kaldi toolkit.

  • Updated Mar 14, 2022
  • Python

Improve this page

Add a description, image, and links to the asr topic page so that developers can more easily learn about it.

Curate this topic

Add this topic to your repo

To associate your repository with the asr topic, visit your repo's landing page and select "manage topics."

Learn more