speech
Here are 628 public repositories matching this topic...
After training, should the graph be frozen using a script like this one: https://github.com/tensorflow/tensorflow/blob/master/tensorflow/python/tools/freeze_graph.py ?
Which input and output node names should be used for freezing and, later, for inference?
Thanks!
Due to the 2GB contiguous block allocation limitation of 32 bit systems, this line:
https://github.com/readbeyond/aeneas/blob/master/aeneas/audiofile.py#L447
might throw a MemoryError, causing an "unexpected exception".
We should:
- catch it and log a meaningful error
- document this limitation somewhere
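The first suggestion above could be sketched roughly as follows. This is a minimal illustration and not aeneas code: `load_samples`, its `loader` argument, and `AudioTooLargeError` are hypothetical names standing in for whatever call allocates the audio buffer.

```python
import logging

logger = logging.getLogger("audio")


class AudioTooLargeError(Exception):
    """Raised when an audio buffer cannot be allocated (hypothetical name)."""


def load_samples(loader, path):
    """Call loader(path) and turn a MemoryError into a descriptive error.

    `loader` is any callable that reads the audio file into memory; it is an
    assumption of this sketch, not part of the aeneas API.
    """
    try:
        return loader(path)
    except MemoryError as exc:
        message = (
            "Not enough contiguous memory to load '%s'. "
            "On 32-bit systems a single allocation is limited to about 2 GB; "
            "try a shorter file or a 64-bit Python." % path
        )
        logger.error(message)
        # Chain the original exception so the traceback is still available.
        raise AudioTooLargeError(message) from exc
```

Raising a dedicated exception type lets callers report the limitation explicitly instead of falling through to a generic "unexpected exception" handler.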
We would benefit from a tutorial on how to convert DNN models trained with either HTK 3.5.2-BETA or Kaldi. I am specifically interested in how you converted the Kaldi-trained DNN models that were later used in the Japanese speech dictation published here.
🐛 Bug
In order to enable inversion of an STFT via the inverse STFT in istft, it is sufficient that the signal windowing obeys the constraint of “Constant OverLap Add” (COLA). This ensures that every point in the input data is equally weighted, thereby avoiding aliasing and allowing full reconstruction.
https://docs.scipy.org/doc/scipy/reference/generated/scipy.signal.check_COLA.html
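To make the COLA constraint concrete, here is a small pure-Python sketch of the idea behind the check. It is a simplified reimplementation for illustration only, not SciPy's code; `periodic_hann` and `is_cola` are names invented for this example.

```python
import math


def periodic_hann(n):
    """Periodic Hann window of length n."""
    return [0.5 - 0.5 * math.cos(2.0 * math.pi * i / n) for i in range(n)]


def is_cola(window, hop, tol=1e-10):
    """Return True if copies of `window` shifted by `hop` samples sum to a constant."""
    n = len(window)
    if not 0 < hop <= n:
        raise ValueError("hop must be between 1 and len(window)")
    # Fold the window onto one hop-length period: position i of the window
    # overlaps with every other sample whose index is congruent to i mod hop.
    sums = [0.0] * hop
    for i, w in enumerate(window):
        sums[i % hop] += w
    # COLA holds when every position receives the same total weight.
    return max(sums) - min(sums) < tol


hann = periodic_hann(8)
print(is_cola(hann, hop=4))  # 50% overlap
print(is_cola(hann, hop=6))  # 25% overlap
```

With the periodic Hann window, the 50% overlap case satisfies the constraint and the 25% overlap case does not, which is why the choice of window and hop size matters for reconstruction.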
Le
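@lfchener I see that the line of code raising this error was added by you recently. I installed my environment with pip and followed the README step by step, but running infer fails with an error. Is the README out of date? Thanks.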
Traceback (most recent call last):
File "/home/shenxz/.IdeaIC2019.2/config/plugins/python-ce/helpers/pydev/pydevd.py", line 1415, in _exec
pydev_imports.execfile(file, globals, locals) # execute the script
File "/home/shenxz/code/DeepSpeech/infer.py", line 152, in <module>
main()
File "/home/sh
It looks like the default initialization scheme with tf.get_variable is Glorot uniform, but the paper uses He's Gaussian initialization.
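The difference between the two schemes can be illustrated with a short plain-Python sketch. It uses the standard textbook definitions of Glorot uniform and He normal initialization and does not call TensorFlow or reproduce the repository's code; the function names are chosen for this example only.

```python
import math
import random


def glorot_uniform(fan_in, fan_out, rng):
    """Sample weights from U(-limit, limit), limit = sqrt(6 / (fan_in + fan_out))."""
    limit = math.sqrt(6.0 / (fan_in + fan_out))
    return [[rng.uniform(-limit, limit) for _ in range(fan_out)]
            for _ in range(fan_in)]


def he_normal(fan_in, fan_out, rng):
    """Sample weights from a zero-mean Gaussian with std = sqrt(2 / fan_in)."""
    std = math.sqrt(2.0 / fan_in)
    return [[rng.gauss(0.0, std) for _ in range(fan_out)]
            for _ in range(fan_in)]


rng = random.Random(0)  # fixed seed so the sketch is reproducible
w_glorot = glorot_uniform(256, 128, rng)
w_he = he_normal(256, 128, rng)
```

Glorot scaling depends on both fan-in and fan-out, while He scaling depends only on fan-in and is usually paired with ReLU activations, so the two produce weights with different variances for the same layer shape.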
hi,
as you know, the number of filters in SoLoud is limited.
We should implement more, such as different reverbs, FIR and IIR filters (these could be used to implement HRTF support), Chorus, One Pole, One Zero, Pole Zero, Two Pole, Two Zero, etc.
A library called stk, under the zlib license, already implements these; maybe we can implement some of them based on it.
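As an illustration of the simplest filter in that list, here is a sketch of a one-pole low-pass filter. It is written in Python for brevity and is neither SoLoud nor STK code; the class name and the coefficient `a` are assumptions made for this example.

```python
class OnePoleLowPass:
    """One-pole low-pass: y[n] = (1 - a) * x[n] + a * y[n - 1], with 0 <= a < 1."""

    def __init__(self, a):
        if not 0.0 <= a < 1.0:
            raise ValueError("a must be in [0, 1) for a stable filter")
        self.a = a
        self.prev = 0.0  # y[n - 1], the single state variable

    def process(self, samples):
        """Filter a sequence of samples, keeping state between calls."""
        out = []
        for x in samples:
            self.prev = (1.0 - self.a) * x + self.a * self.prev
            out.append(self.prev)
        return out


lp = OnePoleLowPass(0.5)
print(lp.process([1.0, 1.0, 1.0, 1.0]))
```

Fed a constant input, the output rises smoothly toward that value; larger values of `a` give heavier smoothing. The other filters named above follow the same pattern with more coefficients and more state.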
Seek performance
I think the filenames in models.sh referred to on lines 4-9 should refer to kaldi-generic-en-tdnn_f-r20190609* which is downloaded on line 3.
Currently (6.0.15), inline comments (marked by ;) in lines with command names are interpreted to be part of the command name:
Create TextGrid: 0, 1, "tier", "" ; Good
asserterror Command "Remove ; Bad!"
... not available for current selection.
Remove ; Bad!
As shown in the snippet, a comment next to the first command (after the argument list) is correctly ignored, but when used
Firebase functions can run for at most 9 minutes.
https://cloud.google.com/functions/docs/concepts/exec#timeout
Is there any way to use a webhook or GCP Pub/Sub to run a long asynchronous job from inside GCP functions?
https://cloud.google.com/community/tutorials/cloud-functions-async
The final goal is to produce a functional Cboard app for iOS.
We have to use the Cordova repository called ccboard, which already produces the Android app:
https://github.com/cboard-org/ccboard
- Update readme file to include app compilation and production
- Look for bugs when running the app on iOS
- Write JavaDoc comments for all class variables and functions.
- In many cases, there are multi-line JavaDoc comments where the opening /** and closing */ could be moved onto the same line as the comment itself.
- There are functions with unfinished JavaDoc comments.
- JavaDoc comments on classes can be removed if they have no useful information (Ex. A one-line description of t

See this thread
https://groups.google.com/forum/?utm_medium=email&utm_source=footer#!msg/kaldi-help/1AMGQ54tn0E/kacpkoizAAAJ
for the problem. With CUDA 10.1, the rpath we add for CUDA is not sufficient because the cuBLAS library is installed somewhere else. Opening this issue so we don't lose track of it.