#

mfcc

Here are 137 public repositories matching this topic...

ddbourgin / numpy-ml

Star

Machine learning, in numpy

machine-learning reinforcement-learning word2vec lstm neural-networks gaussian-mixture-models vae topic-modeling attention resnet bayesian-inference wavenet mfcc knn gaussian-processes hidden-markov-models gradient-boosting wgan-gp good-turing-smoothing

Updated Aug 19, 2020
Python

aubio / aubio

Star

a library for audio and music analysis

audio python music c annotation analysis extraction sound beat mfcc onset pitch tempo-tracking

Updated Jul 2, 2020
C

adamstark / Gist

Star

A C++ Library for Audio Analysis

audio music gist c-plus-plus audio-analysis music-information-retrieval fft mfcc mir spectral-analysis pitch-tracking onset-detection

Updated Feb 29, 2020
C++

gionanide / Speech_Signal_Processing_and_Classification

Star

Front-end speech processing aims at extracting proper features from short- term segments of a speech utterance, known as frames. It is a pre-requisite step toward any pattern recognition problem employing speech or audio (e.g., music). Here, we are interesting in voice disorder classification. That is, to develop two-class classifiers, which can discriminate between utterances of a subject suffering from say vocal fold paralysis and utterances of a healthy subject.The mathematical modeling of the speech production system in humans suggests that an all-pole system function is justified [1-3]. As a consequence, linear prediction coefficients (LPCs) constitute a first choice for modeling the magnitute of the short-term spectrum of speech. LPC-derived cepstral coefficients are guaranteed to discriminate between the system (e.g., vocal tract) contribution and that of the excitation. Taking into account the characteristics of the human ear, the mel-frequency cepstral coefficients (MFCCs) emerged as descriptive features of the speech spectral envelope. Similarly to MFCCs, the perceptual linear prediction coefficients (PLPs) could also be derived. The aforementioned sort of speaking tradi- tional features will be tested against agnostic-features extracted by convolu- tive neural networks (CNNs) (e.g., auto-encoders) [4]. The pattern recognition step will be based on Gaussian Mixture Model based classifiers,K-nearest neighbor classifiers, Bayes classifiers, as well as Deep Neural Networks. The Massachussets Eye and Ear Infirmary Dataset (MEEI-Dataset) [5] will be exploited. At the application level, a library for feature extraction and classification in Python will be developed. Credible publicly available resources will be 1used toward achieving our goal, such as KALDI. Comparisons will be made against [6-8].

nlp classifier natural-language-processing feature-extraction nltk gaussian-mixture-models support-vector-machines mfcc principal-component-analysis speech-processing linear-discriminant-analysis isomap spectral-clustering long-short-term-memory kernel-pca spectral-embedding locally-linear-embedding linear-prediction-coefficients speech-utterance

Updated Jul 15, 2020
Python

ar1st0crat / NWaves

Star

.NET library for 1D signal processing focused specifically on audio processing

audio sound-effects dsp fda wav feature-extraction noise signal lpc resampling filtering mfcc pitch mir adaptive-filtering psychoacoustics wavelets sound-synthesis time-stretch

Updated Aug 4, 2020
C#

x4nth055 / emotion-recognition-using-speech

Star

Building and training Speech Emotion Recognizer that predicts human emotions using Python, Sci-kit learn and Keras

machine-learning deep-learning sklearn keras recurrent-neural-networks feature-extraction neural-networks support-vector-machine mfcc librosa emotion-detection gradient-boosting emotion-recognition kneighborsclassifier random-forest-classifier mlp-classifier speech-emotion-recognition emotion-recognizer

Updated Jun 6, 2020
Python

amanbasu / speech-emotion-recognition

Star

Detecting emotions using MFCC features of human speech using Deep Learning

deep-learning tensorflow rnn mfcc

Updated Nov 16, 2018
Jupyter Notebook

GauravWaghmare / Speaker-Identification

Star

A program for automatic speaker identification using deep learning techniques.

keras mfcc speaker-recognition speaker-verification

Updated Feb 28, 2017
Python

tympanix / subsync

Star

Synchronize your subtitles using machine learning

machine-learning neural-network delay subtitles subtitle fix mfcc shift subsync speech-detection shift-subtitle

Updated Mar 30, 2020
Python

SuperKogito / Voice-based-gender-recognition

Star

🔉

👦

👧Voice based gender recognition using Mel-frequency cepstrum coefficients (MFCC) and Gaussian mixture models (GMM)

data-science machine-learning scikit-learn voice speech gaussian-mixture-models signal gender-recognition gender gmm mfcc speaker gender-classification vocal gender-recognition-by-voice gender-detection mel-frequencies scikit-learn-python

Updated Sep 26, 2019
Python

MycroftAI / sonopy

Star

A simple audio feature extraction library

library sound spectrogram mfcc audio-processing mel-spectrogram

Updated Jul 3, 2019
Python

ZitengWang / python_kaldi_features

Star

python codes to extract MFCC and FBANK speech features for Kaldi

Updated Nov 28, 2018
Python

spafe

SuperKogito / spafe

Star

Open

Error in preprocessing

4

ngragaei commented Jul 27, 2020

frames[-1] = np.append(frames[-1], np.array([0]*(frame_length - len(frames[0]))))

TypeError: can't multiply sequence by non-int of type 'float'

Read more

bug good first issue spafe.utils

Open

add a VAD

Open

missing tests for utils.cepstral.py

Find more good first issues →

supikiti / PNCC

Star

A implementation of Power Normalized Cepstral Coefficients: PNCC

deep-learning speech-recognition mfcc speech-processing robustness pncc speech-enhancement

Updated Aug 11, 2019
Python

georgid / AlignmentDuration

Star

Lyrics-to-audio-alignement system. Based on Machine Learning Algorithms: Hidden Markov Models with Viterbi forced alignment. The alignment is explicitly aware of durations of musical notes. The phonetic model are classified with MLP Deep Neural Network.

python music duration synchronization research deep-learning signal-processing lyrics decoding music-information-retrieval neural-networks alignment hidden-markov-model gmm mfcc upf htk

Updated Mar 9, 2020
Python

aubio / vamp-aubio-plugins

Star

aubio plugins for Vamp

audio music analysis music-information-retrieval beat mfcc aubio onset tempo-tracking tempo tempo-detection onset-detection beat-tracking beat-detection vamp-plugins

Updated Dec 4, 2017
C++

FragJage / SpeakerVoiceIdentifier

Star

SpeakerVoiceIdentifier can recognize the voice of a speaker by learning.

learning classifier recognition voice gmm mfcc speaker identifier

Updated Feb 20, 2017
C++

Live-Audio-MFCC

pulakk / Live-Audio-MFCC

Star

Live Audio MFCC Visualization in the browser using Web Audio API - https://pulakk.github.io/Live-Audio-MFCC/tutorial

node-js mfcc p5-sketches webaudioapi meyda

Updated Jan 3, 2020
JavaScript

zafarrafii / Z

Star

This repository contains a Matlab class, a Python module, a Jupyter notebook, and a Julia module which implement/illustrate several methods/functions for audio signal processing.

audio python music julia matlab jupyter-notebook stft mfcc audio-signal-processing dct dst cqt istft chromagram mdct imdct cqt-kernel cqt-spectrogram

Updated Jul 9, 2020
Jupyter Notebook

skaws2003 / pytorch-mfcc

Star

A pytorch implementation of MFCC.

Updated Aug 13, 2019
Python

ansleliu / ConvolutionaNeuralNetworksToEnhanceCodedSpeech

Star

In this work we propose two postprocessing approaches applying convolutional neural networks (CNNs) either in the time domain or the cepstral domain to enhance the coded speech without any modification of the codecs. The time domain approach follows an end-to-end fashion, while the cepstral domain approach uses analysis-synthesis with cepstral domain features. The proposed postprocessors in both domains are evaluated for various narrowband and wideband speech codecs in a wide range of conditions. The proposed postprocessor improves speech quality (PESQ) by up to 0.25 MOS-LQO points for G.711, 0.30 points for G.726, 0.82 points for G.722, and 0.26 points for adaptive multirate wideband codec (AMR-WB). In a subjective CCR listening test, the proposed postprocessor on G.711-coded speech exceeds the speech quality of an ITU-T-standardized postfilter by 0.36 CMOS points, and obtains a clear preference of 1.77 CMOS points compared to G.711, even en par with uncoded speech.

keras cnn generative-adversarial-network gan post-processing mfcc speech-processing wasserstein-gan speech-enhancement speech-reconstruction 1d-convolution

Updated Mar 8, 2020
Python

xflywind / scim

Star

[wip]Speech recognition tool-box written by Nim. Based on Arraymancer.

audio nim wav speech-recognition scientific-computing digital-signal-processing mfcc speech-processing speech-analysis arraymancer

Updated Nov 10, 2019
Nim

sheelabhadra / Emergency-Vehicle-Detection

Star

Python implementation of papers on Emergency Vehicle Detection

machine-learning neural-network self-driving-car mfcc emergency-response audio-processing pitch-detection wavelets

Updated Jun 27, 2019
Jupyter Notebook

SuperKogito / Voice-based-speaker-identification

Star

🔉

👦

👧

👩

👨 Speaker identification using voice MFCCs and GMM

machine-learning scikit-learn voice speech gaussian-mixture-models signal gmm mfcc speaker-recognition vocal mel-frequencies speaker-identification mel-frequency-cepstral-coefficients scikit-learn-python

Updated May 5, 2019
Python

FragIt / fragit-main

Star

FragIt main repository

python fragments molecule mfcc

Updated Apr 1, 2020
Python

alicex2020 / Mandarin-Tone-Classification

Star

Deep learning using CNN for Mandarin Chinese tone classification

deep-neural-networks recognition deep-learning cnn classification mfcc pitch cnn-keras f0 spectrograms

Updated Apr 5, 2019
Jupyter Notebook

amitchone / ASR

Star

A Python 2.7 implementation of Mel Frequency Cepstral Coefficients (MFCC) and Dynamic Time Warping (DTW) algorithms for Automated Speech Recognition (ASR).

python dtw automatic-speech-recognition mfcc

Updated Apr 23, 2018
Python

GuitarsAI / BasicsMusicalInstrumClassifi

Star

Basics of Musical Instruments Classification using Machine Learning

machine-learning deep-learning svm music-information-retrieval mfcc musical-instruments-classification

Updated Apr 2, 2019
Jupyter Notebook

lzm0706 / DTW-Speech-Recognition

Star

Using MFCC feature and DTW algorithm to recognize rumber 0-9

Updated Nov 20, 2017
Python

orbxball / timit-preprocessor

Star

Extract mfcc vectors and phones from TIMIT dataset

deep-learning phone speech-recognition data-preprocessing mfcc timit-dataset timit

Updated Mar 20, 2018
Shell

Improve this page

Add a description, image, and links to the mfcc topic page so that developers can more easily learn about it.

Curate this topic

Add this topic to your repo

To associate your repository with the mfcc topic, visit your repo's landing page and select "manage topics."

You can’t perform that action at this time.