Mel Frequency Cepstral Coefficients (MFCCs)¶

In [3]:

x, fs = librosa.load('simple_loop.wav')
librosa.display.waveplot(x, sr=fs)

Out[3]:

<matplotlib.collections.PolyCollection at 0x10ecb5fd0>

In [4]:

IPython.display.Audio(x, rate=fs)

Out[4]:

`librosa.feature.mfcc`¶

In [5]:

mfccs = librosa.feature.mfcc(x, sr=fs)
print mfccs.shape

(20, 130)

In [6]:

librosa.display.specshow(mfccs, sr=fs, x_axis='time')

Out[6]:

<matplotlib.image.AxesImage at 0x106318c50>

Feature Scaling¶

In [7]:

mfccs = sklearn.preprocessing.scale(mfccs, axis=1)
print mfccs.mean(axis=1)
print mfccs.var(axis=1)

[ -4.64585635e-16  -1.63971401e-16  -1.09314267e-16  -1.09314267e-16
   0.00000000e+00   1.09314267e-16   0.00000000e+00  -1.09314267e-16
  -1.09314267e-16  -2.73285668e-17   1.09314267e-16  -8.19857003e-17
   5.46571335e-17   0.00000000e+00   2.73285668e-17  -4.09928501e-17
   1.09314267e-16   8.19857003e-17   9.56499837e-17   6.83214169e-17]
[ 1.  1.  1.  1.  1.  1.  1.  1.  1.  1.  1.  1.  1.  1.  1.  1.  1.  1.
  1.  1.]

In [8]:

librosa.display.specshow(mfccs, sr=fs, x_axis='time')

Out[8]:

<matplotlib.image.AxesImage at 0x1108b8890>

Mel Frequency Cepstral Coefficients (MFCCs)¶

librosa.feature.mfcc¶

Feature Scaling¶

essentia.standard.MFCC¶

`librosa.feature.mfcc`¶

`essentia.standard.MFCC`¶