Split spectral features and kNN into two separate notebooks, added them to the table of contents

ef2738ab · Leigh Smith · a7b27e87 · ef2738ab · ef2738ab · ef2738ab
Commit ef2738ab authored Jun 21, 2014 by Leigh Smith
4 changed files
--- a/Table_of_Contents.ipynb
+++ b/Table_of_Contents.ipynb
 {
 "metadata": {
  "name": "",
-  "signature": "sha256:a1406d5242149874f7c7a56f0d518aa515d43d2b2c134746256a62c1bc20a53b"
+  "signature": "sha256:fee871f8d8f5b66df2844daf79ffb4871bb3968bdedf0dfc5fe58bc2ef4bc7b6"
 },
 "nbformat": 3,
 "nbformat_minor": 0,
@@ -34,9 +34,9 @@
      "\n",
      "1.  [Example: Zero-Crossing Rate](notebooks/zcr.ipynb)\n",
      "\n",
-      "1.  [Spectral Features](notebooks/)\n",
+      "1.  [Spectral Features](notebooks/spectral_features.ipynb)\n",
      "\n",
-      "1.  [K-Nearest Neighbor](notebooks/)\n",
+      "1.  [K-Nearest Neighbor](notebooks/knn.ipynb)\n",
      "\n",
      "### Day 2\n",
      "\n",

--- a/notebooks/cross_validation.ipynb
+++ b/notebooks/cross_validation.ipynb
 {
 "metadata": {
  "name": "",
-  "signature": "sha256:92cb9f5bc7783db462ab80b4382c5ba9e78c9d448ea4751fd09cfdd52648a0b7"
+  "signature": "sha256:14bd75857e4b8f9809ba4d5f05dc624688b7b01756fb2f49422023301d45173f"
 },
 "nbformat": 3,
 "nbformat_minor": 0,
@@ -25,13 +25,12 @@
      "2. 1 test set is tested using the classifier trained on the remaining 9.\n",
      "3. We then do test/train on all of the other sets and average the percentages. \n",
      "\n",
-      "To achieve the first step (divide our training set into k disjoint subsets), use the function [Kfold](http://scikit-learn.org/stable/modules/generated/sklearn.cross_validation.KFold.html) in the scikit.learn cross_validation package.\n",
+      "To achieve the first step (divide our training set into k disjoint subsets), use the function [Kfold](http://scikit-learn.org/stable/modules/generated/sklearn.cross_validation.KFold.html) in the scikit.learn cross_validation package:\n",
      "\n",
      "    K-Folds cross validation iterator.\n",
      "    Provides train/test indices to split data in train test sets. Split dataset into k consecutive folds (without shuffling).\n",
      "\n",
-      " You can visit the scikit.learn documentation to look at all the other options. This code is also posted as a template in \n",
+      " You can visit the scikit.learn documentation to look at all the other options."
-      " `/usr/ccrma/courses/mir2014/Toolboxes/crossValidationTemplate.py`  "
     ]
    },
    {
@@ -88,6 +87,13 @@
     "language": "python",
     "metadata": {},
     "outputs": []
+    },
+    {
+     "cell_type": "markdown",
+     "metadata": {},
+     "source": [
+      "This code is also posted as a template in `sources/crossValidationTemplate.py`  "
+     ]
    }
   ],
   "metadata": {}

--- a/notebooks/spectral_features_knn.ipynb
+++ b/notebooks/spectral_features_knn.ipynb
--- a/notebooks/spectral_features.ipynb
+++ b/notebooks/spectral_features.ipynb
+{
+ "metadata": {
+  "name": "",
+  "signature": "sha256:6d4d8f02ec3a6e31bac7c35b68acdd27227e96b152d2e69d76283b9c7e1b31a5"
+ },
+ "nbformat": 3,
+ "nbformat_minor": 0,
+ "worksheets": [
+  {
+   "cells": [
+    {
+     "cell_type": "markdown",
+     "metadata": {},
+     "source": [
+      "Spectral Features\n",
+      "-----------------\n",
+      "\n",
+      "For classification, we're going to be using the new features in our arsenal: cherishing those \"spectral moments\" (centroid, bandwidth, skewness, kurtosis) and also examining other spectral statistics."
+     ]
+    },
+    {
+     "cell_type": "markdown",
+     "metadata": {},
+     "source": [
+      "### Training Data\n",
+      "\n",
+      "First off, we want to analyze and feature extract a small collection of audio samples - storing their feature data as our \"training data\".  The commands below read all of the drum example .wav files from the MIR web site into an array, snareFileList.  \n",
+      "\n",
+      "First we define a function to retrieve a list of URLs from a text file."
+     ]
+    },
+    {
+     "cell_type": "code",
+     "collapsed": false,
+     "input": [
+      "import urllib2\n",
+      "\n",
+      "def process_corpus(corpus_URL):\n",
+      "    \"\"\"Read a list of files to process from the text file at corpusURL. Return a list of URLs\"\"\" \n",
+      "    # Open and read each line\n",
+      "    url_list_text_data = urllib2.urlopen(corpus_URL) # it's a file like object and works just like a file\n",
+      "    for file_URL in url_list_text_data: # files are iterable\n",
+      "        yield file_URL.rstrip()"
+     ],
+     "language": "python",
+     "metadata": {},
+     "outputs": [],
+     "prompt_number": 1
+    },
+    {
+     "cell_type": "markdown",
+     "metadata": {},
+     "source": [
+      "Use these commands to read in a list of filenames (samples) in a directory, replacing the URL with a URL to a list of URLs (one per line) indicating where the audio / drum samples are stored."
+     ]
+    },
+    {
+     "cell_type": "code",
+     "collapsed": false,
+     "input": [
+      "snares_URL = \"https://ccrma.stanford.edu/workshops/mir2014/SnareCorpus.txt\"\n",
+      "snare_file_list = [audio_file_URL for audio_file_URL in process_corpus(snares_URL)]"
+     ],
+     "language": "python",
+     "metadata": {},
+     "outputs": [],
+     "prompt_number": 2
+    },
+    {
+     "cell_type": "code",
+     "collapsed": false,
+     "input": [
+      "kicks_URL = \"https://ccrma.stanford.edu/workshops/mir2014/KickCorpus.txt\"\n",
+      "kick_file_list = [audio_file_URL for audio_file_URL in process_corpus(kicks_URL)]"
+     ],
+     "language": "python",
+     "metadata": {},
+     "outputs": [],
+     "prompt_number": 3
+    },
+    {
+     "cell_type": "markdown",
+     "metadata": {},
+     "source": [
+      "To access the filenames contained in the array, use the square brackets [ ] to get to the element that you want to access. For example, to access the text URL file name of the first file in the list, you would type:"
+     ]
+    },
+    {
+     "cell_type": "code",
+     "collapsed": false,
+     "input": [
+      "snare_URL = snare_file_list[0]\n",
+      "snare_URL"
+     ],
+     "language": "python",
+     "metadata": {},
+     "outputs": [
+      {
+       "metadata": {},
+       "output_type": "pyout",
+       "prompt_number": 4,
+       "text": [
+        "'https://ccrma.stanford.edu/workshops/mir2014/audio/drum%20samples/snares/SNARE_01_01.WAV'"
+       ]
+      }
+     ],
+     "prompt_number": 4
+    },
+    {
+     "cell_type": "markdown",
+     "metadata": {},
+     "source": [
+      "When we feature extract a sample collection, we need to sequentially access audio files, segment them (or not), and feature extract them.  Loading a lot of audio files into memory is not always a feasible or desirable operation, so you will create a loop which loads an audio file, feature extracts it, and closes  the audio file.  Note that the only information that we retain in memory are the features that are extracted.\n",
+      "\n",
+      "Create a loop which reads in an audio file, extracts the zero crossing rate, and some spectral statistics. You can use the \"in\" operator to retrieve each audio file URL from process_corpus(), as used above. The feature information for each audio file (the \"feature vector\") should be stored as a feature array, with columns being the features and rows for each file. For example:\n",
+      "\n",
+      "        featuresSnare =\n",
+      "\n",
+      "             0.5730    1.9183    2.9713    0.0004 0.0002\n",
+      "             0.4750    1.4834    2.4463    0.0004  0.0012\n",
+      "             0.5900    2.2857    3.1788    0.0003  0.0041\n",
+      "             0.5090    1.6622    2.6369    0.0004  0.0051\n",
+      "             0.4860    1.4758    2.2085    0.0004  0.0021\n",
+      "             0.6060    2.2119    3.2798    0.0004  0.0651\n",
+      "             0.4990    2.0607    2.7654    0.0004  0.0721\n",
+      "             0.6360    2.3153    3.0256    0.0003  0.0221\n",
+      "             0.5490    2.0137    3.0342    0.0004  0.0016\n",
+      "             0.5900    2.2857    3.1788    0.0003  0.0012\n",
+      " \n",
+      " Within your loop, here's a reminder how to read in your wav files, using an array of audio file URLs:"
+     ]
+    },
+    {
+     "cell_type": "code",
+     "collapsed": false,
+     "input": [
+      "import urllib\n",
+      "from essentia.standard import MonoLoader\n",
+      "\n",
+      "sample_rate = 44100\n",
+      "urllib.urlretrieve(snare_URL, filename='/tmp/localfile.wav')\n",
+      "audio = MonoLoader(filename = '/tmp/localfile.wav', sampleRate = sample_rate)()"
+     ],
+     "language": "python",
+     "metadata": {},
+     "outputs": [],
+     "prompt_number": 5
+    },
+    {
+     "cell_type": "markdown",
+     "metadata": {},
+     "source": [
+      "Here's an example of how to feature extract the first frame from the current audio file..."
+     ]
+    },
+    {
+     "cell_type": "code",
+     "collapsed": false,
+     "input": [
+      " frameSize = 0.100 * sample_rate   # 100ms\n",
+      " currentFrame = audio[0 : frameSize]\n",
+      " # featuresSnare[i, 0] = zcr(currentFrame)\n",
+      " # centroid, bandwidth, skew, kurtosis = spectralMoments(currentFrame, sample_rate, 8192)\n",
+      " # featuresSnare[i, 1:4] = [centroid, bandwidth, skew, kurtosis]"
+     ],
+     "language": "python",
+     "metadata": {},
+     "outputs": [],
+     "prompt_number": 14
+    },
+    {
+     "cell_type": "markdown",
+     "metadata": {},
+     "source": [
+      "4.  First, extract all of the feature data for the kick drums and store it in a feature array.  (For my example, above, I'd put it in \"featuresKick\")\n",
+      "\n",
+      "5.  Next, extract all of the feature data for the snares, storing them in a different array. \n",
+      "Again, the kick and snare features should be separated in two different arrays!\n",
+      " \n",
+      "OK, no more help.  The rest is up to you!"
+     ]
+    }
+   ],
+   "metadata": {}
+  }
+ ]
+}
\ No newline at end of file