For example, the image on the left is a spectrogram of me whistling up and down slowly. Read the graph from the bottom up to see the slow rise and fall in frequency as I whistle.
To detect this sequence, I just need to look for a series of strong intensities in the FFT output at the rising and falling frequencies of the whistle.
From analyzing the spectrogram I can see the notes I want to detect are highest in intensity between these frequencies:
- Note 1: 1,723 to 1,758 Hz
- Note 2: 1,934 to 1,969 Hz
- Note 3: 1,512 to 1,546 Hz
- Note 4: 738 to 773 Hz
- Note 5: 1,125 to 1,160 Hz
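To make the idea concrete, here is a minimal Python sketch of how a detector could step through these five bands in order. It is not the code running on the hardware. The sample rate (9,000 Hz), the frame size (256 samples), the threshold value, and every function and class name are assumptions chosen for illustration. To stay dependency-free, it computes only the DFT bins it needs directly, rather than running a full FFT.

```python
import cmath
import math

SAMPLE_RATE_HZ = 9000  # assumption: not stated in the text above
FFT_SIZE = 256         # assumption: samples per analysis frame
THRESHOLD = 0.1        # assumption: tune for real microphone input

# Note bands read off the spectrogram, as (low Hz, high Hz).
NOTE_BANDS_HZ = [(1723, 1758), (1934, 1969), (1512, 1546), (738, 773), (1125, 1160)]


def band_to_bins(low_hz, high_hz):
    """Map a frequency band to the range of frequency bins that cover it."""
    bin_width = SAMPLE_RATE_HZ / FFT_SIZE
    return range(round(low_hz / bin_width), round(high_hz / bin_width) + 1)


def bin_magnitude(samples, k):
    """Normalized magnitude of DFT bin k (the value an FFT would give for that bin)."""
    n = len(samples)
    total = sum(s * cmath.exp(-2j * math.pi * k * i / n) for i, s in enumerate(samples))
    return abs(total) / n


def band_intensity(samples, low_hz, high_hz):
    """Average magnitude across the bins covering one note band."""
    bins = band_to_bins(low_hz, high_hz)
    return sum(bin_magnitude(samples, k) for k in bins) / len(bins)


class SequenceDetector:
    """Hypothetical detector: advances one step each time the expected note is heard."""

    def __init__(self, bands=NOTE_BANDS_HZ, threshold=THRESHOLD):
        self.bands = bands
        self.threshold = threshold
        self.position = 0  # index of the note we are waiting for

    def update(self, samples):
        """Feed one frame of samples; return True when the last note completes the sequence."""
        low_hz, high_hz = self.bands[self.position]
        if band_intensity(samples, low_hz, high_hz) >= self.threshold:
            self.position += 1
            if self.position == len(self.bands):
                self.position = 0
                return True
        return False


def make_tone(freq_hz, amplitude=1.0):
    """Synthetic sine-wave frame, for trying the detector without a microphone."""
    return [amplitude * math.sin(2 * math.pi * freq_hz * i / SAMPLE_RATE_HZ)
            for i in range(FFT_SIZE)]
```

To try it, feed `SequenceDetector.update` one frame per note, for example `make_tone(1740)` for note 1, and check for `True` on the fifth note. A real detector would probably also want a timeout that resets `position` when the next note takes too long to arrive. That is left out here to keep the sketch short.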
The video below shows the tone sequence detection with the answer. As you hear the tones played, look at the spectrogram to see how each note relates to the output on the graph.
Continue on to look at another application of the Fourier transform, attempting to detect cat purrs.