Please check this answer, which describes a few approaches to the same problem. Given that bird song is a monophonic signal (only one fundamental frequency at any point in time - as opposed to polyphonic) - and given that the timbre is irrelevant, the most interesting feature to extract for this classification task is a pitch contour.
2 of 2
replaced http://dsp.stackexchange.com/ with https://dsp.stackexchange.com/
pichenettes
- 19.5k
- 1
- 51
- 69