Note that there are some explanatory texts on larger screens.

plurals
  1. POBirdsong audio analysis - finding how well two clips match
    text
    copied!<p>I have ~100 wav audio files at sample rate of 48000 of birds of the same species I'd like to measure the similarity between. I'm starting with wave files, but I know (very slightly) more about working with images, so I assume my analysis will be on the spectrogram images. I have several sample of some birds from different days.</p> <p>Here are some example of the data, along with (apologies for unlabeled axes; x is sample, y is linear frequency times something like 10,000 Hz): <img src="https://i.stack.imgur.com/Cf7fa.png" alt="alt text"> These birdsongs apparently occur in "words", distinct segments of song which is probably the level at which I ought to be comparing; both differences between similar words and the frequency and order of various words. <img src="https://i.stack.imgur.com/nPGZa.png" alt="alt text"></p> <p>I want to try to take out cicada noise - cicadas chirp with pretty consistent frequency, and tend to phase-match, so this shouldn't be too hard. <img src="https://i.stack.imgur.com/cKoLn.png" alt="alt text"></p> <p>It seems like some thresholding might be useful.</p> <p>I'm told that most of the existing literature uses manual classification based on song characteristics, like Pandora Music Genome Project. I want to be like <a href="http://the.echonest.com/platform/" rel="nofollow noreferrer">Echo Nest</a>; using automatic classification. Update: A lot of people do study this.</p> <p>My question is what tools should I use for this analysis? I need to:</p> <ul> <li>Filter/threshold out general noise and keep the music</li> <li>Filter out specific noises like of cicadas</li> <li>Split and classify phrases, syllables, and/or notes in birdsongs</li> <li>Create measures of difference/similarity between parts; something which will pick up differences between birds, minimizing differences between different calls of the same bird</li> </ul> <p>My weapon of choice is numpy/scipy, but might something like openCV might be useful here?</p> <p>Edit: updated my terminology and reworded approach after some research and Steve's helpful answer.</p>
 

Querying!

 
Guidance

SQuiL has stopped working due to an internal error.

If you are curious you may find further information in the browser console, which is accessible through the devtools (F12).

Reload