<p>Auditory processing is a very complex task. Human evolution has produced a system so good that we don't realize how good it is. If three people are talking to you at the same time, you can focus on one signal and discard the others, even if they are louder. Noise is discarded very well too. In fact, if you hear a human voice played backwards, the first stages of the auditory system will send this signal to a different processing area than they would a real speech signal, because the system regards it as "non-voice". This is an example of the outstanding abilities humans have.</p> <p>Speech recognition advanced quickly from the 1970s onward because researchers were studying the production of voice. This is a simpler system: vocal cords excited or not, resonance of the vocal tract... it is a mechanical system that is easy to understand. The main product of this approach is <a href="http://documents.wolfram.com/applications/signals/CepstralAnalysis.html" rel="noreferrer">cepstral analysis</a>. This allowed automatic speech recognition (ASR) to achieve acceptable results. But it is a sub-optimal approach. Noise separation is quite poor: although it works more or less in clean environments, it is not going to work with loud music in the background, not the way humans do.</p> <p>The optimal approach depends on understanding the auditory system: its first stages in the cochlea, the inferior colliculus... but the brain is involved too, and we don't know much about that yet. It is proving to be a difficult paradigm shift.</p> <p>Professor Hynek Hermansky, in <a href="http://www.asp.ogi.edu/publications/pdf/hermansky_france97_1.pdf" rel="noreferrer">a paper</a>, compared the current state of the research with the time when humans wanted to fly. We didn't know what the secret was (the feathers? the flapping wings?) until we discovered Bernoulli's principle.</p>
 
