<p>You could try <a href="http://code.google.com/p/pyspeech/" rel="nofollow noreferrer">PySpeech</a>. For more information, see <a href="https://stackoverflow.com/questions/4611373/pyspeech-python-transcribe-mp3-files">pyspeech (python) - Transcribe mp3 files?</a>. I have never used it, but I believe it leverages the built-in speech recognition engine of Windows. That would let you convert the WAV files to text and then compare the text.</p>
<p>To use the Windows speech engine with a WAV file as input, there are two requirements:</p>
<ol>
<li>Use an inproc recognizer (SpeechRecognitionEngine). Shared recognizers cannot use WAV files as input.</li>
<li>On the recognizer object, call SetInputToWaveFile to specify your input WAV file.</li>
</ol>
<p>You may have to resample the WAV files, because the speech recognition engines only support certain sample rates. The following format works well on Windows:</p>
<ul>
<li>8 bits per sample</li>
<li>single channel mono</li>
<li>22,050 samples per second</li>
<li>PCM encoding</li>
</ul>
<p>See <a href="https://stackoverflow.com/a/6203533/90236">https://stackoverflow.com/a/6203533/90236</a> for more information.</p>
<p>For more background on the Windows speech engines, you might take a look at <a href="https://stackoverflow.com/questions/4213035/sapi-and-windows-7-problem/4217638#4217638">SAPI and Windows 7 Problem</a> and <a href="https://stackoverflow.com/questions/2977338/what-is-the-difference-between-system-speech-recognition-and-microsoft-speech-re">What is the difference between System.Speech.Recognition and Microsoft.Speech.Recognition?</a></p>
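<p>Before resampling anything, it may help to check whether a file already matches the format listed above. The following is a minimal sketch that uses only Python's standard <code>wave</code> module; the helper names <code>describe_wav</code> and <code>needs_resampling</code> are hypothetical and are not part of PySpeech or the Windows speech API.</p>

```python
import wave

# Target format taken from the list above:
# 8 bits per sample, mono, 22,050 samples per second, PCM.
TARGET_CHANNELS = 1
TARGET_SAMPLE_WIDTH_BYTES = 1  # 1 byte = 8 bits per sample
TARGET_FRAME_RATE = 22050
TARGET_COMPRESSION = "NONE"    # how the wave module reports uncompressed PCM


def describe_wav(path):
    """Return (channels, sample_width_bytes, frame_rate, compression_type).

    The wave module only reads uncompressed PCM, so a file in another
    encoding raises wave.Error instead of returning a description.
    """
    with wave.open(path, "rb") as wav:
        return (
            wav.getnchannels(),
            wav.getsampwidth(),
            wav.getframerate(),
            wav.getcomptype(),
        )


def needs_resampling(path):
    """Return True if the file differs from the target format."""
    target = (
        TARGET_CHANNELS,
        TARGET_SAMPLE_WIDTH_BYTES,
        TARGET_FRAME_RATE,
        TARGET_COMPRESSION,
    )
    return describe_wav(path) != target
```

<p>This only reports whether a conversion is needed; it does not resample. Calling <code>needs_resampling("input.wav")</code> on each file (the file name here is a placeholder) shows which ones to convert with an audio tool of your choice before handing them to the recognizer.</p>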
 
