Monday, July 18, 2011

Sounds like...

I went to Gamefest 2011, and although I'm no longer really an audio programmer, I attended two of the audio presentations on the second day. The first one was on voice recognition on the 360, and even though it was really more of a middleware sales pitch dressed up in tech speak, it was quite interesting.

For me the highlights were the different things I hadn't really come across before:
Use of Mel-frequency cepstral coefficients (MFCCs) for phoneme detection
Use of the Fisher kernel or a polynomial kernel to make the support vector machine learning work
Use of the Viterbi algorithm to provide a good result from the Hidden Markov Model of the phonemes
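
To make the Viterbi item a bit more concrete for myself, here is a minimal sketch of the algorithm picking the most likely hidden state sequence from an HMM. This is not what the presenters showed: the two "phoneme" states (`ah`, `ee`), the three acoustic symbols (`low`, `mid`, `high`) and every probability below are made-up toy values, and a real recogniser would work on MFCC feature vectors and log probabilities rather than a handful of discrete symbols.

```python
def viterbi(observations, states, start_p, trans_p, emit_p):
    """Return (probability, path) for the most likely hidden-state sequence."""
    # best[t][s] = (probability of the best path ending in s at time t, predecessor)
    first = observations[0]
    best = [{s: (start_p[s] * emit_p[s][first], None) for s in states}]

    for obs in observations[1:]:
        prev = best[-1]
        column = {}
        for s in states:
            # Choose the predecessor r that maximises the path probability into s.
            p, r = max((prev[r][0] * trans_p[r][s], r) for r in states)
            column[s] = (p * emit_p[s][obs], r)
        best.append(column)

    # Backtrack from the most probable final state.
    last = max(states, key=lambda s: best[-1][s][0])
    prob = best[-1][last][0]
    path = [last]
    for column in reversed(best[1:]):
        path.append(column[path[-1]][1])
    path.reverse()
    return prob, path


# Toy model: all names and numbers are hypothetical, for illustration only.
STATES = ["ah", "ee"]
START_P = {"ah": 0.6, "ee": 0.4}
TRANS_P = {
    "ah": {"ah": 0.7, "ee": 0.3},
    "ee": {"ah": 0.4, "ee": 0.6},
}
EMIT_P = {
    "ah": {"low": 0.5, "mid": 0.4, "high": 0.1},
    "ee": {"low": 0.1, "mid": 0.3, "high": 0.6},
}

prob, path = viterbi(["low", "mid", "high"], STATES, START_P, TRANS_P, EMIT_P)
print(path, prob)
```

With these toy numbers the best path comes out as `ah`, `ah`, `ee`. The point is that the states themselves are never observed directly, only the symbols they emit, and Viterbi keeps just the single best path into each state at each step instead of enumerating every possible sequence.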

I've only previously used Markov chains for compression, and I mistakenly thought a Hidden Markov Model was the same thing. Back to my university books on ANNs, I guess.
