HomeForumsOpenEarsaudio recognition not speech

This topic has 2 voices, contains 1 reply, and was last updated by  Joseph S. Wisniewski 239 days ago.

Viewing 2 posts - 1 through 2 (of 2 total)
Author Posts
Author Posts
September 22, 2011 at 4:10 pm #7611

uncleunvoid

Trying to find out about audio recognition of sounds, like ball hitting wall, etc. Anyone and directions to point me towards of if OpenEars can be tweaked?

September 22, 2011 at 6:50 pm #7612

Joseph S. Wisniewski

It’s possible, but very difficult. You’d have to tweak the underlying Sphinx recognizer.

Sphinx is normally set up as a “cepstral” recognizer. That’s an approach where it tries to recognize the process that makes a sound, when the process (for example, the passage of sound through the throat, mouth, teeth, and lips) is more important than the sound itself.

You’re after the signature of the sound itself. I remember a paper from Avery Wang on how the “Shazam” music recognizer worked that described this.

To do it with Sphinx, you need to recognize the basic sound, so you need to set up the Sphinx front end to use “energy” features, not “cepstral” and delta cepstral features. Then you need to collect some sounds of a ball hitting a wall and train a Sphinx acoustic model to recognize that.

Good luck, and have fun.

Viewing 2 posts - 1 through 2 (of 2 total)

You must be logged in to reply to this topic.