Detecting Speech transients

Tagged: transient detection

This topic has 2 replies, 2 voices, and was last updated 10 years, 11 months ago by Sjakelien.

Viewing 3 posts - 1 through 3 (of 3 total)

Advertisement: “Don't want OpenEars™ to guess one of your vocabulary words when it hears an unknown word? Rejecto can help!”

Author

Posts
May 8, 2013 at 3:12 pm #1017174

Sjakelien
Participant

Dear people,
I’m an iOS developer with little knowledge of audio signal processing. I’m looking for a method that will take a piece of speech audio recorded by my app, and returns the position of transients in that audio.
I hope/expect that these positions will indicate the start of a new word.
I know exactly what words will be spoken, I just need to split up the file into individual sound files.
I’ve done quite a lot of internet searching, but I can only find extremely mathematical documents that are really way beyond my comprehension.
I would figure though, that word/transient detection is one of the core tasks of OpenEar, and I was wondering if such functionality would be available for such usage in your library.
If not, I would be very grateful, if you could point me in a direction that, given my profile, could help me out.
Sincerely yours
Sjakelien

May 8, 2013 at 3:19 pm #1017175

Halle Winkler
Politepix

Welcome,

Sorry, I don’t have any info for you. Have you asked at the CMU Sphinx forum about whether any variations of the Sphinx project have transient detection abilities as part of their API? If Pocketsphinx does, it is probably possible to modify OpenEars to support this (but you would have to undertake that on your own).

May 8, 2013 at 3:29 pm #1017176

Sjakelien
Participant

No, thanks, I will give it a try.
Author

Posts

Viewing 3 posts - 1 through 3 (of 3 total)

You must be logged in to reply to this topic.