Dear people,
I’m an iOS developer with little knowledge of audio signal processing. I’m looking for a method that will take a piece of speech audio recorded by my app, and returns the position of transients in that audio.
I hope/expect that these positions will indicate the start of a new word.
I know exactly what words will be spoken, I just need to split up the file into individual sound files.
I’ve done quite a lot of internet searching, but I can only find extremely mathematical documents that are really way beyond my comprehension.
I would figure though, that word/transient detection is one of the core tasks of OpenEar, and I was wondering if such functionality would be available for such usage in your library.
If not, I would be very grateful, if you could point me in a direction that, given my profile, could help me out.
Sincerely yours
Sjakelien