Detect speech, record it, then chop it up by word.

This topic has 1 reply, 2 voices, and was last updated 9 years, 10 months ago by Halle Winkler.

Viewing 2 posts - 1 through 2 (of 2 total)

Advertisement: “RapidEars is an OpenEars™ plugin that lets you perform speech recognition while the user is still speaking!”

Author

Posts
June 4, 2014 at 7:12 pm #1021512

geareddev
Participant

Hi,

I would like to detect speech(OpenEars), record it (SaveThatWave), and then chop the recorded file up by word. I can manage the actual chopping up of sounds, but I’d need the timestamps for each detected word so that I knew where to cut. Do any of the politepix products provide that information?

Thank You

June 4, 2014 at 7:28 pm #1021515
Halle Winkler
Politepix
Hi,

RapidEars has an API for receiving start times and end times from individual words:
```
- (void) rapidEarsDidDetectLiveSpeechAsWordArray:(NSArray *) words
scoreArray:		(NSArray *) 	scores
startTimeArray:		(NSArray *) 	startTimes
endTimeArray:		(NSArray *) 	endTimes 

- (void) rapidEarsDidDetectFinishedSpeechAsWordArray:(NSArray *) 	words
scoreArray:		(NSArray *) 	scores
startTimeArray:		(NSArray *) 	startTimes
endTimeArray:		(NSArray *) 	endTimes 
```
You have to also set these setters to TRUE where you set up PocketsphinxController:
```
- (void) setReturnSegments:(BOOL)returnSegments; 
- (void) setReturnSegmentTimes:(BOOL)returnSegmentTimes; 
```
I’ve never attempted to use these times to do a spot-edit of a SaveThatWave file, and my assumption is that they will be accurate but have some kind of offset since SaveThatWave almost certainly doesn’t use the same instant to start its timing as RapidEars correlates its timing to.
Author

Posts

Viewing 2 posts - 1 through 2 (of 2 total)

You must be logged in to reply to this topic.