Detect speech, record it, then chop it up by word.

Home Forums OpenEars Detect speech, record it, then chop it up by word.

Viewing 2 posts - 1 through 2 (of 2 total)

  • Author
    Posts
  • #1021512
    geareddev
    Participant

    Hi,

    I would like to detect speech(OpenEars), record it (SaveThatWave), and then chop the recorded file up by word. I can manage the actual chopping up of sounds, but I’d need the timestamps for each detected word so that I knew where to cut. Do any of the politepix products provide that information?

    Thank You

    #1021515
    Halle Winkler
    Politepix

    Hi,

    RapidEars has an API for receiving start times and end times from individual words:

    - (void) rapidEarsDidDetectLiveSpeechAsWordArray:(NSArray *) words
    scoreArray:		(NSArray *) 	scores
    startTimeArray:		(NSArray *) 	startTimes
    endTimeArray:		(NSArray *) 	endTimes 
    
    - (void) rapidEarsDidDetectFinishedSpeechAsWordArray:(NSArray *) 	words
    scoreArray:		(NSArray *) 	scores
    startTimeArray:		(NSArray *) 	startTimes
    endTimeArray:		(NSArray *) 	endTimes 
    

    You have to also set these setters to TRUE where you set up PocketsphinxController:

    - (void) setReturnSegments:(BOOL)returnSegments; 
    - (void) setReturnSegmentTimes:(BOOL)returnSegmentTimes; 
    

    I’ve never attempted to use these times to do a spot-edit of a SaveThatWave file, and my assumption is that they will be accurate but have some kind of offset since SaveThatWave almost certainly doesn’t use the same instant to start its timing as RapidEars correlates its timing to.

Viewing 2 posts - 1 through 2 (of 2 total)
  • You must be logged in to reply to this topic.