Hypothesis space

Home Forums OpenEars Hypothesis space

Tagged: 

Viewing 9 posts - 1 through 9 (of 9 total)

  • Author
    Posts
  • #4079
    akaniklaus
    Participant

    Dear Halle,

    For the current project that I am working now. I need to get the set of all hypothesis for each recognized word, together with their confidence (I think this should be calculated using prior probabilities) values.

    Do you think that this is possible? If it is, can you please help me out? I am also starting to try out but I wanted to ask you in advance as you were very helpful on my previous question.

    I will also write here if I have a problem or successful attempt in order to help other users of OpenEars having similar questions.

    Thank you very much…

    Sincerely,

    #4080
    Halle Winkler
    Politepix

    Hi akaniklaus,

    Can you describe more specifically which aspect of that task you are having difficulty with and in what area of the implementation? If I were doing this I guess I would put the utterances in an NSMutableDictionary as they come in, or possible write them out to a plist or some combination of the above depending on the eventual use of the data. I presume that you know the expected recognition, and the utterance id (a good key for the dictionary entry), the hypothesis, and the confidence score are all delivered by the OpenEarsEventsObserver method.

    #4081
    akaniklaus
    Participant

    Dear Halle,

    Sorry, i just had a baby yesterday; so, I did not have time to check it in detail.

    I do not think that I will have difficulties on how to use the data once I get it.

    pocketsphinxDidReceiveHypothesis callback function of the OpenEarsEventsObserver, seems to deliver only the hypothesis with the highest prior probability (for each utterance)

    However, I want to learn all of hypotheses together with their confidence values to reduce the effect of the word recognition error rate in my research.

    Is there any function of OpenEars that I can do that? Maybe not all of them, but N-best hypothesis for each recognized word would also be appreciated.

    In other words, I need to get the alternatives for each recognized word (similar to a confusion set).

    #4082
    Halle Winkler
    Politepix

    Congratulations!

    OK, this is actually pretty far outside of the OpenEars purview since it isn’t really designed as a research tool that propagates every available Pocketsphinx function through to an Objective-C interface, but it should be no problem getting it answered as a Pocketsphinx question over in the Sphinx forums as long as you are very precise:

    http://sourceforge.net/p/cmusphinx/discussion/help/

    I answer these ones occasionally when it’s fairly self-evident in the Pocketsphinx codebase, but this one is not to me (and if I’m not mistaken, it also looks like it might be different for LMs versus FSGs) so it’s probably a better call to go to the source. Once you know the C implementation details, I can help you with the details of returning the data through OpenEars if needed.

    #4083
    akaniklaus
    Participant

    Thank you very much.

    Then, I guess that I will try it with the python wrapper of Pocketsphinx first, to understand which functions are useful because it is not very clear in its codebase as you also said.

    I understood how to return the data through OpenEars thanks to your very well commented code, I do not think that it will be problem. Just need to solve C implementation details.

    Have a nice day. Sincerely,

    #4084
    edgecase
    Participant

    Hi akaniklaus & Halle, I just wanted to say I’d also like to use the multiple hypotheses returned by sphinx, through OpenEars if possible, so if you do get this figured out please post a follow-up – even if it’s not in the official release I’d be interested to help test :)

    #4085
    Halle Winkler
    Politepix

    OK, I’m still up for taking a look and maybe posting sample code for returning it through OpenEars if akaniklaus has some known-working C code for this later. This is unlikely to become a feature of OpenEars because it’s (IMO) esoteric, but it is probably easy to patch in a new OpenEarsEventsObserver delegate method.

    #8705
    rem1974
    Participant

    Hi all,
    Would be great to know if you managed to get the n-best list and scores, and not only one hypothesys and score.

    Please update if you did.

    #1016414
    Halle Winkler
    Politepix

    Just following up here that OpenEars has had N-Best hypotheses and scoring since version 1.1.

Viewing 9 posts - 1 through 9 (of 9 total)
  • You must be logged in to reply to this topic.