Dictionary for OpenEars.

    #1017590
    priya.bagrecha13
    Participant

    Hi,

    In the sample code, OpenEars1.dic is used. I tried changing it to cmu07a.dic, but the app still recognizes only the words from OpenEars1.dic and not those from cmu07a.dic. Please let me know how I can make my own dictionary and use it in the sample app.

    #1017591
    Halle Winkler
    Politepix

    Welcome,

    Please use the tutorial to learn how vocabularies work in OpenEars: https://www.politepix.com/openears/tutorial

    The same information is also covered in the documentation shipped with the distribution.

    #1017592
    priya.bagrecha13
    Participant

    Hi Halle,

    Thanks for your quick response. I am quite new to iOS, so please forgive me if I am asking silly questions. I have read the tutorial. According to it, I have to change the path to the dictionary, which I did. But it is still not recognizing the correct words; it recognizes the same words that were in OpenEars1.dic.

    #1017593
    Halle Winkler
    Politepix

    Hi Priya,

    OK, not a problem. The vocabulary is not defined by the use of a dictionary file or by changing the path to a dictionary file. The only information inside a dictionary file is how the words are pronounced, which is only half of the information that PocketsphinxController needs in order to use a specific vocabulary (the other half is the language model).
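
To make the point about dictionary contents concrete, here is a minimal Python sketch that reads a few lines in the CMU-style pronunciation dictionary layout (a word followed by its phonemes, with alternate pronunciations marked as WORD(2)). The function name and the sample entries are illustrative assumptions and are not taken from OpenEars1.dic or from OpenEars itself. The result is only a word-to-phonemes mapping; nothing in it says which words the recognizer should listen for.

```python
def parse_pronunciation_dictionary(text):
    """Map each word to a list of pronunciations (each a list of phonemes)."""
    entries = {}
    for line in text.splitlines():
        line = line.strip()
        if not line:
            continue  # skip blank lines
        word, *phonemes = line.split()
        # Alternate pronunciations are written as WORD(2), WORD(3), ...
        base_word = word.split("(", 1)[0]
        entries.setdefault(base_word, []).append(phonemes)
    return entries


# Illustrative sample entries (assumed for this sketch).
SAMPLE = """\
HELLO HH AH L OW
HELLO(2) HH EH L OW
WORLD W ER L D
"""

pronunciations = parse_pronunciation_dictionary(SAMPLE)
print(pronunciations["HELLO"])
```

Swapping in a larger dictionary file only changes the size of this mapping, which is consistent with the behavior described in the question.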

    Instead, a vocabulary is created dynamically by giving an NSArray of words or phrases to the generateLanguageModelFromArray:withFilesNamed: method of LanguageModelGenerator. This outputs a matched pair of a language model and a dictionary, both of which must be used together and in the specific format output by LanguageModelGenerator. You can see exactly how if you go to the tutorial, select “Offline Speech Recognition”, and follow the instructions under “Using LanguageModelGenerator” exactly.

    The second issue is that in offline recognition, your vocabulary will only be accurately perceived if it is between 1 and 500 words (approximately; actual results will vary, and I have seen accurate models of around 1000 words and less-accurate models that were very small but consisted of very similar-sounding one-syllable words or words that are very uncommon in English). So cmu07a.dic cannot be used with OpenEars, because it has about 80,000 words.
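
As a rough illustration of the size guideline above, the following Python sketch counts the distinct words in a list of phrases and compares the count with the approximate 500-word figure. The helper name and the sample phrases are hypothetical, and the limit is only the rule of thumb from this post, not a threshold enforced by OpenEars; as noted, small vocabularies of similar-sounding words can still perform poorly.

```python
ROUGH_WORD_LIMIT = 500  # approximate guideline from the post, not a hard limit


def check_vocabulary(phrases, limit=ROUGH_WORD_LIMIT):
    """Return (distinct word count, whether it is within the rough limit)."""
    # Uppercase so that "Go" and "go" are counted as the same word.
    words = {word.upper() for phrase in phrases for word in phrase.split()}
    return len(words), len(words) <= limit


# Hypothetical command vocabulary for a small app.
count, within_limit = check_vocabulary(["go forward", "go back", "stop"])
print(count, within_limit)
```

A list the size of a full pronunciation dictionary would fail this check by a wide margin.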

    #1017595
    priya.bagrecha13
    Participant

    Hi Halle,

    Thank you very much for your quick response. It is working fine for me.

    #1017596
    Halle Winkler
    Politepix

    Super, glad to hear it.
