Tagged: Openears Clarifications
| Author | Posts |
|---|---|
| Author | Posts |
| January 10, 2012 at 9:48 pm #8378 | |
|
ramshad |
I would like to make a blog post to thank Halle for the fabulous Openears Library and clarify more about it. Details of Openears 2.I have used the English Voxforge acoustic model with Openears(from https://sourceforge.net/projects/cmusphinx/files/Acoustic%20and%20Language%20Models/) 3.I am using Dynamic Language model creator class. Clarifications need 2.I have read Mr.darbienapp comment from this blog as “I created an ARPA model with just two choices (YES|NO) and tested it on my iPhone device. My finding is that the accuracy of the recognition when in quiet environment (silent input level around -105dB) is really excellent! I’m getting accuracy of about 99% as the recognition is rarely wrong.” what is the mean of ARPA language model?? is it a common name for .languagemodel and .dic or generated using Sphinx Knowledge Base Tool??? I think JSGF grammar will not support dynamic language model creation. 3.Using an ARPA language model,you suggested as “If you have phrases or sentences that you would like to have recognized in their entirety, you can put them all on a line” is it possible to do this for dynamic language model creation??? 4.There are 2 screens in my application.one of the screen need to detect 10-15 words.Other one need to detect only 1-5 words. So is there any settings to set the listening time dynamically(for screen one:5 seconds,for screen two:20 seconds )?? 5.Is there any default method available in openears to identify the application is resumed? Thanks. How can i identify input level(eg:105dB)?? |
| January 11, 2012 at 4:39 pm #8380 | |
|
Halle |
Hi Ramshad, Welcome. I encourage you to use the current version of OpenEars (.913) and to read all of the documentation, since most of your questions are answered there. Briefly: 1. The default acoustic model should do fine with a California accent. It works best with US accents. I would expect a >15-word model to have high accuracy with the default hmm so I would expect that your issue is due to something else, such as incorrect configuration or something else about your testbed that doesn’t replicate how your clients are using your app. It could also be an issue with the actual model, for instance maybe you are trying to detect isolated syllables, or numbers, and not complete words or phrases. But you should know firsthand how the accuracy is, it shouldn’t be necessary to get the info about that from your clients indirectly. In a pinch, you can get some good speech recordings of some people with the accent you need to check and play it back out of a very good speaker with the phone facing it to see what is happening. |
| January 11, 2012 at 7:22 pm #8381 | |
|
ramshad |
Hi Halle, I have put a sentence or space-delimited series of words into LanguageModelGenerator.Let’s say “IN FRONT OF”. However after the Language model creation, the resultant Openears log shows it as separate 3 words.like IN i didn’t see the word “IN FRONT OF’ in console log!!. Could you please Provide me few sample free/paid Iphone/Ipad application names which used Openears. |
| January 11, 2012 at 7:40 pm #8382 | |
|
Halle |
The log that is printed out at the start of initializing pocketsphinx isn’t a log of the ARPA language model, it’s just a list of words that the app knows. You could only verify that the words have a higher probability by looking at the probabilities in the ARPA model which is created that ends with the suffix .languagemodel. This would require getting acquainted with the meaning of the information in the ARPA model by doing some self-directed research on the format.
I think Google is the best tool for this. |
| January 11, 2012 at 8:51 pm #8383 | |
|
ramshad |
Hi Halle, Thanks for the support. :) |
You must be logged in to reply to this topic.

OpenEars
Our Flying Friends