OpenEars 1.1, the OpenEars plugin system, and RapidEars 1.0, a plugin for OpenEars for live recognition!
Today is a big release day for Politepix: OpenEars 1.1 and the introduction of the OpenEars plugin system! Featuring RapidEars, a plugin for OpenEars that does recognition on live speech.
I am very happy to be introducing multiple Cool Things this Monday. The first is the 1.1 version of OpenEars, which was a bit behind schedule but I hope was worth waiting for, since it introduces…n-best! So you can now see the top “hits” for word matching and the scores for them, rather than only seeing the best hypothesis. You can just turn on n-best and set the number of best hypotheses you’d like in PocketsphinxController, and then add the n-best delegate method of OpenEarsEventsObserver to receive them wherever you’d like. The docs and sample app have been updated to explain how.
The other big new feature of OpenEars 1.1 is that it now has a method that is entirely designed to make it easier for you to test, which is:
– (void) runRecognitionOnWavFileAtPath:(NSString *)wavPath usingLanguageModelAtPath:(NSString *)languageModelPath dictionaryAtPath:(NSString *)dictionaryPath languageModelIsJSGF:(BOOL)languageModelIsJSGF;
This just runs recognition using your normal settings on a single WAV audio file. You can use this to create a testbed in case, for instance, you are trying to see the effect that changes to a language model or acoustic model or other settings have on recognition. There’s more about both of these new features in the documentation and as always, feel free to ask your questions on the forums.
The OpenEars Platform and Plugins
The next big new thing is that OpenEars now has plugins. Over the coming months Politepix will be introducing new drop-in plugin frameworks for your OpenEars app that will open up new functionality. OpenEars will remain free and will also continue to be developed, with new features being added that make use of the potential of its dependencies, such as today’s introduction of n-best. For in-demand functionality that requires major committed development time and brand-new internal and dependency code, those features will be produced as plug-ins sold at the Politepix shop. So OpenEars is now both a free recognition, TTS and language modeling SDK, and also a platform so you can mix and match the exact extended functionality you need without making your app more complex.
Without further ado, it’s my pleasure to introduce Politepix’s first plugin for the OpenEars platform: RapidEars! RapidEars is a plugin that adds live recognition capabilities to PocketsphinxController, so by trading off some accuracy, you can run recognition on speech as it is recorded into the device instead of waiting for a pause and evaluating afterwards. This uses more CPU than OpenEars so it’s advisable to use this for apps targeted at the newest devices. It’s great for games or any app where you want immediate responsiveness. There is nothing like this available using any other speech recognition product because they are all network-based and OpenEars is processed on the device. RapidEars is €999 to license and it has a free demo version for development that can be used for four minutes at a time so you can test it thoroughly before licensing. Installation is as simple as dragging the new plugin into your project and importing its classes in order to use new PocketsphinxController and OpenEarsEventsObserver methods.
You can read more about RapidEars here and download a demo version of the plugin here. Questions for this OpenEars plugin and future ones can also be asked on the OpenEars forums, just mention that you’re asking about RapidEars.
That’s it, have fun with these new products!