PocketSphinx VAD & Noise Robust Processing



  • #1020481
    andrew
    Participant

    Hi Halle,

    Does the latest version of OpenEars (as well as the plugins) already include the latest VAD and noise processing improvements that were added to PocketSphinx?

    If not, do you have a plan for when those will be added?
    If so, do I need to do anything in order to enable them?

    The reason I ask is that I’m using OpenEars in my app, and it works poorly when there is background noise. In a quiet room, though, recognition accuracy is very good.

    Thanks!
    Andrew

    #1020482
    Halle Winkler
    Politepix

    The VAD for Pocketsphinx 0.8 and later is the same one that has been in Sphinxbase since 2010. Is it possible you’re thinking of a planned feature rather than a shipped one? It is expected that VAD estimation and recognition won’t work as well in a noisy environment.

    #1020499
    andrew
    Participant

    Nickolay said noise-robust processing and VAD have been added in Subversion, but maybe they aren’t part of an official release yet.

    In my use of OpenEars, it doesn’t work well in a noisy environment, but it works very well in a quiet room. Improved noise-robust processing and VAD are critical to making my project useful outside of a lab.

    Thanks!

    #1020504
    Halle Winkler
    Politepix

    It hasn’t been added to the bleeding-edge Sphinx svn repo under cont_ad.h/cont_ad_base.c. If it is part of a brand-new area of source in svn, let me know the names of the source files and I’ll take a look. Good noise robustness is important for everyone’s project, but recognition in very noisy environments is always going to be less accurate, with mistakes in speech/silence estimation. To set expectations: if there is a new VAD implementation and it is mature enough to go into production code, it might be possible to look at it, and maybe add it, starting in April or May. This isn’t an area of the project where changes can be made quickly or under any kind of pressure, because it heavily affects every app that uses the project.

    #1020605
    andrew
    Participant

    Hi Halle, the new VAD code was merged into trunk just over a week ago (r12368-r12371). Is it difficult to merge the new VAD code into OpenEars?

    #1020610
    Halle Winkler
    Politepix

    Sure, see my previous response for the timeframe in which I might be able to look at it.

    #1020663
    Halle Winkler
    Politepix

    OK, I’ve had a chance to look at the code, and it is still a moving target, with fixes for crashes in related feature code being checked in. This is something that will have to come in once a stable version is released, probably as 0.9 of Sphinxbase/Pocketsphinx.
