Poor accuracy for spanish language

This topic has 12 replies, 2 voices, and was last updated 9 years ago by Halle Winkler.

Viewing 13 posts - 1 through 13 (of 13 total)

Advertisement: “Did you know OpenEars™ can use rules-based grammars to recognize fixed phrases? And RuleORama lets you use them with RapidEars!”

Author

Posts
April 6, 2015 at 8:33 pm #1025321

oscarjiv91
Participant

Hello again. I’m trying to recognize when an specific word is said. In this case, a name (Carlos, David, etc). I’m having good results with english, but in spanish I have a lot of false positives. I’m using Rejecto, and tried the vadThrestold between 2.0 and 4.0. Is there a way to improve the accuracy? Thanks

April 6, 2015 at 8:54 pm #1025323

Halle Winkler
Politepix

Hello,

The Spanish vadThreshold has to be even higher to aggressively exclude incidental noise – I think it needs to be something like 4.1-4.4.

April 6, 2015 at 9:02 pm #1025324

oscarjiv91
Participant

Thanks for your quick reply.

The thing is that I need to recognize the words from like 2 meters away, and if I set the vadThreshold with a very high value, it does not recognize the word I want. Adapting the acoustic model would improve the accuracy?

April 6, 2015 at 9:02 pm #1025325

oscarjiv91
Participant

or training the acoustic model?

April 6, 2015 at 9:08 pm #1025326

Halle Winkler
Politepix

There should be a sensitivity setting which is low enough to work from that distance and high enough to exclude incidental noise.

April 6, 2015 at 9:14 pm #1025327

oscarjiv91
Participant

This “sensitivity setting” is the vadThreshold? or another variable used by openears?

April 6, 2015 at 9:18 pm #1025328

Halle Winkler
Politepix

vadThreshold, correct. This issue:

I have a lot of false positives

isn’t an accuracy issue but a sensitivity issue. It doesn’t need to be fixed by improving accuracy but by increasing vadThreshold (and probably using Rejecto if the vocabulary is small) to the point that incidental noise isn’t prompting non-null hypotheses to be returned.

April 6, 2015 at 9:25 pm #1025329

oscarjiv91
Participant

Ok thanks Halle! Will test it more with higher values. I’m using Rejecto btw :)

April 6, 2015 at 9:30 pm #1025331

Halle Winkler
Politepix

Super, I would just operate from the expectation that there is going to be a combination of vadThreshold and Rejecto that will give the right results, and experiment from there. Keep in mind that Rejecto can also have a larger or smaller weighting which will increase and decrease its effects. Rejecto is more concerned with rejecting speech at the same loudness as the desired speech but that is not speech represented in the app vocabulary, and vadThreshold is somewhat more concerned with rejecting incidental sounds and non-user speech, but it’s a known issue that Spanish needs higher baseline vadThreshold settings than English.

April 6, 2015 at 9:38 pm #1025332

oscarjiv91
Participant

Yes, I’m always setting the Rejecto weight to 2.0. Setting the vadThreshold to 3.0 in spanish, I get more false positives than expected when somebody is talking (especially when there are words with similar vowels), even if there is no background noises. I guess this is an accuracy issue, right?

April 6, 2015 at 9:50 pm #1025333

Halle Winkler
Politepix

With the Spanish model, vadThreshold has to be set much higher (as above, something around 4.0 to 4.3). You should turn Rejecto off and find the ideal setting for vadThreshold first, and then use Rejecto to exclude out-of-vocabulary speech that remains. This isn’t an accuracy issue, since there is no way to be accurate with utterances which aren’t in the vocabulary – they are simply unknown. This is a combination of oversensitivity and out-of-vocabulary recognition, for which vadThreshold and Rejecto are respectively used.

April 6, 2015 at 10:18 pm #1025335

oscarjiv91
Participant

I appreciate your help Halle! This was very helpful. Thanks

April 6, 2015 at 10:28 pm #1025336

Halle Winkler
Politepix

You’re welcome!
Author

Posts

Viewing 13 posts - 1 through 13 (of 13 total)

You must be logged in to reply to this topic.