Determining Word Similarity

This topic has 9 replies, 2 voices, and was last updated 8 years ago by Halle Winkler.

Viewing 10 posts - 1 through 10 (of 10 total)

Advertisement: “NeatSpeech is great-sounding offline speech synthesis, compatible with iOS6.1, and you can even edit pronunciations!”

Author

Posts
April 15, 2016 at 1:26 am #1030061

stressdesign
Participant

Hi,
I understand that while the application is able to determine the best sound match(s) to words in the grammar it would be helpful to understand how closely words in our grammar are inherently. E.g. “Jim” sounds somewhat similar to “Tim”, whereas “Jim” sounds nothing like “Mike”.

Clearly, the application interprets spoken phonemes and creates a scored match to words in the grammar. Is this data easy accessible in the files generated?

If we can easily determine likeness up front, it can enable us and other developers to programmatically find non-unique words and force disambiguation.

April 15, 2016 at 8:38 am #1030062

Halle Winkler
Politepix

Welcome,

Are you asking for scoring?

April 15, 2016 at 8:42 am #1030063

stressdesign
Participant

Yes.. specifically a quantified value of phonetic closeness in words.

I know the app scores inputed audio to words in the grammar, but a score or value of matches between words in the grammar would help to establish a more unique grammar.

April 15, 2016 at 8:51 am #1030064

Halle Winkler
Politepix

Scoring is returned with most APIs, although its applicability is inherently limited. You can search these forums for more info about that.

a quantified value of phonetic closeness in words

This is a score.

To clarify, when you refer to the app or application, are you referring to someone’s particular app implementation of the OpenEars framework?

April 15, 2016 at 8:53 am #1030065

stressdesign
Participant

To be clear, i’m not asking about the score between a spoken word and the matching words in the grammar. I’m able to see that already. What I’m asking about the relative phonetic closeness of words in the grammar.

April 15, 2016 at 8:55 am #1030066

Halle Winkler
Politepix

Their closeness to each other?

April 15, 2016 at 8:58 am #1030067

stressdesign
Participant

exactly.. asking the app to discern between tim and trim is much harder than between tim and time. tim and trim are phonetically similar. It would be very helpful to have that quantified easily to assist (possibly by warnings) of grammars that need altering.

April 15, 2016 at 9:09 am #1030068

Halle Winkler
Politepix

Got it. This is getting a little bit beyond the scope of an ASR tool and more into the region of text analysis. As you dive into a goal like this, many questions start to come up about what is “good handling” for advanced cases and then it gets multiplied by the different requirements for language models versus grammars versus RuleORama grammars, and then things that may or may not be equally likely across languages. It is also something that can usually be judged by visual observation of your own grammar.

In an individual app (versus a framework like OpenEars) you can probably restrict the range of what is likely much more, making this simpler to implement for your own specific case. You can check out the file LanguageModelGeneratorLookupList.text in the acoustic model for the language you’re using and (for instance) load it into a data structure like an NSDictionary in order to be able to access it to do evaluation of closeness according to the needs of your application, if there is no opportunity to just look at the grammar at the time of creation and consider whether it contains similar words.

April 15, 2016 at 9:15 am #1030069

stressdesign
Participant

ok, i’ll give that a shot. thank you!

April 15, 2016 at 9:17 am #1030070

Halle Winkler
Politepix

You’re welcome!
Author

Posts

Viewing 10 posts - 1 through 10 (of 10 total)

You must be logged in to reply to this topic.