Rejecto not rejecting?

This topic has 4 replies, 2 voices, and was last updated 8 years, 11 months ago by Halle Winkler.

Viewing 5 posts - 1 through 5 (of 5 total)

Advertisement: “Rejecto is a plugin for OpenEars™ and RapidEars that lets you ignore speech that isn't in your vocabulary!”

Author

Posts
April 29, 2015 at 8:34 am #1025546

mjlee
Participant

Hello – I am using Rejecto with OpenEars, and will be working with speakers of conversational English.. average conversation with many words. Does this work with a vocabulary of around 1000 words? Nearly every time I say a word not in the vocabulary, I get a hypothesis containing words in the vocabulary. In other words, non-vocabulary words are not being rejected at all. For example, if I say just “paraphrase” (not in the vocabulary), I get “CARE” and “AFRAID”.

Thank you in advance for your help!

Here are my settings:
Not using RapidEars
vadThreshold = default
durationOfSilenceBeforeAnalyze = default
rejectoWeight = 2.0;
rejectoUsingVowelsOnly = NO

The vocabulary: (approx 1000 words)

NSArray *firstLanguageArray =
@[@”ACCEPT”,@”FREED”,@”PARTY”,@”ABANDON”,@”ENRAGE”,@”MADDENING”,@”SNOB”,@”ACCEPT”,@”FREEING”,@”PARTY”,@”ABUSE”,@”ENVY”,@”MADDER”,@”SOB”,@”ACCEPTED”,@”FREELY”,@”PASSION”,@”ABUSE”,@”ENVIOUS”,@”MAD”,@”SOBBED”,@”ACCEPTING”,@”PEACE”,@”ACHE”,@”ENVY”,@”MANIAC”,@”SOBBING”,@”ACCEPTS”,@”FREER”,@”PERFECT”,@”ACHING”,@”EVIL”,@”MASOCHIST”,@”SOBS”,@”ACTIVE”,@”FREE”,@”PLAY”,@”ADVERSE”,@”ADVERSARY”,@”EXCRUCIATING”,@”MELANCHOLY”,@”SOLEMN”,@”ADMIRE”,@”ADMIRATION”,@”FRIEND”,@”PLAYED”,@”AFRAID”,@”EXHAUST”,@”MESS”,@”SORROW”,@”ADORE”,@”ADORATION”,@”FUN”,@”PLAYFUL”,@”AGGRAVATE”,@”FAIL”,@”MESSY”,@”SORRY”,@”ADVANTAGE”,@”FUN”,@”PLAYING”,@”AGGRESSIVE”,@”FAKE”,@”MISER”,@”SPITE”,@”ADVENTURE”,@”GENEROUS”,@”PLAYS”,@”AGITATE”,@”FATAL”,@”MISS”,@”STAMMER”,@”AFFECTION”,@”GENTLE”,@”PLEASANT”,@”AGONIZE”,@”FATIGUE”,@”MISSED”,@”STANK”,@”AGREE”,@”GENTLER”,@”PLEASE”,@”AGONY”,@”FAULT”,@”MISSES”,@”STARTLE”,@”STARTLING”,@”AGREEABLE”,@”GENTLEST”,@”PLEASING”,@”ALARM”,@”FEAR”,@”MISSING”,@”STEAL”,@”AGREED”,@”GENTLY”,@”PLEASURE”,@”ALONE”,@”FEARED”,@”MISTAKE”,@”STENCH”,@”AGREEING”,@”GIGGLE”,@”GIGGLING”,@”POPULAR”,@”ANGER”,@”FEARFUL”,@”MOCK”,@”STINK”,@”AGREEMENT”,@”GIVER”,@”POSITIVE”,@”ANGRY”,@”FEARING”,@”MOCKED”,@”STRAIN”,@”AGREES”,@”GIVING”,@”PRAISE”,@”ANGUISH”,@”FEARS”,@”MOCKERY”,@”STRANGE”,@”ALRIGHT”,@”GLAD”,@”PRECIOUS”,@”ANNOY”,@”FEROCIOUS”,@”FEROCITY”,@”MOCKING”,@”STRESS”,@”AMAZE”,@”GLADLY”,@”PRETTY”,@”ANTAGONIZE”,@”ANTAGONISTIC”,@”FEUD”,@”MOCKS”,@”STRUGGLE”,@”AMOROUS”,@”GLAMOR”,@”PRETTY”,@”ANXIETY”,@”ANXIOUS”,@”FIERY”,@”MOLEST”,@”STUBBORN”,@”AMUSE”,@”GLAMOUR”,@”PRIDE”,@”APATHY”,@”APATHETIC”,@”FIGHT”,@”STUNK”,@”GLORY”,@”PRIVILEGE”,@”APPALL”,@”FIRED”,@”MOODY”,@”STUNNED”,@”APPRECIATE”,@”APPRECIATIVE”,@”PRIZE”,@”APPREHENSIVE”,@”FLUNK”,@”MOODY”,@”STUNS”,@”ASSURE”,@”GOOD”,@”PROFIT”,@”FOE”,@”MORON”,@”STUPID”,@”ATTACHMENT”,@”GOODNESS”,@”PROMISE”,@”ARGUE”,@”FOOL”,@”MOURN”,@”STUTTER”,@”ATTRACT”,@”ATTRACTION”,@”GORGEOUS”,@”PROUD”,@”ARROGANT”,@”ARROGANCE”,@”FORBID”,@”MURDER”,@”SUBMISSIVE”,@”AWARD”,@”GRACE”,@”RADIANT”,@”RADIANCE”,@”ASHAME”,@”FOUGHT”,@”NAG”,@”SUCK”,@”AWESOME”,@”GRACED”,@”READINESS”,@”ASSAULT”,@”FRANTIC”,@”NASTY”,@”SUCKED”,@”BEAUTY”,@”BEAUTIFUL”,@”GRACEFUL”,@”READY”,@”ASSHOLE”,@”FREAK”,@”NEEDY”,@”SUCKER”,@”BELOVED”,@”GRACES”,@”REASSURE”,@”ATTACK”,@”FRIGHT”,@”NEGLECT”,@”SUCKS”,@”BENEFICIAL”,@”GRACIOUS”,@”RELAX”,@”ADVERSE”,@”FRUSTRATE”,@”NERD”,@”BENEFIT”,@”GRAND”,@”RELIEF”,@”AVOID”,@”FUCK”,@”NERVOUS”,@”SUFFER”,@”BENEFITS”,@”GRANDE”,@”RELIEVE”,@”AWFUL”,@”FUCKED”,@”NEUROTIC”,@”SUFFERED”,@”BENEFIT”,@”GRATEFUL”,@”RESOLVE”,@”AWKWARD”,@”FUCKER”,@”NUMB”,@”SUFFERER”,@”BENEVOLENT”,@”BENEVOLENCE”,@”GRATITUDE”,@”RESPECT”,@”BAD”,@”FUCKING”,@”OBNOXIOUS”,@”SUFFERING”,@”BENIGN”,@”GREAT”,@”BASHFUL”,@”FUCKS”,@”OBSESS”,@”OBSESSION”,@”SUFFERS”,@”BEST”,@”GRIN”,@”REWARD”,@”BASTARD”,@”FUME”,@”OFFENSE”,@”SUSPICION”,@”SUSPICIOUS”,@”BETTER”,@”GRIN”,@”GRINNING”,@”RICH”,@”BATTLE”,@”FUMING”,@”OFFEND”,@”TANTRUM”,@”BLESS”,@”GRINS”,@”BEATEN”,@”FURIOUS”,@”OFFENSE”,@”TEARS”,@”BOLD”,@”HA”,@”ROMANCE”,@”BITCH”,@”FURY”,@”OUTRAGE”,@”TEASE”,@”BONUS”,@”ROMANTIC”,@”BITTER”,@”GEEK”,@”OVERWHELM”,@”TEMPER”,@”BRAVE”,@”HANDSOME”,@”SAFE”,@”BLAME”,@”GLOOM”,@”PAIN”,@”TEMPERS”,@”BRIGHT”,@”HAPPY”,@”SATISFY”,@”BORE”,@”GODDAM”,@”PAINED”,@”TENSE”,@”BRILLIANT”,@”BRILLIANCE”,@”HAPPY”,@”SAVE”,@”BORING”,@”GOSSIP”,@”PAINFUL”,@”TENSE”,@”CALM”,@”HARMLESS”,@”BOTHER”,@”GRAVE”,@”PAIN”,@”TENSION”,@”CARE”,@”HARMONY”,@”HARMONIOUS”,@”SECURE”,@”BROKE”,@”GREED”,@”PAINS”,@”TERRIBLE”,@”TERRIBLY”,@”CARED”,@”HEARTFELT”,@”SENTIMENTAL”,@”SENTIMENTALITY”,@”BRUTAL”,@”BRUTALITY”,@”GRIEF”,@”PANIC”,@”TERRIFIED”,@”CAREFREE”,@”HEARTWARMING”,@”SHARE”,@”BURDEN”,@”GRIEVE”,@”PARANOID”,@”PARANOIA”,@”TERRIFIES”,@”CAREFUL”,@”HEAVEN”,@”SHARED”,@”CARELESS”,@”GRIM”,@”PATHETIC”,@”TERRIFY”,@”CARES”,@”HEH”,@”SHARES”,@”CHEAT”,@”GROSS”,@”PECULIAR”,@”PECULIARITY”,@”TERRIFYING”,@”CARING”,@”HELPER”,@”SHARING”,@”COMPLAIN”,@”GROUCH”,@”PERVERT”,@”PERVERSE”,@”TERROR”,@”CASUAL”,@”HELPFUL”,@”SILLY”,@”CONFRONT”,@”PESSIMISM”,@”PESSIMIST”,@”THIEF”,@”CASUALLY”,@”HELPING”,@”SILLY”,@”CONFUSE”,@”CONFUSION”,@”GUILT”,@”PETRIFY”,@”CERTAIN”,@”HELPS”,@”SINCERE”,@”SINCERITY”,@”CONTEMPT”,@”CONTEMPTUOUS”,@”HARASS”,@”PETTY”,@”THREAT”,@”CHALLENGE”,@”HERO”,@”SMART”,@”CONTRADICT”,@”HARM”,@”PETTY”,@”TICKED”,@”CHAMP”,@”HILARIOUS”,@”SMILE”,@”CRAP”,@”HARMED”,@”PHOBIA”,@”TIMID”,@”CHARITY”,@”HOHO”,@”SOCIABLE”,@”CRAPPY”,@”HARMFUL”,@”PISS”,@”TORTURE”,@”CHARM”,@”HONEST”,@”CRAZY”,@”HARMING”,@”PITIFUL”,@”TOUGH”,@”CHEER”,@”HONOR”,@”SPECIAL”,@”CRIED”,@”HARMS”,@”PITY”,@”TRAGEDY”,@”CHERISH”,@”HONOR”,@”SPLENDID”,@”CRIES”,@”HATE”,@”POISON”,@”TRAGIC”,@”CHUCKLE”,@”HOPE”,@”STRENGTH”,@”CRITICAL”,@”HATED”,@”PREJUDICE”,@”PREJUDICIAL”,@”TRAUMA”,@”TRAUMATIC”,@”CLEVER”,@”HOPED”,@”STRONG”,@”CRITICIZE”,@”HATEFUL”,@”PRESSURE”,@”TREMBLE”,@”TREMBLING”,@”COMEDY”,@”COMEDIAN”,@”HOPEFUL”,@”SUCCEED”,@”CRUDE”,@”HATER”,@”PRICK”,@”TRICK”,@”COMFORT”,@”HOPEFULLY”,@”SUCCESS”,@”CRUEL”,@”HATES”,@”PROBLEM”,@”TRITE”,@”COMMITMENT”,@”HOPEFULNESS”,@”SUN”,@”CRUSHED”,@”HATING”,@”PROTEST”,@”TRIVIA”,@”TRIVIAL”,@”COMPASSION”,@”HOPES”,@”SUN”,@”CRY”,@”HATRED”,@”PROTESTED”,@”TROUBLE”,@”COMPLIMENT”,@”HOPING”,@”SUN”,@”CRYING”,@”HEARTBREAK”,@”PROTESTING”,@”TURMOIL”,@”CONFIDENCE”,@”HUG”,@”SUNSHINE”,@”CUNT”,@”PUKE”,@”UGH”,@”CONFIDENT”,@”HUG”,@”SUPER”,@”CUT”,@”HEARTLESS”,@”PUNISH”,@”UGLY”,@”CONFIDENTLY”,@”HUGS”,@”SUPERIOR”,@”SUPERIORITY”,@”CYNIC”,@”CYNICISM”,@”HELL”,@”RAGE”,@”UNATTRACTIVE”,@”CONSIDERATE”,@”HUMOR”,@”SUPPORT”,@”DAMAGE”,@”HELLISH”,@”RAGING”,@”UNCERTAIN”,@”CONTENTED”,@”HUMOR”,@”SUPPORTED”,@”DAMN”,@”HELPLESS”,@”RANCID”,@”UNCOMFORTABLE”,@”CONTENTMENT”,@”HURRAH”,@”HURRAY”,@”SUPPORTER”,@”DANGER”,@”HESITATE”,@”HESITATION”,@”RAPE”,@”CONVINCE”,@”IDEAL”,@”SUPPORTING”,@”DAZE”,@”HOMESICK”,@”RAPING”,@”UNEASE”,@”COOL”,@”IMPORTANT”,@”IMPORTANCE”,@”SUPPORTIVE”,@”DECAY”,@”HOPELESS”,@”RAPIST”,@”UNFORTUNATE”,@”COURAGE”,@”COURAGEOUS”,@”IMPRESS”,@”IMPRESSION”,@”SUPPORTS”,@”DEFEAT”,@”HORRIBLE”,@”HORRID”,@”REBEL”,@”REBELLIOUS”,@”UNFRIENDLY”,@”CREATE”,@”IMPROVE”,@”SUPREME”,@”SUPREMACY”,@”DEFECT”,@”DEFECTION”,@”HOSTILE”,@”HOSTILITY”,@”REEK”,@”UNGRATEFUL”,@”CREATIVE”,@”CREATING”,@”IMPROVING”,@”SURE”,@”DEFENCE”,@”HUMILIATE”,@”HUMILIATION”,@”REGRET”,@”UNHAPPY”,@”CREDIT”,@”INCENTIVE”,@”SURPRISE”,@”DEFENSE”,@”HURT”,@”REJECT”,@”REJECTION”,@”UNIMPORTANT”,@”CUTE”,@”INNOCENT”,@”INNOCENCE”,@”SWEET”,@”DEGRADE”,@”IDIOT”,@”RELUCTANT”,@”RELUCTANCE”,@”INSPIRE”,@”SWEETHEART”,@”DEPRESS”,@”DEPRESSION”,@”IGNORE”,@”IGNORANT”,@”REMORSE”,@”UNKIND”,@”DARING”,@”INTELLIGENT”,@”INTELLIGENCE”,@”SWEETIE”,@”DEPRIVED”,@”DEPRIVATION”,@”IMMORAL”,@”IMMORALITY”,@”REPRESS”,@”REPRESSION”,@”DARLIN”,@”DARLING”,@”INTEREST”,@”SWEETLY”,@”DESPAIR”,@”IMPATIENT”,@”IMPATIENCE”,@”RESENT”,@”UNPLEASANT”,@”DEAR”,@”INVIGORATE”,@”SWEETNESS”,@”DESPERATE”,@”DESPERATION”,@”IMPERSONAL”,@”RESIGN”,@”RESIGNATION”,@”UNPROTECTED”,@”DEFINITE”,@”JOKE”,@”SWEETS”,@”DESPISE”,@”IMPOLITE”,@”RESTLESS”,@”UNSAVORY”,@”DEFINITELY”,@”JOKING”,@”TALENT”,@”DESTROY”,@”INADEQUATE”,@”INADEQUACY”,@”REVENGE”,@”UNSUCCESSFUL”,@”DELECTABLE”,@”JOLLY”,@”DESTRUCT”,@”INDECISIVE”,@”RIDICULOUS”,@”RIDICULE”,@”UNSURE”,@”DELICATE”,@”JOY”,@”TENDER”,@”DEVASTATE”,@”INEFFECTUAL”,@”RIGID”,@”UNWELCOME”,@”DELICIOUS”,@”KEEN”,@”TERRIFIC”,@”DEVIL”,@”INFERIOR”,@”INFERIORITY”,@”RISK”,@”UPSET”,@”DELIGHT”,@”KIDDING”,@”THANK”,@”DIFFICULT”,@”INHIBIT”,@”INHIBITION”,@”ROTTEN”,@”UPTIGHT”,@”DETERMINATION”,@”KIND”,@”THANKED”,@”DISADVANTAGE”,@”INSECURE”,@”RUDE”,@”USELESS”,@”DETERMINED”,@”KINDLY”,@”THANKFUL”,@”DISAGREE”,@”INSINCERE”,@”RUIN”,@”VAIN”,@”DEVOTE”,@”DEVOTION”,@”KINDNESS”,@”THANKS”,@”DISAPPOINT”,@”INSULT”,@”SAD”,@”VANITY”,@”DIGNITY”,@”KISS”,@”THOUGHTFUL”,@”DISASTER”,@”INTERRUPT”,@”INTERRUPTION”,@”SADDEN”,@”VICIOUS”,@”DIVINE”,@”THRILL”,@”DISCOMFORT”,@”INTIMIDATE”,@”SADLY”,@”VICTIM”,@”DYNAMIC”,@”DYNAMO”,@”DYNAMISM”,@”LAUGH”,@”TOLERANT”,@”TOLERANCE”,@”DISCOURAGE”,@”IRRATIONAL”,@”SADNESS”,@”VILE”,@”EAGER”,@”LIBERTY”,@”LIBERTINE”,@”TRANQUIL”,@”DISGUST”,@”IRRITATE”,@”IRRITABLE”,@”IRRITABILITY”,@”SARCASM”,@”SARCASTIC”,@”VILLAIN”,@”EASE”,@”LIKE”,@”TREASURE”,@”ISOLATE”,@”ISOLATION”,@”SAVAGE”,@”VIOLATE”,@”VIOLATION”,@”EASIEST”,@”EASIER”,@”LIKEABLE”,@”TREAT”,@”DISILLUSION”,@”JADED”,@”SCARE”,@”VIOLENT”,@”EASILY”,@”LIKED”,@”TRIUMPH”,@”DISLIKE”,@”JEALOUS”,@”SCARING”,@”VULNERABLE”,@”VULNERABILITY”,@”LIKES”,@”TRUE”,@”DISLIKED”,@”JERK”,@”SCARY”,@”VULTURE”,@”EASING”,@”LIKING”,@”DISLIKES”,@”JERKED”,@”SKEPTIC”,@”WAR”,@”EASY”,@”LIVELY”,@”TRUER”,@”DISLIKING”,@”JERKS”,@”SCREAM”,@”WARFARE”,@”ECSTASY”,@”ECSTATIC”,@”TRUEST”,@”DISMAY”,@”KILL”,@”SCREW”,@”WAR”,@”EFFICIENT”,@”EFFICIENCY”,@”TRULY”,@”DISSATISFY”,@”DISSATISFACTION”,@”LAME”,@”SELFISH”,@”WARRING”,@”ELEGANT”,@”ELEGANCE”,@”LOVE”,@”TRUST”,@”DISTRACT”,@”LAZY”,@”SERIOUS”,@”WARS”,@”ENCOURAGE”,@”LOVED”,@”TRUTH”,@”DISTRAUGHT”,@”LAZY”,@”SERIOUSLY”,@”WEAK”,@”ENERGY”,@”ENERGIZE”,@”LOVELY”,@”USEFUL”,@”DISTRESS”,@”LIABILITY”,@”SERIOUSNESS”,@”WEAPON”,@”ENGAGE”,@”LOVER”,@”VALUABLE”,@”DISTRUST”,@”LIAR”,@”SEVERE”,@”WEEP”,@”ENJOY”,@”LOVES”,@”VALUE”,@”DISTURB”,@”LIED”,@”SHAKE”,@”WEIRD”,@”ENTERTAIN”,@”LOVING”,@”VALUED”,@”DOMINATE”,@”DOMINATION”,@”LIES”,@”SHAKY”,@”WEPT”,@”ENTHUSIASM”,@”ENTHUSIASTIC”,@”ENTHUSIAST”,@”LOYAL”,@”VALUES”,@”DOOM”,@”LONE”,@”SHAKY”,@”WHINE”,@”EXCEL”,@”EXCELLENCE”,@”LUCK”,@”VALUING”,@”LONGING”,@”SHAME”,@”WHINING”,@”EXCITE”,@”LUCKED”,@”VIGOR”,@”DOUBT”,@”LOSE”,@”SHIT”,@”WHORE”,@”FAB”,@”LUCKY”,@”VIGOR”,@”DREAD”,@”LOSER”,@”SHOCK”,@”WICKED”,@”FABULOUS”,@”LUCKS”,@”VIRTUE”,@”DULL”,@”LOSES”,@”SHOOK”,@”WIMP”,@”FAITH”,@”LUCKY”,@”VIRTUOUS”,@”VIRTUOSITY”,@”DUMB”,@”LOSING”,@”SHY”,@”WITCH”,@”FANTASTIC”,@”MADLY”,@”VITAL”,@”DUMP”,@”LOSS”,@”SICKEN”,@”WOE”,@”FAVOR”,@”MAGNIFICENT”,@”WARM”,@”DWELL”,@”LOST”,@”SIN”,@”WORRY”,@”FAVOR”,@”MERIT”,@”WEALTH”,@”EGOTIST”,@”LOUSY”,@”LOUSE”,@”SINISTER”,@”WORSE”,@”FEARLESS”,@”MERRY”,@”WELCOME”,@”EMBARRASS”,@”LOW”,@”SINS”,@”WORST”,@”FESTIVE”,@”NEAT”,@”WELL”,@”EMOTIONAL”,@”LUCKLESS”,@”SKEPTIC”,@”WORTHLESS”,@”FIESTA”,@”NICE”,@”WIN”,@”EMPTY”,@”LUDICROUS”,@”SLUT”,@”WRONG”,@”FINE”,@”NURTURE”,@”WIN”,@”ENEMIES”,@”LYING”,@”SMOTHER”,@”YEARN”,@”FLATTER”,@”OK”,@”WINS”,@”ENEMY”,@”MAD”,@”SMUG”,@”FLAWLESS”,@”OKAY”,@”WISDOM”,@”FLEXIBLE”,@”FLEXIBILITY”,@”OKAYS”,@”WISE”,@”FLIRT”,@”WON”,@”FOND”,@”WONDERFUL”,@”FONDLY”,@”OPENNESS”,@”WORSHIP”,@”FONDNESS”,@”OPPORTUNE”,@”WORTHWHILE”,@”FORGAVE”,@”OPTIMAL”,@”WOW”,@”FORGIVE”,@”OPTIMAL”,@”OPTIMIZE”,@”YAY”,@”FREE”,@”ORIGINAL”,@”FREE”,@”OUTGOING”,@”FREEBIE”,@”PAINLESS”,@”PALATABLE”,@”PARADISE”];

Log:

2015-04-29 00:24:45.036 OpenEarsSampleApp[1850:615324] Starting OpenEars logging for OpenEars version 2.03 on 64-bit device (or build): iPhone running iOS version: 8.100000
2015-04-29 00:24:45.037 OpenEarsSampleApp[1850:615324] Creating shared instance of OEPocketsphinxController
2015-04-29 00:24:45.041 OpenEarsSampleApp[1850:615324] Number of words in vocabulary: 970
2015-04-29 00:24:45.371 OpenEarsSampleApp[1850:615324] I’m done running performDictionaryLookup and it took 0.223488 seconds
2015-04-29 00:24:45.385 OpenEarsSampleApp[1850:615324] Starting dynamic language model generation
## Vocab generated by v2 of the CMU-Cambridge Statistcal
## Language Modeling toolkit.
##
## Includes 978 words ##
wfreq2vocab : Done.
text2idngram
Vocab : /var/mobile/Containers/Data/Application/81636750-54D2-4106-A28D-7332A254D365/Library/Caches/LangModel.vocab
Output idngram : /var/mobile/Containers/Data/Application/81636750-54D2-4106-A28D-7332A254D365/Library/Caches/LangModel.idngram
N-gram buffer size : 10
Hash table size : 5000
Temp directory : /var/mobile/Containers/Data/Application/81636750-54D2-4106-A28D-7332A254D365/Library/Caches/cmuclmtk-VbUMBA
Max open files : 20
FOF size : 10
n : 3
Initialising hash table…
Reading vocabulary…
Allocating memory for the n-gram buffer…
Reading text into the n-gram buffer…
20,000 n-grams processed for each “.”, 1,000,000 for each line.

Sorting n-grams…
Writing sorted n-grams to temporary file /var/mobile/Containers/Data/Application/81636750-54D2-4106-A28D-7332A254D365/Library/Caches/cmuclmtk-VbUMBA/1
Merging 1 temporary files…

2-grams occurring: N times > N times Sug. -spec_num value
0 1953 1982
1 1890 63 73
2 58 5 15
3 4 1 11
4 0 1 11
5 0 1 11
6 0 1 11
7 0 1 11
8 0 1 11
9 0 1 11
10 0 1 11

3-grams occurring: N times > N times Sug. -spec_num value
0 2929 2968
1 2837 92 102
2 86 6 16
3 6 0 10
4 0 0 10
5 0 0 10
6 0 0 10
7 0 0 10
8 0 0 10
9 0 0 10
10 0 0 10
text2idngram : Done.

read_wlist_into_siht: a list of 978 words was read from “/var/mobile/Containers/Data/Application/81636750-54D2-4106-A28D-7332A254D365/Library/Caches/LangModel.vocab”.
read_wlist_into_array: a list of 978 words was read from “/var/mobile/Containers/Data/Application/81636750-54D2-4106-A28D-7332A254D365/Library/Caches/LangModel.vocab”.
Unigram was renormalized to absorb a mass of 0.491688
prob[UNK] = 1e-99
ARPA-style 3-gram will be written to /var/mobile/Containers/Data/Application/81636750-54D2-4106-A28D-7332A254D365/Library/Caches/LangModel.arpa
idngram2lm : Done.
INFO: cmd_ln.c(702): Parsing command line:
sphinx_lm_convert \
-i /var/mobile/Containers/Data/Application/81636750-54D2-4106-A28D-7332A254D365/Library/Caches/LangModel.arpa \
-o /var/mobile/Containers/Data/Application/81636750-54D2-4106-A28D-7332A254D365/Library/Caches/LangModel.DMP \
-debug 10

Current configuration:
[NAME] [DEFLT] [VALUE]
-case
-debug 10
-help no no
-i /var/mobile/Containers/Data/Application/81636750-54D2-4106-A28D-7332A254D365/Library/Caches/LangModel.arpa
-ienc
-ifmt
-logbase 1.0001 1.000100e+00
-mmap no no
-o /var/mobile/Containers/Data/Application/81636750-54D2-4106-A28D-7332A254D365/Library/Caches/LangModel.DMP
-oenc utf8 utf8
-ofmt

INFO: ngram_model_arpa.c(504): ngrams 1=978, 2=1952, 3=976
INFO: ngram_model_arpa.c(137): Reading unigrams
INFO: ngram_model_arpa.c(543): 978 = #unigrams created
INFO: ngram_model_arpa.c(197): Reading bigrams
INFO: ngram_model_arpa.c(561): 1952 = #bigrams created
INFO: ngram_model_arpa.c(562): 7 = #prob2 entries
INFO: ngram_model_arpa.c(570): 5 = #bo_wt2 entries
INFO: ngram_model_arpa.c(294): Reading trigrams
INFO: ngram_model_arpa.c(583): 976 = #trigrams created
INFO: ngram_model_arpa.c(584): 4 = #prob3 entries
INFO: ngram_model_dmp.c(518): Building DMP model…
INFO: ngram_model_dmp.c(548): 978 = #unigrams created
INFO: ngram_model_dmp.c(649): 1952 = #bigrams created
INFO: ngram_model_dmp.c(650): 7 = #prob2 entries
INFO: ngram_model_dmp.c(657): 5 = #bo_wt2 entries
INFO: ngram_model_dmp.c(661): 976 = #trigrams created
INFO: ngram_model_dmp.c(662): 4 = #prob3 entries
2015-04-29 00:24:45.502 OpenEarsSampleApp[1850:615324] Done creating language model with CMUCLMTK in 0.116773 seconds.
INFO: cmd_ln.c(702): Parsing command line:
sphinx_lm_convert \
-i /var/mobile/Containers/Data/Application/81636750-54D2-4106-A28D-7332A254D365/Library/Caches/LangModel.arpa \
-o /var/mobile/Containers/Data/Application/81636750-54D2-4106-A28D-7332A254D365/Library/Caches/LangModel.DMP \
-debug 10

Current configuration:
[NAME] [DEFLT] [VALUE]
-case
-debug 10
-help no no
-i /var/mobile/Containers/Data/Application/81636750-54D2-4106-A28D-7332A254D365/Library/Caches/LangModel.arpa
-ienc
-ifmt
-logbase 1.0001 1.000100e+00
-mmap no no
-o /var/mobile/Containers/Data/Application/81636750-54D2-4106-A28D-7332A254D365/Library/Caches/LangModel.DMP
-oenc utf8 utf8
-ofmt

INFO: ngram_model_arpa.c(504): ngrams 1=978, 2=1952, 3=976
INFO: ngram_model_arpa.c(137): Reading unigrams
INFO: ngram_model_arpa.c(543): 978 = #unigrams created
INFO: ngram_model_arpa.c(197): Reading bigrams
INFO: ngram_model_arpa.c(561): 1952 = #bigrams created
INFO: ngram_model_arpa.c(562): 9 = #prob2 entries
INFO: ngram_model_arpa.c(570): 5 = #bo_wt2 entries
INFO: ngram_model_arpa.c(294): Reading trigrams
INFO: ngram_model_arpa.c(583): 976 = #trigrams created
INFO: ngram_model_arpa.c(584): 5 = #prob3 entries
INFO: ngram_model_dmp.c(518): Building DMP model…
INFO: ngram_model_dmp.c(548): 978 = #unigrams created
INFO: ngram_model_dmp.c(649): 1952 = #bigrams created
INFO: ngram_model_dmp.c(650): 9 = #prob2 entries
INFO: ngram_model_dmp.c(657): 5 = #bo_wt2 entries
INFO: ngram_model_dmp.c(661): 976 = #trigrams created
INFO: ngram_model_dmp.c(662): 5 = #prob3 entries
2015-04-29 00:24:45.688 OpenEarsSampleApp[1850:615324] I’m done running dynamic language model generation and it took 0.646351 seconds
2015-04-29 00:24:45.689 OpenEarsSampleApp[1850:615324] Attempting to start listening session from startListeningWithLanguageModelAtPath:
2015-04-29 00:24:45.694 OpenEarsSampleApp[1850:615324] User gave mic permission for this app.
2015-04-29 00:24:45.694 OpenEarsSampleApp[1850:615324] Valid setSecondsOfSilence value of 0.700000 will be used.
2015-04-29 00:24:45.695 OpenEarsSampleApp[1850:615324] Successfully started listening session from startListeningWithLanguageModelAtPath:
2015-04-29 00:24:45.695 OpenEarsSampleApp[1850:615345] Starting listening.
2015-04-29 00:24:45.695 OpenEarsSampleApp[1850:615345] about to set up audio session
2015-04-29 00:24:45.819 OpenEarsSampleApp[1850:615349] Audio route has changed for the following reason:
2015-04-29 00:24:46.046 OpenEarsSampleApp[1850:615345] done starting audio unit
INFO: cmd_ln.c(702): Parsing command line:
\
-lm /var/mobile/Containers/Data/Application/81636750-54D2-4106-A28D-7332A254D365/Library/Caches/LangModel.DMP \
-vad_prespeech 10 \
-vad_postspeech 69 \
-vad_threshold 2.000000 \
-remove_noise yes \
-remove_silence yes \
-bestpath no \
-lw 6.500000 \
-dict /var/mobile/Containers/Data/Application/81636750-54D2-4106-A28D-7332A254D365/Library/Caches/LangModel.dic \
-hmm /private/var/mobile/Containers/Bundle/Application/789ABCF3-0765-4A02-AB6F-D875F4F7033C/OpenEarsSampleApp.app/AcousticModelEnglish.bundle

Current configuration:
[NAME] [DEFLT] [VALUE]
-agc none none
-agcthresh 2.0 2.000000e+00
-allphone
-allphone_ci no no
-alpha 0.97 9.700000e-01
-argfile
-ascale 20.0 2.000000e+01
-aw 1 1
-backtrace no no
-beam 1e-48 1.000000e-48
-bestpath yes no
-bestpathlw 9.5 9.500000e+00
-bghist no no
-ceplen 13 13
-cmn current current
-cmninit 8.0 8.0
-compallsen no no
-debug 0
-dict /var/mobile/Containers/Data/Application/81636750-54D2-4106-A28D-7332A254D365/Library/Caches/LangModel.dic
-dictcase no no
-dither no no
-doublebw no no
-ds 1 1
-fdict
-feat 1s_c_d_dd 1s_c_d_dd
-featparams
-fillprob 1e-8 1.000000e-08
-frate 100 100
-fsg
-fsgusealtpron yes yes
-fsgusefiller yes yes
-fwdflat yes yes
-fwdflatbeam 1e-64 1.000000e-64
-fwdflatefwid 4 4
-fwdflatlw 8.5 8.500000e+00
-fwdflatsfwin 25 25
-fwdflatwbeam 7e-29 7.000000e-29
-fwdtree yes yes
-hmm /private/var/mobile/Containers/Bundle/Application/789ABCF3-0765-4A02-AB6F-D875F4F7033C/OpenEarsSampleApp.app/AcousticModelEnglish.bundle
-input_endian little little
-jsgf
-kdmaxbbi -1 -1
-kdmaxdepth 0 0
-kdtree
-keyphrase
-kws
-kws_plp 1e-1 1.000000e-01
-kws_threshold 1 1.000000e+00
-latsize 5000 5000
-lda
-ldadim 0 0
-lextreedump 0 0
-lifter 0 0
-lm /var/mobile/Containers/Data/Application/81636750-54D2-4106-A28D-7332A254D365/Library/Caches/LangModel.DMP
-lmctl
-lmname
-logbase 1.0001 1.000100e+00
-logfn
-logspec no no
-lowerf 133.33334 1.333333e+02
-lpbeam 1e-40 1.000000e-40
-lponlybeam 7e-29 7.000000e-29
-lw 6.5 6.500000e+00
-maxhmmpf 10000 10000
-maxnewoov 20 20
-maxwpf -1 -1
-mdef
-mean
-mfclogdir
-min_endfr 0 0
-mixw
-mixwfloor 0.0000001 1.000000e-07
-mllr
-mmap yes yes
-ncep 13 13
-nfft 512 512
-nfilt 40 40
-nwpen 1.0 1.000000e+00
-pbeam 1e-48 1.000000e-48
-pip 1.0 1.000000e+00
-pl_beam 1e-10 1.000000e-10
-pl_pbeam 1e-5 1.000000e-05
-pl_window 0 0
-rawlogdir
-remove_dc no no
-remove_noise yes yes
-remove_silence yes yes
-round_filters yes yes
-samprate 16000 1.600000e+04
-seed -1 -1
-sendump
-senlogdir
-senmgau
-silprob 0.005 5.000000e-03
-smoothspec no no
-svspec
-tmat
-tmatfloor 0.0001 1.000000e-04
-topn 4 4
-topn_beam 0 0
-toprule
-transform legacy legacy
-unit_area yes yes
-upperf 6855.4976 6.855498e+03
-usewdphones no no
-uw 1.0 1.000000e+00
-vad_postspeech 50 69
-vad_prespeech 10 10
-vad_threshold 2.0 2.000000e+00
-var
-varfloor 0.0001 1.000000e-04
-varnorm no no
-verbose no no
-warp_params
-warp_type inverse_linear inverse_linear
-wbeam 7e-29 7.000000e-29
-wip 0.65 6.500000e-01
-wlen 0.025625 2.562500e-02

2015-04-29 00:24:46.052 OpenEarsSampleApp[1850:615349] There was a category change. The new category is AVAudioSessionCategoryPlayAndRecord
INFO: cmd_ln.c(702): Parsing command line:
\
-nfilt 25 \
-lowerf 130 \
-upperf 6800 \
-feat 1s_c_d_dd \
-svspec 0-12/13-25/26-38 \
-agc none \
-cmn current \
-varnorm no \
-transform dct \
-lifter 22 \
-cmninit 40

Current configuration:
[NAME] [DEFLT] [VALUE]
-agc none none
-agcthresh 2.0 2.000000e+00
-alpha 0.97 9.700000e-01
-ceplen 13 13
-cmn current current
-cmninit 8.0 40
-dither no no
-doublebw no no
-feat 1s_c_d_dd 1s_c_d_dd
-frate 100 100
-input_endian little little
-lda
-ldadim 0 0
-lifter 0 22
-logspec no no
-lowerf 133.33334 1.300000e+02
-ncep 13 13
-nfft 512 512
-nfilt 40 25
-remove_dc no no
-remove_noise yes yes
-remove_silence yes yes
-round_filters yes yes
-samprate 16000 1.600000e+04
-seed -1 -1
-smoothspec no no
-svspec 0-12/13-25/26-38
-transform legacy dct
-unit_area yes yes
-upperf 6855.4976 6.800000e+03
-vad_postspeech 50 69
-vad_prespeech 10 10
-vad_threshold 2.0 2.000000e+00
-varnorm no no
-verbose no no
-warp_params
-warp_type inverse_linear inverse_linear
-wlen 0.025625 2.562500e-02

INFO: acmod.c(252): Parsed model-specific feature parameters from /private/var/mobile/Containers/Bundle/Application/789ABCF3-0765-4A02-AB6F-D875F4F7033C/OpenEarsSampleApp.app/AcousticModelEnglish.bundle/feat.params
INFO: feat.c(715): Initializing feature stream to type: ‘1s_c_d_dd’, ceplen=13, CMN=’current’, VARNORM=’no’, AGC=’none’
INFO: cmn.c(143): mean[0]= 12.00, mean[1..12]= 0.0
INFO: acmod.c(171): Using subvector specification 0-12/13-25/26-38
INFO: mdef.c(518): Reading model definition: /private/var/mobile/Containers/Bundle/Application/789ABCF3-0765-4A02-AB6F-D875F4F7033C/OpenEarsSampleApp.app/AcousticModelEnglish.bundle/mdef
2015-04-29 00:24:46.070 OpenEarsSampleApp[1850:615349] This is not a case in which OpenEars notifies of a route change. At the close of this function, the new audio route is —SpeakerMicrophoneBuiltIn—. The previous route before changing to this route was <AVINFO: mdef.c(531): Found byte-order mark BMDF, assuming this is a binary mdef file
INFO: bin_mdef.c(336): Reading binary model definition: /private/var/mobile/Containers/Bundle/Application/789ABCF3-0765-4A02-AB6F-D875F4F7033C/OpenEarsSampleApp.app/AcousticModelEnglish.bundle/mdef
AudioSessionRouteDescription: 0x174603720,
inputs = (null);
outputs = (
“<AVAudioSessionPortDescription: 0x174603750, type = Speaker; name = Speaker; UID = Speaker; selectedDataSource = (null)>”
)>.
INFO: bin_mdef.c(516): 46 CI-phone, 168344 CD-phone, 3 emitstate/phone, 138 CI-sen, 6138 Sen, 32881 Sen-Seq
INFO: tmat.c(206): Reading HMM transition probability matrices: /private/var/mobile/Containers/Bundle/Application/789ABCF3-0765-4A02-AB6F-D875F4F7033C/OpenEarsSampleApp.app/AcousticModelEnglish.bundle/transition_matrices
INFO: acmod.c(124): Attempting to use SCHMM computation module
INFO: ms_gauden.c(198): Reading mixture gaussian parameter: /private/var/mobile/Containers/Bundle/Application/789ABCF3-0765-4A02-AB6F-D875F4F7033C/OpenEarsSampleApp.app/AcousticModelEnglish.bundle/means
INFO: ms_gauden.c(292): 1 codebook, 3 feature, size:
INFO: ms_gauden.c(294): 512×13
INFO: ms_gauden.c(294): 512×13
INFO: ms_gauden.c(294): 512×13
INFO: ms_gauden.c(198): Reading mixture gaussian parameter: /private/var/mobile/Containers/Bundle/Application/789ABCF3-0765-4A02-AB6F-D875F4F7033C/OpenEarsSampleApp.app/AcousticModelEnglish.bundle/variances
INFO: ms_gauden.c(292): 1 codebook, 3 feature, size:
INFO: ms_gauden.c(294): 512×13
INFO: ms_gauden.c(294): 512×13
INFO: ms_gauden.c(294): 512×13
INFO: ms_gauden.c(354): 0 variance values floored
INFO: s2_semi_mgau.c(904): Loading senones from dump file /private/var/mobile/Containers/Bundle/Application/789ABCF3-0765-4A02-AB6F-D875F4F7033C/OpenEarsSampleApp.app/AcousticModelEnglish.bundle/sendump
INFO: s2_semi_mgau.c(928): BEGIN FILE FORMAT DESCRIPTION
INFO: s2_semi_mgau.c(991): Rows: 512, Columns: 6138
INFO: s2_semi_mgau.c(1023): Using memory-mapped I/O for senones
INFO: s2_semi_mgau.c(1294): Maximum top-N: 4 Top-N beams: 0 0 0
INFO: dict.c(320): Allocating 5197 * 32 bytes (162 KiB) for word entries
INFO: dict.c(333): Reading main dictionary: /var/mobile/Containers/Data/Application/81636750-54D2-4106-A28D-7332A254D365/Library/Caches/LangModel.dic
INFO: dict.c(213): Allocated 7 KiB for strings, 11 KiB for phones
INFO: dict.c(336): 1092 words read
INFO: dict.c(342): Reading filler dictionary: /private/var/mobile/Containers/Bundle/Application/789ABCF3-0765-4A02-AB6F-D875F4F7033C/OpenEarsSampleApp.app/AcousticModelEnglish.bundle/noisedict
INFO: dict.c(213): Allocated 0 KiB for strings, 0 KiB for phones
INFO: dict.c(345): 9 words read
INFO: dict2pid.c(396): Building PID tables for dictionary
INFO: dict2pid.c(406): Allocating 46^3 * 2 bytes (190 KiB) for word-initial triphones
INFO: dict2pid.c(132): Allocated 51152 bytes (49 KiB) for word-final triphones
INFO: dict2pid.c(196): Allocated 51152 bytes (49 KiB) for single-phone word triphones
INFO: ngram_model_arpa.c(79): No \data\ mark in LM file
INFO: ngram_model_dmp.c(166): Will use memory-mapped I/O for LM file
INFO: ngram_model_dmp.c(220): ngrams 1=978, 2=1952, 3=976
INFO: ngram_model_dmp.c(266): 978 = LM.unigrams(+trailer) read
INFO: ngram_model_dmp.c(312): 1952 = LM.bigrams(+trailer) read
INFO: ngram_model_dmp.c(338): 976 = LM.trigrams read
INFO: ngram_model_dmp.c(363): 9 = LM.prob2 entries read
INFO: ngram_model_dmp.c(383): 5 = LM.bo_wt2 entries read
INFO: ngram_model_dmp.c(403): 5 = LM.prob3 entries read
INFO: ngram_model_dmp.c(431): 4 = LM.tseg_base entries read
INFO: ngram_model_dmp.c(487): 978 = ascii word strings read
INFO: ngram_search_fwdtree.c(99): 245 unique initial diphones
INFO: ngram_search_fwdtree.c(148): 0 root, 0 non-root channels, 49 single-phone words
INFO: ngram_search_fwdtree.c(186): Creating search tree
INFO: ngram_search_fwdtree.c(192): before: 0 root, 0 non-root channels, 49 single-phone words
INFO: ngram_search_fwdtree.c(326): after: max nonroot chan increased to 3047
INFO: ngram_search_fwdtree.c(339): after: 245 root, 2919 non-root channels, 48 single-phone words
INFO: ngram_search_fwdflat.c(157): fwdflat: min_ef_width = 4, max_sf_win = 25
2015-04-29 00:24:46.165 OpenEarsSampleApp[1850:615345] There was no previous CMN value in the plist so we are using the fresh CMN value 42.000000.
2015-04-29 00:24:46.166 OpenEarsSampleApp[1850:615345] Listening.
2015-04-29 00:24:46.168 OpenEarsSampleApp[1850:615345] Project has these words or phrases in its dictionary:
___REJ_AA
___REJ_AE
___REJ_AH
___REJ_AO
___REJ_AW
___REJ_AY
___REJ_B
___REJ_CH
___REJ_D
___REJ_DH
___REJ_EH
___REJ_ER
___REJ_EY
___REJ_F
___REJ_G
___REJ_HH
___REJ_IH
___REJ_IY
___REJ_JH
___REJ_K
___REJ_L
___REJ_M
___REJ_N
___REJ_NG
___REJ_OW
___REJ_OY
___REJ_P
___REJ_R
___REJ_S
___REJ_SH
___REJ_T
…and 1062 more.
2015-04-29 00:24:46.169 OpenEarsSampleApp[1850:615345] Recognition loop has started
2015-04-29 00:24:46.233 OpenEarsSampleApp[1850:615324] Local callback: Pocketsphinx is now listening.
2015-04-29 00:24:46.235 OpenEarsSampleApp[1850:615324] Local callback: Pocketsphinx started.
2015-04-29 00:24:47.815 OpenEarsSampleApp[1850:615345] Speech detected…
INFO: ngram_search.c(462): Resized backpointer table to 10000 entries
2015-04-29 00:24:49.605 OpenEarsSampleApp[1850:615345] End of speech detected…
INFO: cmn_prior.c(131): cmn_prior_update: from < 42.00 0.00 0.00 0.00 0.00 0.00 0.00 0.00 0.00 0.00 0.00 0.00 0.00 >
INFO: cmn_prior.c(149): cmn_prior_update: to < 38.09 0.20 -10.59 2.18 2.53 -5.46 -8.91 -10.75 3.32 -0.04 -5.32 6.43 2.00 >
INFO: ngram_search_fwdtree.c(1550): 6261 words recognized (33/fr)
INFO: ngram_search_fwdtree.c(1552): 425506 senones evaluated (2240/fr)
INFO: ngram_search_fwdtree.c(1556): 288250 channels searched (1517/fr), 45570 1st, 52353 last
INFO: ngram_search_fwdtree.c(1559): 10175 words for which last channels evaluated (53/fr)
INFO: ngram_search_fwdtree.c(1561): 11847 candidate words for entering last phone (62/fr)
INFO: ngram_search_fwdtree.c(1564): fwdtree 0.40 CPU 0.210 xRT
INFO: ngram_search_fwdtree.c(1567): fwdtree 3.21 wall 1.690 xRT
INFO: ngram_search_fwdflat.c(302): Utterance vocabulary contains 96 words
INFO: ngram_search_fwdflat.c(938): 3622 words recognized (19/fr)
INFO: ngram_search_fwdflat.c(940): 101319 senones evaluated (533/fr)
INFO: ngram_search_fwdflat.c(942): 96622 channels searched (508/fr)
INFO: ngram_search_fwdflat.c(944): 10499 words searched (55/fr)
INFO: ngram_search_fwdflat.c(947): 5903 word transitions (31/fr)
INFO: ngram_search_fwdflat.c(950): fwdflat 0.15 CPU 0.081 xRT
INFO: ngram_search_fwdflat.c(953): fwdflat 0.15 wall 0.080 xRT
2015-04-29 00:24:49.761 OpenEarsSampleApp[1850:615345] Pocketsphinx heard “CARE AFRAID” with a score of (0) and an utterance ID of 0.
2015-04-29 00:24:49.762 OpenEarsSampleApp[1850:615324] Local callback: The received hypothesis is CARE AFRAID with a score of 0 and an ID of 0

April 29, 2015 at 8:50 am #1025547

mjlee
Participant

Additional info:
Spoken words that are in the vocabulary are correctly being identified.
There is almost no background noise, and I am speaking with the device near my mouth, as if I’m talking on the phone.
Nearly every non-vocabulary word I say is not rejected, and recognized as a vocabulary word.

April 29, 2015 at 10:05 am #1025549

Halle Winkler
Politepix

Welcome,

Yes, sorry, I would say that this is expected behavior for a vocabulary so large that there are several perfect rhyming matches for OOV (out of vocabulary) utterances in the vocabulary, when the utterances are being spoken directly into the device at the same level as the rest of the interaction. If it were possible for Rejecto to be so highly weighted that it would override a large vocabulary’s ability to match these utterances, there would be too much confusion to confidently identify in-vocabulary utterances as well. This is actually the same with something like Siri – if you insert an OOV word directly into an otherwise-contextually-understandable Siri interaction (e.g. a name Siri doesn’t know, a foreign word, or something like medical jargon which isn’t in the relevant vocab) it will either be transcribed into something else or the question will be thrown out.

It has only been in the last couple of device versions that a 1000-word vocabulary could be reasonably made use of in OpenEars, so Rejecto is more geared towards use with a smaller vocabulary which has some “holes” in it in terms of not having multiple rhyming matches for the majority of OOV utterances.

Is this a realistic interaction for your app, that there is clear speech directed at it intentionally under ideal circumstances but the speech is unknown to the app? Generally OOV rejection is more optimized for tuning out speech which isn’t intended for the app, ruling out non-speech sounds rather than submitting them for hypotheses, and otherwise avoiding input of a more accidental nature. Users directing a lot of clear intentional speech towards the app that the app doesn’t know about might more come under the category of user education requirements than something needing a technological fix – it depends a bit on how you see this occurring.

Regarding the very large vocabulary, do you know that you can switch between smaller vocabularies almost instantly? So if your requirement is to present a sentence the user is supposed to say correctly, you can just attempt to recognize those words plus the Rejecto phonemes rather than the entire vocabulary at once.

April 29, 2015 at 4:33 pm #1025554

mjlee
Participant

Thanks for the fast reply. I tried cutting it down to about 200 words, with similar thought somewhat better results. From what I read about Rejecto, it seemed that it is designed to reject words that are OOV. But it sounds like what it’s really designed for is rejecting non-word utterances? Switching to a small vocabulary won’t work in my case, as my goal is to let the user say anything, and detect only certain words and reject all others. It sounds like OpenEars with or without Rejecto is probably not a good solution for what I’m trying to do?

April 29, 2015 at 4:43 pm #1025555

Halle Winkler
Politepix

Rejecto is fine at rejecting OOV utterances, but it isn’t going to work ideally in a language model is large enough that it has several other perfect rhymes for the OOV utterances if they are being spoken directly to the app, since they will compete quite evenly with Rejecto’s rejection. I would be surprised if the results were the same with 200 words, perhaps you should create a replication case to show me if you have time:

https://www.politepix.com/forums/topic/how-to-create-a-minimal-case-for-replication/

It is possible that an app concept that is based on the user saying any possible sentence and then a very large vocabulary of keywords being detected within that isn’t a good fit for offline recognition. This is basically a variation on free-form dictation, so if you can’t make the vocabulary small enough that it isn’t full of rhyming combinations for OOV words and you can’t restrict the search space of the input, it will probably work better to use a cloud-based large vocabulary dictation SDK instead of an offline speech UI SDK, and just identify and then process everything.
Author

Posts

Viewing 5 posts - 1 through 5 (of 5 total)

You must be logged in to reply to this topic.