olegnaumenko

Forum Replies Created

Viewing 3 posts - 1 through 3 (of 3 total)


Author

Posts
January 16, 2018 at 4:06 pm in reply to: Can utterances only bring back what is in dictionary? #1032204

olegnaumenko
Participant

Thank you for the reply.
Everything is standard, as in the example, except that for this log I changed the mode to grammar and supplied several phrases I would like it to recognize (listed at the top of the log). Xcode 9.2, iPhone 5s. I am not a native English speaker, but I speak it reasonably well. The point of the experiment is to fire only when the proper phrase is said, in proper English.

January 3, 2018 at 5:49 pm in reply to: Can utterances only bring back what is in dictionary? #1032188

olegnaumenko
Participant

OpenEars 2.506, iPhone 5s, speaking into the built-in mic from a distance of 8 to 10 inches, using grammar mode with this vocabulary:

@{OneOfTheseWillBeSaidOnce:@[@"HELLO ROBOT",
@"HEY THERE",
@"GREEN CROCODILE",
@"HELLO PEOPLE",
@"HEY YOU NERD",
@"EMERGENCY SITUATION"]}

When I say “I didn’t say that”, I get “HEY THERE”. Often when I say “Hello People”, I also get “HEY THERE”. I understand that these sound similar. Is there a way to get the detection probability so that I can filter out low-confidence hypotheses? Or is probability unavailable in JSGF mode?

The log follows:

2018-01-03 18:40:15.732414+0200 OpenEarsTest[2672:954570] Starting OpenEars logging for OpenEars version 2.506 on 64-bit device (or build): iPhone running iOS version: 10.300000
2018-01-03 18:40:15.756330+0200 OpenEarsTest[2672:954570] Since there is no cached version, loading the language model lookup list for the acoustic model called AcousticModelEnglish
2018-01-03 18:40:15.813194+0200 OpenEarsTest[2672:954570] I’m done running performDictionaryLookup and it took 0.039237 seconds
2018-01-03 18:40:15.856756+0200 OpenEarsTest[2672:954570] Creating shared instance of OEPocketsphinxController
2018-01-03 18:40:15.873651+0200 OpenEarsTest[2672:954570] Attempting to start listening session from startListeningWithLanguageModelAtPath:
2018-01-03 18:40:15.880222+0200 OpenEarsTest[2672:954570] User gave mic permission for this app.
2018-01-03 18:40:15.882161+0200 OpenEarsTest[2672:954570] setSecondsOfSilence wasn’t set, using default of 0.700000.
2018-01-03 18:40:15.883578+0200 OpenEarsTest[2672:954626] Starting listening.
2018-01-03 18:40:15.883754+0200 OpenEarsTest[2672:954626] About to set up audio session
2018-01-03 18:40:16.130446+0200 OpenEarsTest[2672:954626] Creating audio session with default settings.
2018-01-03 18:40:16.130549+0200 OpenEarsTest[2672:954626] Done setting audio session category.
2018-01-03 18:40:16.131371+0200 OpenEarsTest[2672:954638] Audio route has changed for the following reason:
2018-01-03 18:40:16.135606+0200 OpenEarsTest[2672:954638] There was a category change. The new category is AVAudioSessionCategoryPlayAndRecord
2018-01-03 18:40:16.200989+0200 OpenEarsTest[2672:954638] This is not a case in which OpenEars notifies of a route change. At the close of this method, the new audio route will be <Input route or routes: “MicrophoneBuiltIn”. Output route or routes: “Speaker”>. The previous route before changing to this route was “<AVAudioSessionRouteDescription: 0x174007e80,
inputs = (
“<AVAudioSessionPortDescription: 0x174008160, type = MicrophoneBuiltIn; name = iPhone Microphone; UID = Built-In Microphone; selectedDataSource = Front>”
);
outputs = (
“<AVAudioSessionPortDescription: 0x174007d40, type = Speaker; name = Speaker; UID = Speaker; selectedDataSource = (null)>”
)>”.
2018-01-03 18:40:16.208455+0200 OpenEarsTest[2672:954626] Done setting preferred sample rate to 16000.000000 – now the real sample rate is 16000.000000
2018-01-03 18:40:16.209213+0200 OpenEarsTest[2672:954626] number of channels is already the preferred number of 1 so not setting it.
2018-01-03 18:40:16.210498+0200 OpenEarsTest[2672:954626] Done setting session’s preferred I/O buffer duration to 0.128000 – now the actual buffer duration is 0.128000
2018-01-03 18:40:16.210616+0200 OpenEarsTest[2672:954626] Done setting up audio session
2018-01-03 18:40:16.212290+0200 OpenEarsTest[2672:954638] Audio route has changed for the following reason:
2018-01-03 18:40:16.214772+0200 OpenEarsTest[2672:954626] About to set up audio IO unit in a session with a sample rate of 16000.000000, a channel number of 1 and a buffer duration of 0.128000.
2018-01-03 18:40:16.234177+0200 OpenEarsTest[2672:954638] There was a category change. The new category is AVAudioSessionCategoryPlayAndRecord
2018-01-03 18:40:16.239235+0200 OpenEarsTest[2672:954638] This is not a case in which OpenEars notifies of a route change. At the close of this method, the new audio route will be <Input route or routes: “MicrophoneBuiltIn”. Output route or routes: “Speaker”>. The previous route before changing to this route was “<AVAudioSessionRouteDescription: 0x174007d20,
inputs = (
“<AVAudioSessionPortDescription: 0x174007e90, type = MicrophoneBuiltIn; name = iPhone Microphone; UID = Built-In Microphone; selectedDataSource = Bottom>”
);
outputs = (
“<AVAudioSessionPortDescription: 0x170008640, type = Receiver; name = Receiver; UID = Built-In Receiver; selectedDataSource = (null)>”
)>”.
2018-01-03 18:40:16.248024+0200 OpenEarsTest[2672:954626] Done setting up audio unit
2018-01-03 18:40:16.248110+0200 OpenEarsTest[2672:954626] About to start audio IO unit
2018-01-03 18:40:16.528044+0200 OpenEarsTest[2672:954626] Done starting audio unit
INFO: pocketsphinx.c(145): Parsed model-specific feature parameters from /var/containers/Bundle/Application/A9C4195E-904F-44F1-906A-F193DC56484D/OpenEarsTest.app/AcousticModelEnglish.bundle/feat.params
Current configuration:
[NAME] [DEFLT] [VALUE]
-agc none none
-agcthresh 2.0 2.000000e+00
-allphone
-allphone_ci no no
-alpha 0.97 9.700000e-01
-ascale 20.0 2.000000e+01
-aw 1 1
-backtrace no no
-beam 1e-48 1.000000e-48
-bestpath yes yes
-bestpathlw 9.5 9.500000e+00
-ceplen 13 13
-cmn current current
-cmninit 8.0 40
-compallsen no no
-debug 0
-dict /var/mobile/Containers/Data/Application/2DD76771-D5C9-4303-AE69-7DC22AB5849E/Library/Caches/FirstGrammarModel.dic
-dictcase no no
-dither no no
-doublebw no no
-ds 1 1
-fdict /var/containers/Bundle/Application/A9C4195E-904F-44F1-906A-F193DC56484D/OpenEarsTest.app/AcousticModelEnglish.bundle/noisedict
-feat 1s_c_d_dd 1s_c_d_dd
-featparams /var/containers/Bundle/Application/A9C4195E-904F-44F1-906A-F193DC56484D/OpenEarsTest.app/AcousticModelEnglish.bundle/feat.params
-fillprob 1e-8 1.000000e-08
-frate 100 100
-fsg
-fsgusealtpron yes yes
-fsgusefiller yes yes
-fwdflat yes yes
-fwdflatbeam 1e-64 1.000000e-64
-fwdflatefwid 4 4
-fwdflatlw 8.5 8.500000e+00
-fwdflatsfwin 25 25
-fwdflatwbeam 7e-29 7.000000e-29
-fwdtree yes yes
-hmm /var/containers/Bundle/Application/A9C4195E-904F-44F1-906A-F193DC56484D/OpenEarsTest.app/AcousticModelEnglish.bundle
-input_endian little little
-jsgf /var/mobile/Containers/Data/Application/2DD76771-D5C9-4303-AE69-7DC22AB5849E/Library/Caches/FirstGrammarModel.gram
-keyphrase
-kws
-kws_delay 10 10
-kws_plp 1e-1 1.000000e-01
-kws_threshold 1 1.000000e+00
-latsize 5000 5000
-lda
-ldadim 0 0
-lifter 0 22
-lm
-lmctl
-lmname
-logbase 1.0001 1.000100e+00
-logfn
-logspec no no
-lowerf 133.33334 1.300000e+02
-lpbeam 1e-40 1.000000e-40
-lponlybeam 7e-29 7.000000e-29
-lw 6.5 1.000000e+00
-maxhmmpf 30000 30000
-maxwpf -1 -1
-mdef /var/containers/Bundle/Application/A9C4195E-904F-44F1-906A-F193DC56484D/OpenEarsTest.app/AcousticModelEnglish.bundle/mdef
-mean /var/containers/Bundle/Application/A9C4195E-904F-44F1-906A-F193DC56484D/OpenEarsTest.app/AcousticModelEnglish.bundle/means
-mfclogdir
-min_endfr 0 0
-mixw
-mixwfloor 0.0000001 1.000000e-07
-mllr
-mmap yes yes
-ncep 13 13
-nfft 512 512
-nfilt 40 25
-nwpen 1.0 1.000000e+00
-pbeam 1e-48 1.000000e-48
-pip 1.0 1.000000e+00
-pl_beam 1e-10 1.000000e-10
-pl_pbeam 1e-10 1.000000e-10
-pl_pip 1.0 1.000000e+00
-pl_weight 3.0 3.000000e+00
-pl_window 5 5
-rawlogdir
-remove_dc no no
-remove_noise yes yes
-remove_silence yes yes
-round_filters yes yes
-samprate 16000 1.600000e+04
-seed -1 -1
-sendump /var/containers/Bundle/Application/A9C4195E-904F-44F1-906A-F193DC56484D/OpenEarsTest.app/AcousticModelEnglish.bundle/sendump
-senlogdir
-senmgau
-silprob 0.005 5.000000e-03
-smoothspec no no
-svspec 0-12/13-25/26-38
-tmat /var/containers/Bundle/Application/A9C4195E-904F-44F1-906A-F193DC56484D/OpenEarsTest.app/AcousticModelEnglish.bundle/transition_matrices
-tmatfloor 0.0001 1.000000e-04
-topn 4 4
-topn_beam 0 0
-toprule
-transform legacy dct
-unit_area yes yes
-upperf 6855.4976 6.800000e+03
-uw 1.0 1.000000e+00
-vad_postspeech 50 69
-vad_prespeech 20 10
-vad_startspeech 10 10
-vad_threshold 2.0 2.300000e+00
-var /var/containers/Bundle/Application/A9C4195E-904F-44F1-906A-F193DC56484D/OpenEarsTest.app/AcousticModelEnglish.bundle/variances
-varfloor 0.0001 1.000000e-04
-varnorm no no
-verbose no no
-warp_params
-warp_type inverse_linear inverse_linear
-wbeam 7e-29 7.000000e-29
-wip 0.65 6.500000e-01
-wlen 0.025625 2.562500e-02

INFO: feat.c(715): Initializing feature stream to type: ‘1s_c_d_dd’, ceplen=13, CMN=’current’, VARNORM=’no’, AGC=’none’
INFO: cmn.c(143): mean[0]= 12.00, mean[1..12]= 0.0
INFO: acmod.c(164): Using subvector specification 0-12/13-25/26-38
INFO: mdef.c(518): Reading model definition: /var/containers/Bundle/Application/A9C4195E-904F-44F1-906A-F193DC56484D/OpenEarsTest.app/AcousticModelEnglish.bundle/mdef
INFO: mdef.c(531): Found byte-order mark BMDF, assuming this is a binary mdef file
INFO: bin_mdef.c(336): Reading binary model definition: /var/containers/Bundle/Application/A9C4195E-904F-44F1-906A-F193DC56484D/OpenEarsTest.app/AcousticModelEnglish.bundle/mdef
INFO: bin_mdef.c(516): 46 CI-phone, 168344 CD-phone, 3 emitstate/phone, 138 CI-sen, 6138 Sen, 32881 Sen-Seq
INFO: tmat.c(206): Reading HMM transition probability matrices: /var/containers/Bundle/Application/A9C4195E-904F-44F1-906A-F193DC56484D/OpenEarsTest.app/AcousticModelEnglish.bundle/transition_matrices
INFO: acmod.c(117): Attempting to use PTM computation module
INFO: ms_gauden.c(198): Reading mixture gaussian parameter: /var/containers/Bundle/Application/A9C4195E-904F-44F1-906A-F193DC56484D/OpenEarsTest.app/AcousticModelEnglish.bundle/means
INFO: ms_gauden.c(292): 1 codebook, 3 feature, size:
INFO: ms_gauden.c(294): 512×13
INFO: ms_gauden.c(294): 512×13
INFO: ms_gauden.c(294): 512×13
INFO: ms_gauden.c(198): Reading mixture gaussian parameter: /var/containers/Bundle/Application/A9C4195E-904F-44F1-906A-F193DC56484D/OpenEarsTest.app/AcousticModelEnglish.bundle/variances
INFO: ms_gauden.c(292): 1 codebook, 3 feature, size:
INFO: ms_gauden.c(294): 512×13
INFO: ms_gauden.c(294): 512×13
INFO: ms_gauden.c(294): 512×13
INFO: ms_gauden.c(354): 0 variance values floored
INFO: ptm_mgau.c(805): Number of codebooks doesn’t match number of ciphones, doesn’t look like PTM: 1 != 46
INFO: acmod.c(119): Attempting to use semi-continuous computation module
INFO: ms_gauden.c(198): Reading mixture gaussian parameter: /var/containers/Bundle/Application/A9C4195E-904F-44F1-906A-F193DC56484D/OpenEarsTest.app/AcousticModelEnglish.bundle/means
INFO: ms_gauden.c(292): 1 codebook, 3 feature, size:
INFO: ms_gauden.c(294): 512×13
INFO: ms_gauden.c(294): 512×13
INFO: ms_gauden.c(294): 512×13
INFO: ms_gauden.c(198): Reading mixture gaussian parameter: /var/containers/Bundle/Application/A9C4195E-904F-44F1-906A-F193DC56484D/OpenEarsTest.app/AcousticModelEnglish.bundle/variances
INFO: ms_gauden.c(292): 1 codebook, 3 feature, size:
INFO: ms_gauden.c(294): 512×13
INFO: ms_gauden.c(294): 512×13
INFO: ms_gauden.c(294): 512×13
INFO: ms_gauden.c(354): 0 variance values floored
INFO: s2_semi_mgau.c(904): Loading senones from dump file /var/containers/Bundle/Application/A9C4195E-904F-44F1-906A-F193DC56484D/OpenEarsTest.app/AcousticModelEnglish.bundle/sendump
INFO: s2_semi_mgau.c(928): BEGIN FILE FORMAT DESCRIPTION
INFO: s2_semi_mgau.c(991): Rows: 512, Columns: 6138
INFO: s2_semi_mgau.c(1023): Using memory-mapped I/O for senones
INFO: s2_semi_mgau.c(1294): Maximum top-N: 4 Top-N beams: 0 0 0
INFO: phone_loop_search.c(114): State beam -225 Phone exit beam -225 Insertion penalty 0
INFO: dict.c(320): Allocating 4119 * 32 bytes (128 KiB) for word entries
INFO: dict.c(333): Reading main dictionary: /var/mobile/Containers/Data/Application/2DD76771-D5C9-4303-AE69-7DC22AB5849E/Library/Caches/FirstGrammarModel.dic
INFO: dict.c(213): Allocated 0 KiB for strings, 0 KiB for phones
INFO: dict.c(336): 14 words read
INFO: dict.c(358): Reading filler dictionary: /var/containers/Bundle/Application/A9C4195E-904F-44F1-906A-F193DC56484D/OpenEarsTest.app/AcousticModelEnglish.bundle/noisedict
INFO: dict.c(213): Allocated 0 KiB for strings, 0 KiB for phones
INFO: dict.c(361): 9 words read
INFO: dict2pid.c(396): Building PID tables for dictionary
INFO: dict2pid.c(406): Allocating 46^3 * 2 bytes (190 KiB) for word-initial triphones
INFO: dict2pid.c(132): Allocated 51152 bytes (49 KiB) for word-final triphones
INFO: dict2pid.c(196): Allocated 51152 bytes (49 KiB) for single-phone word triphones
INFO: jsgf.c(691): Defined rule: <FirstGrammarModel.g00000>
INFO: jsgf.c(691): Defined rule: PUBLIC <FirstGrammarModel.rule_0>
INFO: fsg_model.c(215): Computing transitive closure for null transitions
INFO: fsg_model.c(277): 0 null transitions added
INFO: fsg_search.c(227): FSG(beam: -1080, pbeam: -1080, wbeam: -634; wip: -5, pip: 0)
INFO: fsg_model.c(428): Adding silence transitions for <sil> to FSG
INFO: fsg_model.c(448): Added 9 silence word transitions
INFO: fsg_model.c(428): Adding silence transitions for <sil> to FSG
INFO: fsg_model.c(448): Added 9 silence word transitions
INFO: fsg_model.c(428): Adding silence transitions for [BREATH] to FSG
INFO: fsg_model.c(448): Added 9 silence word transitions
INFO: fsg_model.c(428): Adding silence transitions for [COUGH] to FSG
INFO: fsg_model.c(448): Added 9 silence word transitions
INFO: fsg_model.c(428): Adding silence transitions for [NOISE] to FSG
INFO: fsg_model.c(448): Added 9 silence word transitions
INFO: fsg_model.c(428): Adding silence transitions for [SMACK] to FSG
INFO: fsg_model.c(448): Added 9 silence word transitions
INFO: fsg_model.c(428): Adding silence transitions for [UH] to FSG
INFO: fsg_model.c(448): Added 9 silence word transitions
INFO: fsg_search.c(173): Added 4 alternate word transitions
INFO: fsg_lextree.c(110): Allocated 846 bytes (0 KiB) for left and right context phones
INFO: fsg_lextree.c(256): 134 HMM nodes in lextree (81 leaves)
INFO: fsg_lextree.c(259): Allocated 19296 bytes (18 KiB) for all lextree nodes
INFO: fsg_lextree.c(262): Allocated 11664 bytes (11 KiB) for lextree leafnodes
2018-01-03 18:40:16.733045+0200 OpenEarsTest[2672:954626] There is no CMN plist so we are using the fresh CMN value 40.000000.
2018-01-03 18:40:16.733680+0200 OpenEarsTest[2672:954626] Listening.
2018-01-03 18:40:16.734432+0200 OpenEarsTest[2672:954626] Project has these words or phrases in its dictionary:
CROCODILE
EMERGENCY
EMERGENCY(2)
GREEN
HELLO
HELLO(2)
HEY
NERD
PEOPLE
ROBOT
ROBOT(2)
SITUATION
THERE
YOU
2018-01-03 18:40:16.734542+0200 OpenEarsTest[2672:954626] Recognition loop has started
2018-01-03 18:40:16.734957+0200 OpenEarsTest[2672:954570] Successfully started listening session from startListeningWithLanguageModelAtPath:
2018-01-03 18:40:16.752577+0200 OpenEarsTest[2672:954570] Pocketsphinx is now listening.
2018-01-03 18:40:17.022659+0200 OpenEarsTest[2672:954626] Speech detected…
2018-01-03 18:40:17.023302+0200 OpenEarsTest[2672:954570] Pocketsphinx has detected speech.
2018-01-03 18:40:18.126128+0200 OpenEarsTest[2672:954626] End of speech detected…
2018-01-03 18:40:18.127059+0200 OpenEarsTest[2672:954570] Pocketsphinx has detected a period of silence, concluding an utterance.
INFO: cmn_prior.c(131): cmn_prior_update: from < 40.00 0.00 0.00 0.00 0.00 0.00 0.00 0.00 0.00 0.00 0.00 0.00 0.00 >
INFO: cmn_prior.c(149): cmn_prior_update: to < 45.87 15.93 -4.69 -5.94 -3.90 -3.12 -2.24 1.58 -1.77 -1.60 -4.82 -5.68 0.49 >
INFO: fsg_search.c(843): 115 frames, 5325 HMMs (46/fr), 10570 senones (91/fr), 439 history entries (3/fr)

ERROR: “fsg_search.c”, line 913: Final result does not match the grammar in frame 115
2018-01-03 18:40:18.130912+0200 OpenEarsTest[2672:954626] Pocketsphinx heard “” with a score of (0) and an utterance ID of 0.
2018-01-03 18:40:18.131202+0200 OpenEarsTest[2672:954626] Hypothesis was null so we aren’t returning it. If you want null hypotheses to also be returned, set OEPocketsphinxController’s property returnNullHypotheses to TRUE before starting OEPocketsphinxController.
2018-01-03 18:40:20.958072+0200 OpenEarsTest[2672:954624] Speech detected…
2018-01-03 18:40:20.959169+0200 OpenEarsTest[2672:954570] Pocketsphinx has detected speech.
2018-01-03 18:40:22.609818+0200 OpenEarsTest[2672:954626] End of speech detected…
2018-01-03 18:40:22.610542+0200 OpenEarsTest[2672:954570] Pocketsphinx has detected a period of silence, concluding an utterance.
INFO: cmn_prior.c(131): cmn_prior_update: from < 45.87 15.93 -4.69 -5.94 -3.90 -3.12 -2.24 1.58 -1.77 -1.60 -4.82 -5.68 0.49 >
INFO: cmn_prior.c(149): cmn_prior_update: to < 49.01 10.38 1.63 1.36 2.62 -2.81 -8.39 1.88 -3.63 -1.06 -3.82 -2.39 -1.53 >
INFO: fsg_search.c(843): 167 frames, 11050 HMMs (66/fr), 17759 senones (106/fr), 1380 history entries (8/fr)

2018-01-03 18:40:22.614643+0200 OpenEarsTest[2672:954626] Pocketsphinx heard “HEY THERE” with a score of (0) and an utterance ID of 1.
2018-01-03 18:40:22.634868+0200 OpenEarsTest[2672:954570] The received hypothesis is HEY THERE with a score of 0 and an ID of 1
2018-01-03 18:40:25.315471+0200 OpenEarsTest[2672:954624] Speech detected…
2018-01-03 18:40:25.316027+0200 OpenEarsTest[2672:954570] Pocketsphinx has detected speech.
2018-01-03 18:40:27.083584+0200 OpenEarsTest[2672:954627] End of speech detected…
2018-01-03 18:40:27.084296+0200 OpenEarsTest[2672:954570] Pocketsphinx has detected a period of silence, concluding an utterance.
INFO: cmn_prior.c(131): cmn_prior_update: from < 49.01 10.38 1.63 1.36 2.62 -2.81 -8.39 1.88 -3.63 -1.06 -3.82 -2.39 -1.53 >
INFO: cmn_prior.c(149): cmn_prior_update: to < 50.11 9.56 2.45 2.06 4.52 -5.13 -9.10 1.93 -4.76 -0.55 -3.99 -0.50 -2.36 >
INFO: fsg_search.c(843): 189 frames, 14014 HMMs (74/fr), 21753 senones (115/fr), 1671 history entries (8/fr)

2018-01-03 18:40:27.087052+0200 OpenEarsTest[2672:954627] Pocketsphinx heard “HEY THERE” with a score of (0) and an utterance ID of 2.
2018-01-03 18:40:27.093560+0200 OpenEarsTest[2672:954570] The received hypothesis is HEY THERE with a score of 0 and an ID of 2
2018-01-03 18:40:29.268660+0200 OpenEarsTest[2672:954625] Speech detected…
2018-01-03 18:40:29.269468+0200 OpenEarsTest[2672:954570] Pocketsphinx has detected speech.
2018-01-03 18:40:31.318108+0200 OpenEarsTest[2672:954625] End of speech detected…
2018-01-03 18:40:31.319683+0200 OpenEarsTest[2672:954570] Pocketsphinx has detected a period of silence, concluding an utterance.
INFO: cmn_prior.c(131): cmn_prior_update: from < 50.11 9.56 2.45 2.06 4.52 -5.13 -9.10 1.93 -4.76 -0.55 -3.99 -0.50 -2.36 >
INFO: cmn_prior.c(149): cmn_prior_update: to < 49.19 10.69 3.28 1.81 3.41 -5.30 -9.70 2.74 -3.79 -1.62 -3.66 -0.23 -1.29 >
INFO: fsg_search.c(843): 215 frames, 4460 HMMs (20/fr), 9700 senones (45/fr), 587 history entries (2/fr)

2018-01-03 18:40:31.321215+0200 OpenEarsTest[2672:954625] Pocketsphinx heard “HELLO PEOPLE” with a score of (0) and an utterance ID of 3.
2018-01-03 18:40:31.326913+0200 OpenEarsTest[2672:954570] The received hypothesis is HELLO PEOPLE with a score of 0 and an ID of 3
2018-01-03 18:40:32.592207+0200 OpenEarsTest[2672:954627] Speech detected…
2018-01-03 18:40:32.592810+0200 OpenEarsTest[2672:954570] Pocketsphinx has detected speech.
INFO: cmn_prior.c(99): cmn_prior_update: from < 49.19 10.69 3.28 1.81 3.41 -5.30 -9.70 2.74 -3.79 -1.62 -3.66 -0.23 -1.29 >
INFO: cmn_prior.c(116): cmn_prior_update: to < 50.16 10.24 3.44 2.45 4.35 -5.52 -9.80 2.17 -4.13 -1.48 -3.39 -0.10 -1.88 >
2018-01-03 18:40:34.129945+0200 OpenEarsTest[2672:954627] End of speech detected…
2018-01-03 18:40:34.130669+0200 OpenEarsTest[2672:954570] Pocketsphinx has detected a period of silence, concluding an utterance.
INFO: cmn_prior.c(131): cmn_prior_update: from < 50.16 10.24 3.44 2.45 4.35 -5.52 -9.80 2.17 -4.13 -1.48 -3.39 -0.10 -1.88 >
INFO: cmn_prior.c(149): cmn_prior_update: to < 48.14 10.65 3.09 2.29 3.56 -5.10 -9.35 2.25 -3.69 -1.62 -2.82 -0.10 -1.23 >
INFO: fsg_search.c(843): 154 frames, 7536 HMMs (48/fr), 13619 senones (88/fr), 653 history entries (4/fr)

2018-01-03 18:40:34.133792+0200 OpenEarsTest[2672:954627] Pocketsphinx heard “HEY THERE” with a score of (0) and an utterance ID of 4.
2018-01-03 18:40:34.138785+0200 OpenEarsTest[2672:954570] The received hypothesis is HEY THERE with a score of 0 and an ID of 4
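
On the filtering question: every hypothesis in the log above is reported with a score of 0, so a threshold has nothing to discriminate on in this session. As a rough illustration of why, here is a minimal C sketch. It assumes the score is an integer log-probability in the base listed in the configuration dump (-logbase 1.0001); that is an assumption, not something the log confirms. Under it, a score of 0 maps to a probability of 1.0. The function names and the threshold are hypothetical and are not part of OpenEars or Pocketsphinx.

```c
#include <assert.h>
#include <stdbool.h>
#include <stddef.h>

/* Assumption: scores are integer log-probabilities in this base,
   taken from the "-logbase 1.0001" line of the configuration dump. */
#define LOG_BASE 1.0001

/* Hypothetical helper: convert a log-domain score to a linear
   probability in (0, 1]. Non-negative scores are treated as 1.0. */
double score_to_probability(int score)
{
    if (score >= 0) {
        return 1.0;
    }
    /* Compute LOG_BASE^|score| by repeated squaring, then invert. */
    long n = -(long)score;
    double base = LOG_BASE;
    double result = 1.0;
    while (n > 0) {
        if (n & 1) {
            result *= base;
        }
        base *= base;
        n >>= 1;
    }
    return 1.0 / result;
}

/* Hypothetical gate: accept a hypothesis only if it is non-empty and
   its converted score reaches min_probability (between 0 and 1). */
bool accept_hypothesis(const char *hypothesis, int score, double min_probability)
{
    assert(min_probability >= 0.0 && min_probability <= 1.0);
    if (hypothesis == NULL || hypothesis[0] == '\0') {
        return false;
    }
    return score_to_probability(score) >= min_probability;
}
```

With this sketch, accept_hypothesis("HEY THERE", 0, 0.9) returns true for every utterance in the log, because a score of 0 converts to 1.0. A gate like this would only become useful if the recognizer reported scores that vary between utterances.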

January 3, 2018 at 4:09 pm in reply to: Can utterances only bring back what is in dictionary? #1032186

olegnaumenko
Participant

I am also using the grammar method currently, and I have the same problem: I get wrong results when a phrase that is not in the dictionary is said.

Do the probability / score numbers help at all? Is there any way to get a confidence value for the current hypothesis / utterance? This would help greatly.

Will any of your paid plugins help improve this?