OpenEars doesn't stop recognition loop immediately on iPhone 4

Home Forums OpenEars OpenEars doesn't stop recognition loop immediately on iPhone 4

Viewing 18 posts - 1 through 18 (of 18 total)

  • Author
    Posts
  • #1021128
    Harz
    Participant

    I’m developing speech recognition application based on OpenEars and RapidEars. It works fine on iPhone 5 and 5s, but there is some bug on iPhone 4: OpenEars doesn’t stop recognition loop immediately when stopListening method is called. It’s delay correlates with quantity of recognized words in current loop and can be up to 1 minute and more… So, how can I stop recognition loop immediately (or at least during several seconds)? Below is openears log. Thanks in advice.

    2014-05-06 18:11:37.404 MyApp[2221:907] Stop button pressed
    time delta 1399389097.435529 partial result fwdtree
    partial: <s>: …[recognized words]… :(0000001)
    INFO: file_omitted(0): cmn_prior_update: from < 12.80 -1.94 -0.17 0.24 -0.35 -0.17 -0.14 -0.17 -0.31 -0.08 -0.17 -0.13 -0.13 >
    INFO: file_omitted(0): cmn_prior_update: to < 12.93 -2.03 -0.19 0.26 -0.36 -0.15 -0.15 -0.13 -0.34 -0.06 -0.16 -0.14 -0.10 >
    INFO: file_omitted(0): idx 1380 is final (1380)
    INFO: file_omitted(0): fwdtree: marking arc buffer final
    INFO: file_omitted(0): fwdtree: allocated 4096 arcs (48 KiB)
    INFO: file_omitted(0): fwdtree: allocated 512 start frame entries (2 KiB)
    INFO: file_omitted(0): Waiting for 1 consumers to finish
    INFO: file_omitted(0): idx 1380 is final (1380)
    time delta 1399389118.439060 end 0000001
    2014-05-06 18:11:58.439 MyApp[2221:907] Status: Detected end of speech, Code: 4
    INFO: file_omitted(0): fwdflat canceled
    INFO: file_omitted(0): fwdtree canceled
    INFO: file_omitted(0): TOTAL fwdtree 48.961 wall 2.135 xRT
    INFO: file_omitted(0): TOTAL fwdflat 77.410 wall 3.376 xRT
    2014-05-06 18:11:58.605 MyApp[2221:907] Status: It has stopped listening, Code: 3

    #1021129
    Halle Winkler
    Politepix

    Welcome,

    Are both the frameworks up to the current version? OpenEars is 1.7 and RapidEars is 1.65.

    #1021130
    Harz
    Participant

    Yes, OpenEars version is 1.7 (with minor changes in the code) and RapidEars is 1.65 (Licensed).

    #1021131
    Halle Winkler
    Politepix

    What are the changes to OpenEars?

    #1021132
    Harz
    Participant

    Modified method pathToModel to enable model updates via Wi-Fi.

    #1021133
    Halle Winkler
    Politepix

    OK, can you show the full logging output of both OpenEarsLogging and verbosePocketsphinx from the beginning to the end of the app session? It’s fine to omit anything that isn’t logging that OpenEars creates and to omit specific vocabulary as long as the vocabulary count isn’t changed.

    #1021140
    Harz
    Participant

    I’ve turned off own application’s logging. Here is full log:
    But it was running on iPhone 5, not iPhone 4. And my problem didn’t took place in this case. I’ll be ready to show logging from iPhone 4 with discussed bug tomorrow, should I remove strings with partial results of recognized text?

    #1021141
    Harz
    Participant

    2014-05-06 21:07:08.962 MyApp[4651:60b] otherApps (
    “com.apple.mobilesafari”,
    “com.youmi.FileMasterEN”
    )
    2014-05-06 21:07:17.950 MyApp[4651:60b] ‘Open Ears’ has been Initialized
    2014-05-06 21:07:17.952 MyApp[4651:60b] Starting OpenEars logging for OpenEars version 1.7 on 32-bit device: iPhone running iOS version: 7.000000
    2014-05-06 21:07:37.318 MyApp[4651:60b] Bundle Name : Model
    2014-05-06 21:07:37.320 MyApp[4651:60b] OE Ext : Model Path is /var/mobile/Applications/E43C3D68-1E23-4C8A-96FD-917CCE0F8B8E/Documents/Model.bundle
    2014-05-06 21:07:37.321 MyApp[4651:60b] OE Ext : Model Path is /var/mobile/Applications/E43C3D68-1E23-4C8A-96FD-917CCE0F8B8E/Documents/Model.bundle
    2014-05-06 21:07:37.325 MyApp[4651:60b] Accuracy set to 20.
    2014-05-06 21:07:37.327 MyApp[4651:60b] OE Ext : Model Path is /var/mobile/Applications/E43C3D68-1E23-4C8A-96FD-917CCE0F8B8E/Documents/Model.bundle
    2014-05-06 21:07:37.336 MyApp[4651:60b] User gave mic permission for this app.
    2014-05-06 21:07:37.338 MyApp[4651:60b] setting sample rate to 16000
    2014-05-06 21:07:37.339 MyApp[4651:60b] The audio session has never been initialized so we will do that now.
    2014-05-06 21:07:37.339 MyApp[4651:60b] Checking and resetting all audio session settings.
    2014-05-06 21:07:37.340 MyApp[4651:60b] audioCategory is incorrect, we will change it.
    2014-05-06 21:07:37.341 MyApp[4651:60b] audioCategory is now on the correct setting of kAudioSessionCategory_PlayAndRecord.
    2014-05-06 21:07:37.342 MyApp[4651:60b] bluetoothInput is incorrect, we will change it.
    2014-05-06 21:07:37.343 MyApp[4651:60b] bluetooth input is now on the correct setting of 1.
    2014-05-06 21:07:37.344 MyApp[4651:60b] Output Device: ReceiverAndMicrophone.
    2014-05-06 21:07:37.345 MyApp[4651:60b] categoryDefaultToSpeaker is incorrect, we will change it.
    2014-05-06 21:07:37.346 MyApp[4651:60b] CategoryDefaultToSpeaker is now on the correct setting of 1.
    2014-05-06 21:07:37.347 MyApp[4651:60b] preferredBufferSize is incorrect, we will change it.
    2014-05-06 21:07:37.348 MyApp[4651:60b] PreferredBufferSize is now on the correct setting of 0.128000.
    2014-05-06 21:07:37.349 MyApp[4651:60b] preferredSampleRateCheck is incorrect, we will change it.
    2014-05-06 21:07:37.350 MyApp[4651:60b] preferred hardware sample rate is now on the correct setting of 16000.000000.
    2014-05-06 21:07:37.385 MyApp[4651:60b] AudioSessionManager startAudioSession has reached the end of the initialization.
    2014-05-06 21:07:37.386 MyApp[4651:60b] Exiting startAudioSession.
    2014-05-06 21:07:37.394 MyApp[4651:650b] setSecondsOfSilence value of 0.000000 was too large or too small or was NULL, using default of 0.700000.
    2014-05-06 21:07:37.400 MyApp[4651:600f] Audio route has changed for the following reason:
    2014-05-06 21:07:37.401 MyApp[4651:600f] There has been a change of category
    2014-05-06 21:07:37.402 MyApp[4651:600f] The previous audio route was Speaker
    2014-05-06 21:07:37.403 MyApp[4651:600f] This is not a case in which OpenEars performs a route change voluntarily. At the close of this function, the audio route is SpeakerAndMicrophone
    2014-05-06 21:07:37.409 MyApp[4651:650b] Project has these words or phrases in its dictionary:
    [deleted]
    2014-05-06 21:07:37.411 MyApp[4651:650b] Recognition loop has started
    INFO: file_omitted(0): Parsing command line:
    \
    -lm /var/mobile/Applications/E43C3D68-1E23-4C8A-96FD-917CCE0F8B8E/Documents/Model.bundle/base.lm.dmp \
    -dict /var/mobile/Applications/E43C3D68-1E23-4C8A-96FD-917CCE0F8B8E/Documents/Model.bundle/base.dic \
    -hmm /var/mobile/Applications/E43C3D68-1E23-4C8A-96FD-917CCE0F8B8E/Documents/Model.bundle \
    -lw 6.5 \
    -samprate 16000 \
    -rawlogdir /var/mobile/Applications/E43C3D68-1E23-4C8A-96FD-917CCE0F8B8E/Library/Caches

    Current configuration:
    [NAME] [DEFLT] [VALUE]
    -adchdr 0 0
    -adcin no no
    -agc none none
    -agcthresh 2.0 2.000000e+00
    -align
    -alpha 0.97 9.700000e-01
    -arcdumpdir
    -ascale 20.0 2.000000e+01
    -aw 1 1
    -backtrace no no
    -beam 1e-48 1.000000e-48
    -bestpath yes yes
    -bestpathlw 9.5 9.500000e+00
    -bghist no no
    -cepdir
    -cepext .mfc .mfc
    -ceplen 13 13
    -cmn current current
    -cmninit 8.0 8.0
    -compallsen no no
    -ctl
    -ctlcount -1 -1
    -ctlincr 1 1
    -ctloffset 0 0
    -debug 0
    -dict /var/mobile/Applications/E43C3D68-1E23-4C8A-96FD-917CCE0F8B8E/Documents/Model.bundle/base.dic
    -dictcase no no
    -dither no no
    -doublebw no no
    -ds 1 1
    -fdict
    -feat 1s_c_d_dd 1s_c_d_dd
    -featparams
    -fillprob 1e-8 1.000000e-08
    -frate 100 100
    -fsg
    -fsgusealtpron yes yes
    -fsgusefiller yes yes
    -fwdflat yes yes
    -fwdflatbeam 1e-64 1.000000e-64
    -fwdflatlw 8.5 8.500000e+00
    -fwdflatsfwin 25 25
    -fwdflatwbeam 7e-29 7.000000e-29
    -fwdtree yes yes
    -fwdtreelm
    -hmm /var/mobile/Applications/E43C3D68-1E23-4C8A-96FD-917CCE0F8B8E/Documents/Model.bundle
    -hyp
    -hypprefix
    -input_endian little little
    -jsgf
    -kdmaxbbi -1 -1
    -kdmaxdepth 0 0
    -kdtree
    -latsize 512 512
    -lda
    -ldadim 0 0
    -lextreedump 0 0
    -lifter 0 0
    -lm /var/mobile/Applications/E43C3D68-1E23-4C8A-96FD-917CCE0F8B8E/Documents/Model.bundle/base.lm.dmp
    -lmctl
    -lmname default default
    -logbase 1.0001 1.000100e+00
    -logfn
    -logspec no no
    -lowerf 133.33334 1.333333e+02
    -lpbeam 1e-40 1.000000e-40
    -lponlybeam 7e-29 7.000000e-29
    -lw 6.5 6.500000e+00
    -maxhmmpf -1 -1
    -maxnewoov 20 20
    -maxsilfr -1 -1
    -maxwpf 50 50
    -mdef
    -mean
    -mfclogdir
    -min_endfr 0 0
    -mixw
    -mixwfloor 0.0000001 1.000000e-07
    -mllr
    -mmap yes yes
    -ncep 13 13
    -nfft 512 512
    -nfilt 40 40
    -nwpen 1.0 1.000000e+00
    -pbeam 1e-48 1.000000e-48
    -pip 1.0 1.000000e+00
    -pl_beam 1e-10 1.000000e-10
    -pl_pbeam 1e-5 1.000000e-05
    -pl_window 0 0
    -rawlogdir /var/mobile/Applications/E43C3D68-1E23-4C8A-96FD-917CCE0F8B8E/Library/Caches
    -remove_dc no no
    -round_filters yes yes
    -samprate 16000 1.600000e+04
    -seed -1 -1
    -sendump
    -senlogdir
    -senmgau
    -silprob 0.005 5.000000e-03
    -smoothspec no no
    -svspec
    -tmat
    -tmatfloor 0.0001 1.000000e-04
    -topn 4 4
    -topn_beam 0 0
    -toprule
    -transform legacy legacy
    -unit_area yes yes
    -upperf 6855.4976 6.855498e+03
    -usewdphones no no
    -uw 1.0 1.000000e+00
    -var
    -varfloor 0.0001 1.000000e-04
    -varnorm no no
    -verbose no no
    -vm
    -warp_params
    -warp_type inverse_linear inverse_linear
    -wbeam 7e-29 7.000000e-29
    -wip 0.65 6.500000e-01
    -wlen 0.025625 2.562500e-02

    INFO: file_omitted(0): Parsing command line:
    \
    -nfilt 40 \
    -lowerf 133.3334 \
    -upperf 6855.4976 \
    -feat s2_4x \
    -agc none \
    -cmn current \
    -varnorm no \
    -fwdflat yes

    Current configuration:
    [NAME] [DEFLT] [VALUE]
    -agc none none
    -agcthresh 2.0 2.000000e+00
    -alpha 0.97 9.700000e-01
    -ceplen 13 13
    -cmn current current
    -cmninit 8.0 8.0
    -dither no no
    -doublebw no no
    -feat 1s_c_d_dd s2_4x
    -frate 100 100
    -input_endian little little
    -lda
    -ldadim 0 0
    -lifter 0 0
    -logspec no no
    -lowerf 133.33334 1.333334e+02
    -ncep 13 13
    -nfft 512 512
    -nfilt 40 40
    -remove_dc no no
    -round_filters yes yes
    -samprate 16000 1.600000e+04
    -seed -1 -1
    -smoothspec no no
    -svspec
    -transform legacy legacy
    -unit_area yes yes
    -upperf 6855.4976 6.855498e+03
    -varnorm no no
    -verbose no no
    -warp_params
    -warp_type inverse_linear inverse_linear
    -wlen 0.025625 2.562500e-02

    INFO: file_omitted(0): Parsed model-specific feature parameters from /var/mobile/Applications/E43C3D68-1E23-4C8A-96FD-917CCE0F8B8E/Documents/Model.bundle/feat.params
    INFO: file_omitted(0): Reading HMM transition probability matrices: /var/mobile/Applications/E43C3D68-1E23-4C8A-96FD-917CCE0F8B8E/Documents/Model.bundle/transition_matrices
    INFO: file_omitted(0): Loading senones from dump file /var/mobile/Applications/E43C3D68-1E23-4C8A-96FD-917CCE0F8B8E/Documents/Model.bundle/sendump
    INFO: file_omitted(0): BEGIN FILE FORMAT DESCRIPTION
    INFO: file_omitted(0): Rows: 64, Columns: 6147
    INFO: file_omitted(0): Using memory-mapped I/O for senones
    INFO: file_omitted(0): Building PID tables for dictionary
    INFO: file_omitted(0): Allocating 49^3 * 2 bytes (229 KiB) for word-initial triphones
    INFO: file_omitted(0): Allocated 29008 bytes (28 KiB) for word-final triphones
    INFO: file_omitted(0): Allocated 29008 bytes (28 KiB) for single-phone word triphones
    INFO: file_omitted(0): No \data\ mark in LM file
    INFO: file_omitted(0): Number of word mappings: 2522
    INFO: file_omitted(0): Number of word mappings: 2522
    INFO: file_omitted(0): Initialized arc buffer ‘fwdtree’, each arc occupies 12 bytes
    2014-05-06 21:07:38.845 MyApp[4651:650b] Starting openAudioDevice on the device.
    2014-05-06 21:07:38.846 MyApp[4651:650b] Audio unit wrapper successfully created.
    2014-05-06 21:07:38.854 MyApp[4651:650b] Set audio route to SpeakerAndMicrophone
    2014-05-06 21:07:38.855 MyApp[4651:650b] Restoring SmartCMN value of 11.306060
    2014-05-06 21:07:38.856 MyApp[4651:650b] Checking and resetting all audio session settings.
    2014-05-06 21:07:38.858 MyApp[4651:650b] audioCategory is correct, we will leave it as it is.
    2014-05-06 21:07:38.858 MyApp[4651:650b] bluetoothInput is correct, we will leave it as it is.
    2014-05-06 21:07:38.859 MyApp[4651:650b] Output Device: SpeakerAndMicrophone.
    2014-05-06 21:07:38.860 MyApp[4651:650b] categoryDefaultToSpeaker is correct, we will leave it as it is.
    2014-05-06 21:07:38.861 MyApp[4651:650b] preferredBufferSize is correct, we will leave it as it is.
    2014-05-06 21:07:38.862 MyApp[4651:650b] preferredSampleRateCheck is correct, we will leave it as it is.
    2014-05-06 21:07:38.863 MyApp[4651:650b] Setting the variables for the device and starting it.
    2014-05-06 21:07:38.863 MyApp[4651:650b] Looping through ringbuffer sections and pre-allocating them.
    2014-05-06 21:07:39.359 MyApp[4651:650b] Started audio output unit.
    2014-05-06 21:07:39.361 MyApp[4651:650b] Calibration has started
    2014-05-06 21:07:41.566 MyApp[4651:650b] Calibration has completed
    2014-05-06 21:07:41.568 MyApp[4651:650b] Listening.
    2014-05-06 21:07:49.319 MyApp[4651:650b] Speech detected…
    INFO: file_omitted(0): arc_buffer_producer_start_utt
    time delta 1399399669.323025 start 0000000
    INFO: file_omitted(0): arc_buffer_consumer_start_utt
    time delta 1399399669.324462 start 0000000
    time delta 1399399669.324843 partial result fwdtree
    partial: <s>:(0000000)
    time delta 1399399669.341870 partial result fwdtree

    _[deleted partial recognized results]_

    2014-05-06 21:08:04.284 MyApp[4651:60b] In ContinuousAudioUnit
    INFO: file_omitted(0): cmn_prior_update: from < 10.43 -1.69 -0.17 0.14 -0.38 -0.18 0.06 -0.16 -0.21 -0.02 -0.05 -0.10 -0.06 >
    INFO: file_omitted(0): cmn_prior_update: to < 10.42 -1.69 -0.17 0.14 -0.38 -0.18 0.06 -0.16 -0.21 -0.02 -0.05 -0.10 -0.06 >
    INFO: file_omitted(0): idx 1404 is final (1404)
    INFO: file_omitted(0): fwdtree: marking arc buffer final
    INFO: file_omitted(0): fwdtree: allocated 4096 arcs (48 KiB)
    INFO: file_omitted(0): fwdtree: allocated 512 start frame entries (2 KiB)
    INFO: file_omitted(0): Waiting for 1 consumers to finish
    INFO: file_omitted(0): idx 1404 is final (1404)
    2014-05-06 21:08:04.959 MyApp[4651:650b] Stopping audio unit.
    2014-05-06 21:08:05.091 MyApp[4651:650b] Audio Output Unit stopped, cleaning up variable states.
    2014-05-06 21:08:05.092 MyApp[4651:650b] This device is not recording, so first we will set its recording status to 0
    2014-05-06 21:08:05.093 MyApp[4651:650b] The audio unit is running so we are going to dispose of its instance
    INFO: file_omitted(0): fwdtree canceled
    INFO: file_omitted(0): TOTAL fwdtree 6.994 wall 0.498 xRT
    INFO: file_omitted(0): fwdflat canceled
    INFO: file_omitted(0): TOTAL fwdflat 27.945 wall 1.990 xRT
    2014-05-06 21:08:05.108 MyApp[4651:650b] No longer listening.

    #1021142
    Halle Winkler
    Politepix

    Sent you an email – get in touch when you have a moment so I can assist you with this.

    #1021148
    Halle Winkler
    Politepix

    Thanks for getting in touch! OK, so, I’ve taken some time to look at your logging output and I think what I’m seeing is that this isn’t using one of the two supported 8khz acoustic models that ship with RapidEars, is that correct? It looks like this is probably one of the 16khz Voxforge models if I’m not mistaken, which is why the performance is going to be slow with RapidEars. Can you let me know if there are any other modifications before we look into if there are any ways to improve the performance for the slower 16khz models on the old devices? A few of the other Sphinx runtime arguments look a bit unexpected to me so it would be good to know if any other changes have been made so that the troubleshooting time can be put to the most efficient use.

    #1021150
    Harz
    Participant

    Thanks a lot Halle. I’m using 16kHz semicontinous model generated from wavs and text corpus. There are no other modifications in OpenEars except pathToModel method.
    Does it make sense to downsample model to 8kHz without losing an acuracy?

    #1021152
    Halle Winkler
    Politepix

    Ah OK, you are using your own model, very cool. It is possible that the 16khz isn’t the only issue but it is definitely going to interact with any other issues on the older devices because that’s a lot more analysis to do.

    Probably it’s best to start out by showing the logging from the iPhone 4 from an incident when it is showing this issue so I can get a look at which stage in the listening loop the behavior is happening in, and then we can look at possible causes and maybe find a way to fix or improve the behavior (I can’t guarantee there is a fix for this since it isn’t one of the models I ship, but I’m happy to work with you to try to improve things). It’s fine removing the vocabulary from the logs, but do let me know the reported size of the vocabulary so I can get a sense of what we’re trying to do here.

    #1021153
    Halle Winkler
    Politepix

    BTW, let me suggest that you create a replicable case for yourself using PocketsphinxController’s pathToTestFile property which will let you use prerecorded audio in place of live audio in the live recognition loop. That way it will be easy for you to verify whether any changes help and you’ll also have some replicable data for me in case you want to send me a test case. It also just makes development a lot easier when you don’t have to provide live input yourself while working on a bug.

    #1021154
    Harz
    Participant

    Thank you for advice. I will implement it soon.

    Currently our vocabulary contains 2800 words including (2) and (3) variants for some words. But we tested application on about 8000 words vocabulary and 20M language model (lm.dmp) It worked fast enough with RapidEars on iPhones 5/5s.

    #1021155
    Halle Winkler
    Politepix

    OK, I think this is probably going to be the root of the issue – RapidEars is only designed for use with the shipped acoustic models and with the normal OpenEars-required language model maximum size of around 200-500 words depending. I can imagine your specs working some of the time on a 5S, but it’s too much overhead for an iPhone 4 to support. I think you’re just seeing a delayed search time on a slow device for a 16khz model doing large vocabulary recognition – speed considerations are the reason that even stock OpenEars doesn’t use 16khz models and big vocabularies, and RapidEars is more sensitive to performance considerations since it has to do much more.

    If you’re finding the performance acceptable on devices later than the iPhone 4, I think what I would do is to make a downsampled version of your acoustic model, ship it alongside your 16khz model, and have the iPhone 4 and earlier fall back to using the 8khz model. This might help your issue sufficiently, but it also might be necessary for you to have the iPhone 4 fall back to a smaller vocabulary as well in order to achieve normal performance. Those would be my starting suggestions so that you can preserve your ideal performance for your users whose devices can support it without giving the older phones a job to do that they aren’t up for.

    #1021156
    Halle Winkler
    Politepix

    I thought of another way you can fall back to some faster iPhone 4 performance along with the other two suggestions. In the file PocketsphinxRunConfig.h there is this sequence:

    #if __LP64__ 
    #define kBEAM @"1e-96" // -beam	Beam width applied to every frame in Viterbi search (smaller values mean wider beam), defaults to 1e-48
    #else
    #define kBEAM @"1e-66" // -beam	Beam width applied to every frame in Viterbi search (smaller values mean wider beam), defaults to 1e-48
    #endif

    You can set the value 1e-66 to a larger value such as 1e-48 for faster searches on older devices. I would leave it as-is for the devices that are already performing satisfactorily.

    #1021158
    Harz
    Participant

    Thanks a lot for your help. We appreciate your advice and suggestions. Soon we’ll regenerate our model to 8kHz and test performance again. In any way it will be nice to realize possibility to stop delayed recognition, because it’s don’t take any sense, but spend too much time. Here is log from iPhone 4. As you can see, delay between sending stopListenning and actually it’s stopping is about 3 minutes.

    2014-05-07 16:36:59.904 MyApp[2864:907] ‘Open Ears’ has been Initialized
    2014-05-07 16:36:59.924 MyApp[2864:907] Starting OpenEars logging for OpenEars version 1.7 on 32-bit device: iPhone running iOS version: 6.100000
    2014-05-07 16:37:01.531 MyApp[2864:907] Bundle Name : Model
    2014-05-07 16:37:01.534 MyApp[2864:907] OE Ext : Model Path is /var/mobile/Applications/A786BFDD-68F9-4FAF-A535-47561B580771/Documents/Model.bundle
    2014-05-07 16:37:01.537 MyApp[2864:907] OE Ext : Model Path is /var/mobile/Applications/A786BFDD-68F9-4FAF-A535-47561B580771/Documents/Model.bundle
    2014-05-07 16:37:01.563 MyApp[2864:907] Accuracy set to 10.
    2014-05-07 16:37:01.566 MyApp[2864:907] OE Ext : Model Path is /var/mobile/Applications/A786BFDD-68F9-4FAF-A535-47561B580771/Documents/Model.bundle
    2014-05-07 16:37:01.573 MyApp[2864:907] setting sample rate to 16000
    2014-05-07 16:37:01.584 MyApp[2864:907] The audio session has never been initialized so we will do that now.
    2014-05-07 16:37:01.585 MyApp[2864:907] Checking and resetting all audio session settings.
    2014-05-07 16:37:01.588 MyApp[2864:907] audioCategory is incorrect, we will change it.
    2014-05-07 16:37:01.590 MyApp[2864:907] audioCategory is now on the correct setting of kAudioSessionCategory_PlayAndRecord.
    2014-05-07 16:37:01.592 MyApp[2864:907] bluetoothInput is incorrect, we will change it.
    2014-05-07 16:37:01.593 MyApp[2864:907] bluetooth input is now on the correct setting of 1.
    2014-05-07 16:37:01.596 MyApp[2864:907] Output Device: ReceiverAndMicrophone.
    2014-05-07 16:37:01.601 MyApp[2864:907] categoryDefaultToSpeaker is incorrect, we will change it.
    2014-05-07 16:37:01.603 MyApp[2864:907] CategoryDefaultToSpeaker is now on the correct setting of 1.
    2014-05-07 16:37:01.605 MyApp[2864:907] preferredBufferSize is incorrect, we will change it.
    2014-05-07 16:37:01.607 MyApp[2864:907] PreferredBufferSize is now on the correct setting of 0.128000.
    2014-05-07 16:37:01.609 MyApp[2864:907] preferredSampleRateCheck is incorrect, we will change it.
    2014-05-07 16:37:01.651 MyApp[2864:907] preferred hardware sample rate is now on the correct setting of 16000.000000.
    2014-05-07 16:37:01.756 MyApp[2864:907] AudioSessionManager startAudioSession has reached the end of the initialization.
    2014-05-07 16:37:01.758 MyApp[2864:907] Exiting startAudioSession.
    2014-05-07 16:37:01.772 MyApp[2864:5c03] setSecondsOfSilence value of 0.000000 was too large or too small or was NULL, using default of 0.700000.
    2014-05-07 16:37:01.813 MyApp[2864:5c03] Project has these words or phrases in its dictionary:
    [del]
    …and 2775 more.
    2014-05-07 16:37:01.819 MyApp[2864:5c03] Recognition loop has started
    2014-05-07 16:37:01.884 MyApp[2864:907] Audio route has changed for the following reason:
    2014-05-07 16:37:01.885 MyApp[2864:907] There has been a change of category
    2014-05-07 16:37:01.887 MyApp[2864:907] The previous audio route was Speaker
    2014-05-07 16:37:01.900 MyApp[2864:907] This is not a case in which OpenEars performs a route change voluntarily. At the close of this function, the audio route is SpeakerAndMicrophone
    INFO: file_omitted(0): Parsing command line:
    \
    -lm /var/mobile/Applications/A786BFDD-68F9-4FAF-A535-47561B580771/Documents/Model.bundle/base.lm.dmp \
    -dict /var/mobile/Applications/A786BFDD-68F9-4FAF-A535-47561B580771/Documents/Model.bundle/base.dic \
    -hmm /var/mobile/Applications/A786BFDD-68F9-4FAF-A535-47561B580771/Documents/Model.bundle \
    -lw 6.5 \
    -samprate 16000 \
    -rawlogdir /var/mobile/Applications/A786BFDD-68F9-4FAF-A535-47561B580771/Library/Caches

    Current configuration:
    [NAME] [DEFLT] [VALUE]
    -adchdr 0 0
    -adcin no no
    -agc none none
    -agcthresh 2.0 2.000000e+00
    -align
    -alpha 0.97 9.700000e-01
    -arcdumpdir
    -ascale 20.0 2.000000e+01
    -aw 1 1
    -backtrace no no
    -beam 1e-48 1.000000e-48
    -bestpath yes yes
    -bestpathlw 9.5 9.500000e+00
    -bghist no no
    -cepdir
    -cepext .mfc .mfc
    -ceplen 13 13
    -cmn current current
    -cmninit 8.0 8.0
    -compallsen no no
    -ctl
    -ctlcount -1 -1
    -ctlincr 1 1
    -ctloffset 0 0
    -debug 0
    -dict /var/mobile/Applications/A786BFDD-68F9-4FAF-A535-47561B580771/Documents/Model.bundle/base.dic
    -dictcase no no
    -dither no no
    -doublebw no no
    -ds 1 1
    -fdict
    -feat 1s_c_d_dd 1s_c_d_dd
    -featparams
    -fillprob 1e-8 1.000000e-08
    -frate 100 100
    -fsg
    -fsgusealtpron yes yes
    -fsgusefiller yes yes
    -fwdflat yes yes
    -fwdflatbeam 1e-64 1.000000e-64
    -fwdflatlw 8.5 8.500000e+00
    -fwdflatsfwin 25 25
    -fwdflatwbeam 7e-29 7.000000e-29
    -fwdtree yes yes
    -fwdtreelm
    -hmm /var/mobile/Applications/A786BFDD-68F9-4FAF-A535-47561B580771/Documents/Model.bundle
    -hyp
    -hypprefix
    -input_endian little little
    -jsgf
    -kdmaxbbi -1 -1
    -kdmaxdepth 0 0
    -kdtree
    -latsize 512 512
    -lda
    -ldadim 0 0
    -lextreedump 0 0
    -lifter 0 0
    -lm /var/mobile/Applications/A786BFDD-68F9-4FAF-A535-47561B580771/Documents/Model.bundle/base.lm.dmp
    -lmctl
    -lmname default default
    -logbase 1.0001 1.000100e+00
    -logfn
    -logspec no no
    -lowerf 133.33334 1.333333e+02
    -lpbeam 1e-40 1.000000e-40
    -lponlybeam 7e-29 7.000000e-29
    -lw 6.5 6.500000e+00
    -maxhmmpf -1 -1
    -maxnewoov 20 20
    -maxsilfr -1 -1
    -maxwpf 50 50
    -mdef
    -mean
    -mfclogdir
    -min_endfr 0 0
    -mixw
    -mixwfloor 0.0000001 1.000000e-07
    -mllr
    -mmap yes yes
    -ncep 13 13
    -nfft 512 512
    -nfilt 40 40
    -nwpen 1.0 1.000000e+00
    -pbeam 1e-48 1.000000e-48
    -pip 1.0 1.000000e+00
    -pl_beam 1e-10 1.000000e-10
    -pl_pbeam 1e-5 1.000000e-05
    -pl_window 0 0
    -rawlogdir /var/mobile/Applications/A786BFDD-68F9-4FAF-A535-47561B580771/Library/Caches
    -remove_dc no no
    -round_filters yes yes
    -samprate 16000 1.600000e+04
    -seed -1 -1
    -sendump
    -senlogdir
    -senmgau
    -silprob 0.005 5.000000e-03
    -smoothspec no no
    -svspec
    -tmat
    -tmatfloor 0.0001 1.000000e-04
    -topn 4 4
    -topn_beam 0 0
    -toprule
    -transform legacy legacy
    -unit_area yes yes
    -upperf 6855.4976 6.855498e+03
    -usewdphones no no
    -uw 1.0 1.000000e+00
    -var
    -varfloor 0.0001 1.000000e-04
    -varnorm no no
    -verbose no no
    -vm
    -warp_params
    -warp_type inverse_linear inverse_linear
    -wbeam 7e-29 7.000000e-29
    -wip 0.65 6.500000e-01
    -wlen 0.025625 2.562500e-02

    INFO: file_omitted(0): Parsing command line:
    \
    -nfilt 40 \
    -lowerf 133.3334 \
    -upperf 6855.4976 \
    -feat s2_4x \
    -agc none \
    -cmn current \
    -varnorm no \
    -fwdflat yes

    Current configuration:
    [NAME] [DEFLT] [VALUE]
    -agc none none
    -agcthresh 2.0 2.000000e+00
    -alpha 0.97 9.700000e-01
    -ceplen 13 13
    -cmn current current
    -cmninit 8.0 8.0
    -dither no no
    -doublebw no no
    -feat 1s_c_d_dd s2_4x
    -frate 100 100
    -input_endian little little
    -lda
    -ldadim 0 0
    -lifter 0 0
    -logspec no no
    -lowerf 133.33334 1.333334e+02
    -ncep 13 13
    -nfft 512 512
    -nfilt 40 40
    -remove_dc no no
    -round_filters yes yes
    -samprate 16000 1.600000e+04
    -seed -1 -1
    -smoothspec no no
    -svspec
    -transform legacy legacy
    -unit_area yes yes
    -upperf 6855.4976 6.855498e+03
    -varnorm no no
    -verbose no no
    -warp_params
    -warp_type inverse_linear inverse_linear
    -wlen 0.025625 2.562500e-02

    INFO: file_omitted(0): Parsed model-specific feature parameters from /var/mobile/Applications/A786BFDD-68F9-4FAF-A535-47561B580771/Documents/Model.bundle/feat.params
    INFO: file_omitted(0): Reading HMM transition probability matrices: /var/mobile/Applications/A786BFDD-68F9-4FAF-A535-47561B580771/Documents/Model.bundle/transition_matrices
    INFO: file_omitted(0): Loading senones from dump file /var/mobile/Applications/A786BFDD-68F9-4FAF-A535-47561B580771/Documents/Model.bundle/sendump
    INFO: file_omitted(0): BEGIN FILE FORMAT DESCRIPTION
    INFO: file_omitted(0): Rows: 64, Columns: 6147
    INFO: file_omitted(0): Using memory-mapped I/O for senones
    INFO: file_omitted(0): Building PID tables for dictionary
    INFO: file_omitted(0): Allocating 49^3 * 2 bytes (229 KiB) for word-initial triphones
    INFO: file_omitted(0): Allocated 29008 bytes (28 KiB) for word-final triphones
    INFO: file_omitted(0): Allocated 29008 bytes (28 KiB) for single-phone word triphones
    INFO: file_omitted(0): No \data\ mark in LM file
    INFO: file_omitted(0): Number of word mappings: 2522
    INFO: file_omitted(0): Number of word mappings: 2522
    INFO: file_omitted(0): Initialized arc buffer ‘fwdtree’, each arc occupies 12 bytes
    2014-05-07 16:37:05.653 MyApp[2864:5c03] Warning: There is a request to use pathToTestFile but there isn’t a usable file at the location given (“/var/mobile/Applications/A786BFDD-68F9-4FAF-A535-47561B580771/Documents/test.wav”) so the live microphone will be used instead.
    2014-05-07 16:37:05.662 MyApp[2864:5c03] Starting openAudioDevice on the device.
    2014-05-07 16:37:05.666 MyApp[2864:5c03] Audio unit wrapper successfully created.
    2014-05-07 16:37:05.678 MyApp[2864:5c03] Set audio route to SpeakerAndMicrophone
    2014-05-07 16:37:05.683 MyApp[2864:5c03] Checking and resetting all audio session settings.
    2014-05-07 16:37:05.687 MyApp[2864:5c03] audioCategory is correct, we will leave it as it is.
    2014-05-07 16:37:05.692 MyApp[2864:5c03] bluetoothInput is correct, we will leave it as it is.
    2014-05-07 16:37:05.696 MyApp[2864:5c03] Output Device: SpeakerAndMicrophone.
    2014-05-07 16:37:05.700 MyApp[2864:5c03] categoryDefaultToSpeaker is correct, we will leave it as it is.
    2014-05-07 16:37:05.704 MyApp[2864:5c03] preferredBufferSize is correct, we will leave it as it is.
    2014-05-07 16:37:05.709 MyApp[2864:5c03] preferredSampleRateCheck is correct, we will leave it as it is.
    2014-05-07 16:37:05.713 MyApp[2864:5c03] Setting the variables for the device and starting it.
    2014-05-07 16:37:05.716 MyApp[2864:5c03] Looping through ringbuffer sections and pre-allocating them.
    2014-05-07 16:37:06.316 MyApp[2864:5c03] Started audio output unit.
    2014-05-07 16:37:06.319 MyApp[2864:5c03] Calibration has started
    2014-05-07 16:37:08.528 MyApp[2864:5c03] Calibration has completed
    2014-05-07 16:37:08.532 MyApp[2864:5c03] Listening.
    2014-05-07 16:37:09.573 MyApp[2864:5c03] Speech detected…
    INFO: file_omitted(0): arc_buffer_producer_start_utt
    time delta 1399469829.599150 start 0000000
    time delta 1399469829.602701 partial result fwdtree
    partial: <s>:(0000000)
    INFO: file_omitted(0): arc_buffer_consumer_start_utt
    time delta 1399469829.609724 start 0000000
    time delta 1399469829.633936 partial result fwdtree
    [del]

    2014-05-07 16:38:29.691 MyApp[2864:907] In ContinuousAudioUnit
    INFO: file_omitted(0): cmn_prior_update: from < 14.39 -2.20 -0.33 0.33 -0.42 -0.08 -0.21 -0.08 -0.34 -0.05 -0.17 -0.12 -0.09 >
    INFO: file_omitted(0): cmn_prior_update: to < 14.23 -2.12 -0.30 0.33 -0.37 -0.12 -0.21 -0.08 -0.34 -0.06 -0.18 -0.13 -0.08 >
    [Here is delay!]
    INFO: file_omitted(0): idx 8005 is final (8005)
    INFO: file_omitted(0): fwdtree: marking arc buffer final
    INFO: file_omitted(0): fwdtree: allocated 8192 arcs (96 KiB)
    INFO: file_omitted(0): fwdtree: allocated 512 start frame entries (2 KiB)
    INFO: file_omitted(0): Waiting for 1 consumers to finish
    INFO: file_omitted(0): idx 8005 is final (8005)
    2014-05-07 16:41:41.845 MyApp[2864:5c03] Stopping audio unit.
    time delta 1399470101.847927 end 0000000
    2014-05-07 16:41:41.913 MyApp[2864:5c03] Audio Output Unit stopped, cleaning up variable states.
    2014-05-07 16:41:41.916 MyApp[2864:5c03] This device is not recording, so first we will set its recording status to 0
    2014-05-07 16:41:41.918 MyApp[2864:5c03] The audio unit is running so we are going to dispose of its instance
    INFO: file_omitted(0): fwdtree canceled
    INFO: file_omitted(0): TOTAL fwdtree 270.115 wall 3.374 xRT
    INFO: file_omitted(0): fwdflat canceled
    INFO: file_omitted(0): TOTAL fwdflat 244.569 wall 3.055 xRT
    2014-05-07 16:41:41.957 MyApp[2864:5c03] No longer listening.

    #1021159
    Halle Winkler
    Politepix

    Hi,

    There’s a bit too much missing from the logging above. Can you email me the complete logging output from the beginning to the end with nothing removed? There also needs to be some NSLogging for the method you use to call stopListening so some programmatic indication of the fact that it was called appears inline in the log with a datestamp, since I can’t get any information out of the logs with the sequence deleted in which stopListening was called.

    BTW, did you see that pathToTestFile isn’t working in the case above? Here is the error:

    Warning: There is a request to use pathToTestFile but there isn’t a usable file at the location given (“/var/mobile/Applications/A786BFDD-68F9-4FAF-A535-47561B580771/Documents/test.wav”) so the live microphone will be used instead.
    
Viewing 18 posts - 1 through 18 (of 18 total)
  • You must be logged in to reply to this topic.