tbrandt78

Forum Replies Created

Viewing 11 posts - 1 through 11 (of 11 total)

  • Author
    Posts
  • in reply to: OE and Bluetooth HFP 8kHz Audio signal #1030779
    tbrandt78
    Participant

    Okay, thanks for the feedback.
    At the moment, I am switching between the internal phone microphone input and bluetooth speaker output. That seems to work reasonably well, but not ideal.

    Basically, I stay on category AVAudioSessionPlayAndRecord and switch the input port back and forth between HFP and the internal microphone.

    This is not ideal, but it seems to be the best way to get it to work.
    Otherwise, OE struggles recognizing commands using the BT microphone.

    in reply to: OE and Bluetooth HFP 8kHz Audio signal #1030771
    tbrandt78
    Participant

    Hey Halle –

    So, it seems that none of these settings will help my case, as far as I can tell. These settings all affect how OEContinuousAudioUnit::setAllAudioSessionSettings sets the AVAudioSession.

    From what I can tell, disablePreferredSampleRate just stop OE from attempting to override the sample rate using this AVAudioSession command :
    [AVAudioSession sharedInstance].preferredSampleRate

    The issue that I have observed from the automotive BluetoothHFP systems is that the audio input all seems to be 8kHz. I have already tried to set preferredSampleRate to 16000, but it always results in the sample rate staying at 8kHz.

    So… I really believe that the answer here is to ‘up-sample’ the input audio signal when 8kHz to 16kHz for OpenEars to take, cleanly. Yes, the captured audio will still have 8kHz resolution, but the bit depth will be at 16kHz which seems more compatible for OpenEars.

    Thoughts?

    in reply to: OE and Bluetooth HFP 8kHz Audio signal #1030768
    tbrandt78
    Participant

    Hi Halle –

    Okay, I will be looking into this, today. Thank You for the advice.
    I am guessing that disablePreferredSampleRate might do the trick is it removes the requirement for 16kHz audio input. I will see how it goes.

    Thank You,
    Tim

    in reply to: 'Key Phrase Spotting' Guidance #1027499
    tbrandt78
    Participant

    Another phrase that triggers ‘Hello Focus’ is ‘Hero Focus’.
    It’s very close, but not the same. Ideally, we would want the former to trigger and the latter to not.

    in reply to: 'Key Phrase Spotting' Guidance #1027498
    tbrandt78
    Participant

    I would also like to add that we are also using RuleORama to listen for “command phrases”. i.e. :
    ‘Text Halle’
    ‘Call Halle’

    For the same reason as above, I have been using RuleORama for this task, because it nearly always hears the phrase and the whole phrase.

    Where we running into problems is w/ false positives, just as in key phrase spotting. For example, if I say ‘Text Spaghetti Monster’ with the above grammar of only ‘Text Halle’ and ‘Call Halle’, it will match it to ‘Text Halle’.

    This is really the missing piece of the puzzle for us, is trying to figure out how to get this listening technology more exclusionary. Right now, it is very forgiving in matching incoming audio to one of the entries in the grammar. I am looking to mimic the ability of Rejecto to throw away unwanted speech.

    Again – any help that you can offer would be appreciated.

    Thanks,
    Tim

    in reply to: 'Key Phrase Spotting' Guidance #1027497
    tbrandt78
    Participant

    Hey Halle –

    Thanks for the response. There must be quite a few ‘false positives’ for hello focus, because it does go off “quite a bit” (ambiguous, I know) during conversation. That being said, to the extent that I’ve hunted for false positives, I had the app listening while I had some YouTube videos running and here are some of the phrases that triggered it :

    “Go for it, Mr. Robbins”
    “Shell of this Person”

    Those are the only two phrases that I know for sure, but I can run some more tests and get back to you.

    One thought that I had was maybe to identify as many of these ‘false positives’ that I can and add them to the grammar to be later ignored if triggered. It just doesn’t seem very scientific.

    I know that this has been said before, but I am really looking to keep the spot-on hair-trigger recognition accuracy that I am achieving w/ RuleORama+RapidEars, yet get the exclusivity that I see happening w/ Rejecto.

    Right now, I can only guarantee one or the other, depending on technology. I am trying to find a way to ensure both. Is that too much to ask? (wink, wink). I kid.

    Thanks so much for your help and advice.

    in reply to: RuleORama : Force SilenceDetection #1027187
    tbrandt78
    Participant

    No worries at all. These plugins work great.
    I have just been figuring out the best way to use them for our purposes.
    I think that we’ve gotten things pretty good.

    Thanks for your support and responses.

    in reply to: RuleORama : Force SilenceDetection #1027177
    tbrandt78
    Participant

    Yes, I was indeed fooling myself.
    In this forum :
    https://www.politepix.com/forums/topic/savethatwav-not-respecting-secondsofsilencetodetect/#post-1027176

    I realized that secondsOfSilenceToDetect only take affect when starting the listener, anyway. So, I was probably experiencing the placebo effect.

    My current thought is to cache the RapidEars result and if the regular OpenEars hypothesis does not kick in within a given threshold, just use it.
    That should accomplish the same objective.

    Thanks again for all the help and the outstanding plugins.
    I am just dialing in some details on an MVP that I hope takes us places.

    Thanks again!

    in reply to: RuleORama : Force SilenceDetection #1027157
    tbrandt78
    Participant

    Hey Halle –

    This ‘trick’ seems to be working for us, for the most part. Just hoping to hear some confirmation from you on whether or not this is the way to go.

    Thanks,
    Tim

    in reply to: RuleORama : Force SilenceDetection #1027113
    tbrandt78
    Participant

    Hey Halle –

    Just a quick followup. So, I tried my suggestion and I set secondsOfSilenceToDetect to 0.f when RapidEars has a valid hypothesis for a command. I released a build and so far it’s working. We haven’t seen any hangs. I imagine that it does indeed force silence detection.

    I will keep you updated, thanks.

    tbrandt78
    Participant

    Posted in wrong forum – edited to delete.
    Excuse me.

Viewing 11 posts - 1 through 11 (of 11 total)