False positive wake words

I was listening to a podcast and Mycroft woke up more than a few times. Here are some examples from voice.log:

Utterance: ['i hear they very much want']
Wakeword Detected: hey mycroft
Utterance: ['network will pose no problems whatsoever to']
Wakeword Detected: hey mycroft

Is there any way to avoid these false positives? The install is all defaults …


-Mike M

A few options: fine-tune the model with anything that falsely triggers it, turn on wake word uploading, or train a custom wake word.
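For the uploading piece, the relevant setting (going from memory here, so check it against your install and the docs) is the opt-in flag in your user config, usually `~/.mycroft/mycroft.conf`. Something like:

```json
{
  "opt_in": true
}
```

With that enabled, the project can collect your wake-word audio, which is what feeds the shared training data.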

Allow me to suggest my own tutorials, here:


My model is jussssst starting to be usable after about 2,000 not-wake-words; most of those are from my own house. I set up a cron job on the Pi to copy the saved wake words, unplugged the speaker so I wouldn’t be annoyed by it, and just let it collect everything that triggered it for weeks. I also upped the sensitivity in my training to 0.85. I still have some work to do on it, and I’d especially be interested in adding more wake-word recordings from my family and from strangers.
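The collection setup above can be sketched roughly like this. The source path is an assumption (check where your install actually writes saved wake words), and the script name in the crontab line is made up:

```shell
#!/bin/sh
# archive_wake_words SRC DEST: copy new .wav recordings from SRC into DEST,
# skipping files already archived, so the job is safe to re-run from cron.
# The archived clips are what you'd later feed back into training as
# not-wake-word examples.
archive_wake_words() {
    src="$1"
    dest="$2"
    mkdir -p "$dest"
    for f in "$src"/*.wav; do
        [ -e "$f" ] || continue        # glob matched nothing; no recordings yet
        base=$(basename "$f")
        [ -e "$dest/$base" ] || cp "$f" "$dest/$base"
    done
}

# Example crontab entry -- every 10 minutes (hypothetical script path):
# */10 * * * * /home/pi/archive-wake-words.sh
```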

Hi, thanks for the replies. It’s one thing to train one instance of Mycroft. But how do we bake it into the package so every instance is solid?

-Mike M

But how do we bake it into the package so every instance is solid?

The answer is obscene amounts of data. Apple, Amazon, Google, et al. have hundreds of thousands of hours of training data, across many people, accents, and regions, to train their models. Those models are so solid out of the box, so to speak, because they’ve been trained on so much data. The Mycroft project simply does not have access to a well-organized, high-quality audio corpus of that size. At least, not yet. Right now, the best thing we can do is train models individually. In the future, it might be helpful - and @baconator might want to chime in here - to have a wiki where people can submit recordings for individual wake words, so that the audio corpora can be built up and the models subsequently improved. I believe something like this existed, or might exist somewhere? If a massive community effort to collect lots of high-quality recordings could happen, that would go a long way toward making these models better.

Of course, free community data will need sifting to keep the good stuff and weed out the bad, but that’s part of what it takes to build up large data sets.

But there’s no way to make massive improvements without a lot more data.


More and more varied data.

There’s also the possibility of changes to the tooling, which would probably make some difference as a next step.