Can I access the utterance recording?

Marc_Fielding · September 23, 2021, 12:48pm

Heya,

I’m writing a live translation skill that allows me to be converse in multiple languages using a portable rPi build.

The issue i’m facing is I put Mycroft into conversation mode, have a fallback skill that does the translation, now this works fine if the utterance is in same language Mycroft is expecting(english) but say then a french speaker talks, then the TTS obviously doesn’t work.

From within the fallback skill can I access the original recording that was used to generate the text?

Thanks

Marc

Marc_Fielding · September 23, 2021, 1:12pm

Hmm just seen this.

I suppose I could just save the recordings using this method, pull, translate and then remove them! Unless there’s an easier way?

gez-mycroft · September 24, 2021, 6:34am

Hey Marc,

You can set a custom save path in your mycroft config:

  "listener": {
    "save_path": "/home/user/mycroft-saved-data",
    "save_utterances": true
  }

Very hacky, but if you grab the file with the latest timestamp you should theoretically get the last utterance.

I can’t think of a way of doing it properly without modifying mycroft-core or modifying an STT plugin

Marc_Fielding · September 28, 2021, 9:50am

Yeah awesome thanks a lot, that’s perfect - here’s another question(I can ask another question in the forum if you’d rather).

I want 1 Mycroft instance, now our internal tool is called Kane, I use it to transfer windows, data, displays across our iPhones, TV screens, laptops in the offices “Kane give me my laptop on this device” - kane has facial and voice recognition so he knows what “this” is because he knows where you’re standing.

Now if I’m on my mobile and I want to send a voice clip to “The” Kane not “A” Kane(installed on mobile) is there a way to do that also? Basically, can I submit an audio recording rather than using a mic as input?

Thanks

Marc

gez-mycroft · September 30, 2021, 12:40am

Not an easy way that I can think of off the top of my head…

Your options are probably to either hack the speech service in core to do what you want. Or pipe the audio stream from your recording and set that as the audio input source - essentially emulate a mic.

Others might have different ideas though?

Marc_Fielding · September 30, 2021, 9:17am

Hmm yeah I bet I could pretend to be the mic somehow, I’ll investigate that while we see if anyone has any other ideas, thanks a lot gez!

Marc_Fielding · January 3, 2022, 6:02pm

Bumping an older thread here but found it, will update in a while when I figure out the integration steps but -https://voicemeeter.com/vban-talkie-use-cases/

Marc_Fielding · January 3, 2022, 6:03pm

Might not quite give the level of integration I’m after so we might need to implement our own VBAN emitter - but in short problem solved.