Can I access the utterance recording?

Heya,

I’m writing a live translation skill that allows me to be converse in multiple languages using a portable rPi build.

The issue i’m facing is I put Mycroft into conversation mode, have a fallback skill that does the translation, now this works fine if the utterance is in same language Mycroft is expecting(english) but say then a french speaker talks, then the TTS obviously doesn’t work.

From within the fallback skill can I access the original recording that was used to generate the text?

Thanks

Marc

Hmm just seen this.

I suppose I could just save the recordings using this method, pull, translate and then remove them! Unless there’s an easier way?

1 Like

Hey Marc,

You can set a custom save path in your mycroft config:

  "listener": {
    "save_path": "/home/user/mycroft-saved-data",
    "save_utterances": true
  }

Very hacky, but if you grab the file with the latest timestamp you should theoretically get the last utterance.

I can’t think of a way of doing it properly without modifying mycroft-core or modifying an STT plugin

Yeah awesome thanks a lot, that’s perfect - here’s another question(I can ask another question in the forum if you’d rather).

I want 1 Mycroft instance, now our internal tool is called Kane, I use it to transfer windows, data, displays across our iPhones, TV screens, laptops in the offices “Kane give me my laptop on this device” - kane has facial and voice recognition so he knows what “this” is because he knows where you’re standing.

Now if I’m on my mobile and I want to send a voice clip to “The” Kane not “A” Kane(installed on mobile) is there a way to do that also? Basically, can I submit an audio recording rather than using a mic as input?

Thanks

Marc

Not an easy way that I can think of off the top of my head…

Your options are probably to either hack the speech service in core to do what you want. Or pipe the audio stream from your recording and set that as the audio input source - essentially emulate a mic.

Others might have different ideas though?

Hmm yeah I bet I could pretend to be the mic somehow, I’ll investigate that while we see if anyone has any other ideas, thanks a lot gez!

1 Like

Bumping an older thread here but found it, will update in a while when I figure out the integration steps but -https://voicemeeter.com/vban-talkie-use-cases/

Might not quite give the level of integration I’m after so we might need to implement our own VBAN emitter - but in short problem solved.