Using Mycroft without Mycroft's speech recognition

Kaarel · July 29, 2017, 11:03am

Is there a way to send commands to Mycroft via HTTP queries that contain the command in natural language and the identifier of that language, something like:

http://192.168.0.123:8000?q=set+my+location+to+New+York&lang=en-US
http://192.168.0.123:8000?q=whats+the+weather+like&lang=en-US

This way there is no need for Mycroft’s own speech recognition / audio recording capability, and one would just use its text-to-speech, dialog management and query interpretation/execution capabilities. One could even disconnect Mycroft’s microphone (for increased privacy).

This makes it simpler to integrate certain custom solutions. E.g. tapping on a smartwatch triggers speech recognition on the watch, and a custom app on the watch converts the transcription into the above HTTP query. This can improve usability because there is no need for the wake word, and speech recognition is more reliable because the position of the mouth relative to the microphone is always the same.
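
As a rough illustration of the client side, here is a minimal Python sketch of turning a transcription into a query URL of the shape shown above. The function name `build_query_url` is made up for this example, and the `q` / `lang` parameters are simply the ones proposed in this post, not an existing Mycroft API.

```python
from urllib.parse import urlencode


def build_query_url(base_url, transcription, lang="en-US"):
    """Return base_url with the transcription and language as query parameters.

    Hypothetical helper: the "q" and "lang" parameter names follow the
    proposal in this thread, not a documented Mycroft interface.
    """
    # urlencode escapes spaces as '+' and percent-encodes other unsafe characters.
    return base_url + "?" + urlencode({"q": transcription, "lang": lang})


if __name__ == "__main__":
    print(build_query_url("http://192.168.0.123:8000", "set my location to New York"))
```

Encoding the text with `urlencode` rather than joining the words with `+` by hand means apostrophes and non-ASCII characters in the transcription are escaped as well.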

Some custom solutions would prefer the query to result in some response (as a JSON structure), e.g.

http://192.168.0.123:8000?q=whats+the+weather+like&lang=en-US&respond-with=json

would deliver:

{ "response": "it's sunny", "lang": "en-US" }

Ideally the user should be able to define custom JSON templates which Mycroft would fill with the content of the “response” and “lang”.
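
One way such templates could work, sketched in Python under assumptions: the placeholder strings `$response` and `$lang` and the function `fill_template` are invented for this illustration. Nothing like this is known to exist in Mycroft itself.

```python
import json


def fill_template(template, response, lang):
    """Recursively replace the placeholder strings in a parsed JSON template.

    "$response" and "$lang" are hypothetical placeholder names chosen
    for this sketch.
    """
    values = {"$response": response, "$lang": lang}
    if isinstance(template, dict):
        return {key: fill_template(value, response, lang) for key, value in template.items()}
    if isinstance(template, list):
        return [fill_template(item, response, lang) for item in template]
    if isinstance(template, str):
        # Only strings that are exactly a placeholder are substituted.
        return values.get(template, template)
    return template


if __name__ == "__main__":
    # A caller-specific template, e.g. supplied by the app that sends the query.
    template = json.loads('{"speech": {"text": "$response", "locale": "$lang"}}')
    print(json.dumps(fill_template(template, "it's sunny", "en-US")))
```

Substituting on the parsed structure instead of on the raw template text keeps the output valid JSON even when the response contains quotes or backslashes.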

This way both the STT and TTS are decoupled from the system.

It seems like it would be trivial to implement, so maybe something like this exists already?

baconator · August 1, 2017, 6:45am

On picroft there’s say_to_mycroft which does what you’re looking for. It’s basically a wrapper for say_command.py (https://github.com/MycroftAI/enclosure-picroft/blob/master/home/pi/say_command.py).
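
For context, a script like this presumably works by putting an utterance message on Mycroft's message bus. The Python sketch below only builds such a message. The message type `recognizer_loop:utterance` and the `utterances` / `lang` fields are assumptions based on the usual Mycroft bus conventions and have not been checked against say_command.py; `make_utterance_message` is a hypothetical name.

```python
import json


def make_utterance_message(text, lang="en-us"):
    """Build the JSON text of an utterance message for the message bus.

    Assumption: the bus accepts messages of type "recognizer_loop:utterance"
    whose data holds a list of candidate utterances and a language code.
    """
    return json.dumps({
        "type": "recognizer_loop:utterance",
        "data": {"utterances": [text], "lang": lang},
        "context": {},
    })


if __name__ == "__main__":
    print(make_utterance_message("tell me a joke"))
```

The resulting string would then be sent over the bus's websocket connection; that part is left out here because the address and client library depend on the installation.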

Kaarel · August 3, 2017, 11:27pm

Thanks! I’ve made a hackish wrapper around the say_command.py script that more or less does what I need:


https://gist.github.com/Kaljurand/df635ed92f1c6fb5c3a91231d13d4ade

my-server.py

#!/usr/bin/env python
# -*- coding: utf-8 -*-

# Simple web server that allows talking to Mycroft via HTTP queries like
# curl http://192.168.0.23:8000?q=tell+me+a+joke
# The response is a JSON object. It could be something general or something
# caller specific. (The examples are callbacks for K6nele.)
#
# Installation:
# 1. Allow access to the HTTP port: sudo ufw allow 8000

This file has been truncated.

One desired extension would be to get the response from Mycroft as plain text (the string that ends up in the log under “SpeechClient - INFO - Speak”), so that it can be played with a non-Mycroft TTS.
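
A possible approach, sketched in Python: listen on the message bus and pick out the text of the messages that are about to be spoken. The sketch assumes that these are JSON messages of type `speak` with the text under `data.utterance`; that layout is an assumption and is not confirmed anywhere in this thread. `spoken_text` is a hypothetical helper.

```python
import json


def spoken_text(raw_message):
    """Return the utterance of a "speak" message, or None for anything else.

    Assumption: spoken responses travel over the bus as
    {"type": "speak", "data": {"utterance": "..."}}.
    """
    try:
        message = json.loads(raw_message)
    except ValueError:
        # Not valid JSON, so not a bus message we can interpret.
        return None
    if not isinstance(message, dict) or message.get("type") != "speak":
        return None
    data = message.get("data")
    if not isinstance(data, dict):
        return None
    return data.get("utterance")


if __name__ == "__main__":
    print(spoken_text('{"type": "speak", "data": {"utterance": "It is sunny"}}'))
```

A wrapper server could call this on every message it receives from the bus and return the first non-None result as the plain-text response.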

Btw, a bug report: when I ask “How old is Roger Federer”, the response (in version v0.8.19) is:
“Sorry, I don’t know how is Rodger Federer old”.

baconator · August 8, 2017, 7:27am

Please file bugs as issues on GitHub. Include copious documentation and logs if possible.