Yes this makes sense but unfortunately is not yet possible, particularly in this broad context of “Any valid utterance and any other valid utterance”.
One challenge is being able to extract out multiple intents and the many possibilities that come with that. It seems simple at first - split the utterance on “and” then handle both halves as independent queries. However consider these examples:
“Start a timer for 5 minutes and another for 20 minutes”
In this case we need the context from one intent to properly parse the second half.
“Play Piano and I by Alicia Keys”
This time there is only one intent. Which shows that we need to check both the whole utterance as well as components.
These are just two quick examples off the top of my head but we need to look at what all of those edge cases might be. We’re also working to speed up Mycroft’s response times and checking multiple possible utterances simultaneously adds greater complexity to that.
How would you rate your programming experience, is it something you are keen to explore, or more looking for an existing solution? There might be others reading along interested in helping out?
If you’re not a programmer, thinking through what some of these challenging phrases might be is also extremely helpful for those who do have the coding skills.