Hey all,
I plan to use the new Indigo Amazon Echo plugin soon (the one that mimics a hue bridge), but the functionality I REALLY want is to capture the actual text that Echo receives, even if Amazon doesn't know what to do with it.
I'm blown away by the speech recognition of Echo, really blown away, I just want access to the raw text so I can parse it on my own and not rely on somebody else's language processing algorithms.
Anybody with thoughts on how best to do this? I've seen several neat but not really elegant hacks that do this:
http://forum.universal-devices.com/topi ... -variable/
Above captures the graphics from the browser of a Windows machine, then preforms OCR, and sends it via an Android. Not preferable as I don't want a dedicated windows machine or Android.
https://hackaday.com/tag/amazon-echo/
Reliant on IFTTT, Adafruit, and an Arduino. Too many jumps, and too many services that get access to things spoken aloud in my house. (Amazon alone is bad enough)
viewtopic.php?f=65&t=15374
Jay's plugin, though it doesn't (to my knowledge) grant access to the raw command
http://blog.zfeldman.com/2014-12-28-usi ... mperature/
The most elegant, but pretty high level. Involves having a machine open a browser and capture the text from the raw html. A little over my head given reliance on Ruby (though I did have a Siri Proxy up), Linux, and Sinatra. Also invloves having to say "Stop" after every command, for reasons unclear.
My current thinking is to use Python (or even Applescript) to log me in to Amazon's Alexa page, then monitor the HTML portion that has the last received command (it is always in the same section). I've looked into basic web-scraping tools and coding, and I am trying to figure out a way to have this run in the background (ideally) of my host mac, which runs Indigo Server.
Once I can get text into my own variable, I can code things based on my typical needs.
Example: "Alexa, Netflix"
- Amazon doesn't do anything with this command, but stores it in a variable
- Netflix is only associated with apple TV in my setup, so it would trigger the following Macro (currently ssigned to a button on my Control Page)
- turn on receiver, TV
- change input on receiver to one used for Apple TV
- execute "up" "down" "left" "right" commands on the Apple TV, selecting Netflix
- execute the "enter" command on Apple TV
(I use an IP2IR and Perry the Cynic's amazing plugin for this)
- my current language processing, written by me in python, strips irrelevant words, infers context, and executes Indigo actions via REST calls
So, any thoughts? Is the functionality already IN the hue plugin?