11-08-2020, 11:33 AM
No server is required, anywhere. The only outside service required at this time is an STT (speech-to-text) converter. Currently the most reliable is Google. The local STT (CMUSphinx) is only usable by experts. While using Google for STT is not ideal, it is not as big a privacy leak as it appears. All Google does in this case is recognize the words; it does not know what actions are taken as a result of them. This is very unlike the proprietary VAs, which do both voice recognition and application execution. The framework is easily extensible, and there are multiple STTs and TTSs available.
For example, I have six URLs for internet radio.
Sheila, play radio alberta
Google will see the words "play radio alberta" but has no idea which radio station I am listening to.
I am currently building a neuron to make phone calls via voice command.
Sheila, call Bob
Since Google has no access to my contacts, Google does not know who I am calling.
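To make the split concrete, here is a minimal sketch of the local half, written in Python. It is not the framework's actual code: the function name resolve, the two lookup tables, the URLs and the phone number are all made up for illustration. The point is only that the STT service hands back plain text, and the mapping from that text to a stream URL or a phone number happens on my machine.

```python
import re

# Hypothetical local data for illustration; neither table is sent anywhere.
RADIO_STATIONS = {
    "alberta": "http://radio.example.invalid/alberta.mp3",
    "jazz": "http://radio.example.invalid/jazz.mp3",
}
CONTACTS = {
    "bob": "+1-555-0100",
}

WAKE_WORD = "sheila"


def resolve(transcript):
    """Map the text returned by the STT service to a local action.

    Returns an (action, target) tuple, or None if nothing matches.
    """
    # Lowercase, replace punctuation with spaces, split into words.
    words = re.sub(r"[^\w\s]", " ", transcript.lower()).split()
    if not words or words[0] != WAKE_WORD:
        return None
    command = words[1:]
    # "play radio <station>": the URL lookup is purely local.
    if command[:2] == ["play", "radio"] and len(command) == 3:
        url = RADIO_STATIONS.get(command[2])
        return ("play", url) if url else None
    # "call <name>": the contact lookup is purely local.
    if command[:1] == ["call"] and len(command) == 2:
        number = CONTACTS.get(command[1])
        return ("call", number) if number else None
    return None
```

Calling resolve("Sheila, call Bob") looks the name up in the local table and returns the number; the STT provider only ever saw the audio and the words.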
I use the program on my home media machine as an Alexa/Google/Siri alternative. It is in a state very similar to the PinePhone: it works, but it takes effort to get it working.
HTH
LF