Description
Large language model (LLM) AI assistants are all the rage, and the trend is evolving quickly. Given how openly available LLM integration is for, well, anything, really, it would be very cool to have an assistant built into KDE that does things for the end-user, so they don't have to click or touch the screen. Want to launch Steam? Speak to KDE and (literally) tell it to launch Steam. Want to change a wallpaper? The KDE assistant should, at a minimum, launch settings and, at best, adjust those settings for the end-user, without them ever having to touch the mouse, keyboard, or screen.
Why does it need to be done?
This would be extremely helpful from an accessibility perspective for people who have difficulty using a mouse, a keyboard, or a screen. Additionally, KDE's main "claim to fame" (can't believe I just said that) is its configurability, and as that keeps expanding with new features (and their configurations), it's only going to get more complex. Having an LLM do the work for the end-user would make it much easier to sift through the various configurations.
How does it connect to KDE's vision of "A world in which everyone has control over their digital life and enjoys freedom and privacy"?
It puts the vast power of configurability into the (literal) voice of the end-user, so they can tell KDE what they want and KDE can help them do it.
How would it affect different parts of KDE?
KDE would need a built-in system that accepts speech input from a microphone device. Phonon would be the logical place to put this additional functionality, although the separate AI/LLM subsystem that interprets speech and integrates into the system would be net new. That subsystem would need to integrate deeply with existing functionality, including prompting for sudo privileges.
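To make the flow a little more concrete, here is a rough sketch of the "interpret, then act" step, assuming the speech has already been transcribed to text. Everything in it is hypothetical: the `Intent` type, the `interpret` and `plan` functions, and the `LAUNCHABLE` allowlist are illustration only, not existing KDE or Phonon APIs, and the keyword matching is just a stand-in for whatever the LLM would actually do. It is in Python purely for brevity.

```python
from dataclasses import dataclass
from typing import Optional


@dataclass(frozen=True)
class Intent:
    action: str  # what the user wants done, e.g. "launch"
    target: str  # what to do it to, e.g. "steam"


# Hypothetical allowlist mapping spoken names to commands (an assumption,
# not a KDE API). Anything not listed here is never executed.
LAUNCHABLE = {
    "steam": ["steam"],
    "settings": ["systemsettings"],
}


def interpret(transcript: str) -> Optional[Intent]:
    """Stand-in for the LLM step: turn transcribed speech into an intent."""
    words = transcript.lower().strip(" .!?").split()
    if len(words) >= 2 and words[0] in ("launch", "open", "start"):
        return Intent("launch", " ".join(words[1:]))
    if "wallpaper" in words:
        # The "at least" case from the description: just open settings.
        return Intent("launch", "settings")
    return None


def plan(intent: Optional[Intent]) -> Optional[list]:
    """Return the command to run, or None if the request is not allowed."""
    if intent is None or intent.action != "launch":
        return None
    return LAUNCHABLE.get(intent.target)


print(plan(interpret("Launch Steam.")))
print(plan(interpret("Change my wallpaper")))
```

Keeping the interpretation step separate from the step that decides what may run means the model only ever proposes an intent. Anything unrecognised, or anything that would need elevated privileges, can be refused or handed to a normal confirmation prompt instead of being executed directly.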
What it will take
A LOT of time, probably a LOT of money, and a LOT of knowledge that I don't have.
How we know we succeeded
When you can tell KDE what you want to do, with your own voice, and it does it for you. And then watching as end-users bow before you, sacrificing little gnomes, little apples, and little colored "windows" to your greatness. ;)
Champions
The team is:
- XXX
- XXX
- XXX
I am willing to put work into this
- Timothy Gravier, Jr. -- although I haven't actually coded in C++ in years, have barely even seen the Qt libraries, and don't know how to integrate an LLM. But it can be done, right? People do it! Oh, and I can end-user-test the crap out of it.
I am interested
- Timothy Gravier, Jr.