293 Comments - Last post 10 minutes ago by AndrewTheD
42 Comments - Last post 15 minutes ago by Moogal
11 Comments - Last post 27 minutes ago by BattleChaing
201 Comments - Last post 39 minutes ago by CulitoRiko7u7
35 Comments - Last post 45 minutes ago by Gamy7
1,275 Comments - Last post 2 hours ago by TandborsteN
350 Comments - Last post 3 hours ago by Serpy
62 Comments - Last post 9 minutes ago by VahidSlayerOfAll
232 Comments - Last post 10 minutes ago by HowDareYou
1,040 Comments - Last post 11 minutes ago by Kireato
254 Comments - Last post 13 minutes ago by Vin3
72 Comments - Last post 18 minutes ago by macgamer
4,168 Comments - Last post 33 minutes ago by yugimax
154 Comments - Last post 37 minutes ago by RobbyRatpoison
So, a mate of mine starts rambling about this project idea of his — keeping it vague for now 'cause he's gonna try and market it or something — but basically it’s gonna involve AI, voice, the works. Since I’ve been off work recovering from some delightful dental surgery (10/10 don’t recommend), he asked if I could whip up a basic offline AI to help with his prototype.
One week later, in between games and wrangling the kids, I’ve somehow ended up knee-deep in a full-on desktop AI assistant. I’m calling it Version 0.8 for now, with my “MVP” version being 1.0.
Right now it uses FFmpeg, Whisper, LLaMA3, and Coqui TTS. It handles both text and voice input/output, caches WAVs, convos, user settings, and has a few colour themes 'cause who doesn’t love a bit of flair. Currently working on per-conversation caching and trying to make convos reference each other — which is as fun as it sounds.
Also, the AI voice? Sounds like a half-baked call centre operator. Absolutely cooked. I’m adding more voice options soon so it stops sounding like a robo-Karen trying to upsell me internet plans.
Performance-wise, I’ve managed to take voice response from "go make a cuppa" times down to about 6–8 seconds, thanks to streaming chunked WAVs and throwing the GPU at it. Still not lightning, but hey, it’s no longer yelling into the void and waiting for enlightenment.
Anyway, point is — since I was putting together a train anyway, thought I’d ask: anyone got feature ideas? Already blown past what my mate expected, so I’ve got a pretty hefty roadmap going. But I’m all ears for wild suggestions, practical or ridiculous.
Here is your entry to a progressive train. Good Luck and Enjoy ^^
Just finalised the addition of allowing the creation of different conversations, user defined conversation titles, conversational tabbing, persistent / cached conversations and deleting conversations ^^ Currently the entire App is 755 Megabytes. Let's watch that expand >.<
Comment has been collapsed.