Do you have a write up of the tech stack and setup? Or willing to give the gist ...

Aeroi · on May 13, 2025

I also ran across an interesting robot toy demo today that had voice built in. it was whimsical and seemed like it was aimed towards primary education and kids. Someone here might know the name.

stavros · on May 13, 2025

You can use Ollama or LM Studio, both in API mode, to return the responses. I believe they offer audio support, but I'm not entirely sure.

However, if you're looking for instruction following (like an agent), I've tried to implement my own agent and have lost faith. Even GPT-4.1 will regularly gaslight me that no, it definitely ran the tool call to add the event to my calendar, when it just didn't. I can't get any more adherence out of it.

tomp · on May 13, 2025

We're definitely there, there's just no "ready-made" apps yet. But the technology is possible, go to e.g. vapi.ai to test it.

cloudking · on May 13, 2025

Check out https://livekit.io/

keyle · on May 14, 2025

All this lead to, is paying or using APIs and more paying. That's not what I was asking for.

Aeroi · on May 13, 2025

yeah i made a post on here, but the algo sent it to the gulag abyss.

https://news.ycombinator.com/item?id=43926673

keyle · on May 13, 2025

That's a good product site but it doesn't help me in anyway...