755
chef kiss (infosec.pub)
you are viewing a single comment's thread
view the rest of the comments
[-] Pantoffel@feddit.de 3 points 1 year ago

So what are you building? A browser STT interface for chatting with GPT and other LLMs?

[-] flossdaily@lemmy.world 7 points 1 year ago* (last edited 1 year ago)

I'm not ready to talk about it in detail. Even my boss doesn't know. But you're in the right ballpark.

I'm actually building a proof-of-concept prototype for what I want to work on... and I'm using a browser extension so that I can build it independently without anyone from the tech team being involved and slowing me down.

[-] Pantoffel@feddit.de 2 points 1 year ago

That sounds nice. I've been looking at serenade.ai and thought about extending their STT with an option to use another third-party STT engine. I would then like to extend their command engine with LLM command recognition. In my experience, maybe also with my pronunciation as a non-english speaker, their STT and command recognition really doesn't work that well.

[-] flossdaily@lemmy.world 3 points 1 year ago

Have you tried Whisper from OpenAI? It's the best I've ever seen. I'm curious how it would handle accents.

[-] Pantoffel@feddit.de 2 points 1 year ago

No, not yet. But thanks for the tip!

this post was submitted on 11 Aug 2023
755 points (93.8% liked)

Programmer Humor

32050 readers
1536 users here now

Post funny things about programming here! (Or just rant about your favourite programming language.)

Rules:

founded 5 years ago
MODERATORS