But can you, with reasonable latency, run speech to text or text to speech?
I've got a couple of Frigate cameras with object detection, plus STT and TTS running, and it's all using about 2.8 GB of VRAM. I might actually just bump up the quality on the STT and/or TTS...