SpeechToTextCloud

joined 8 months ago
MODERATOR OF
[–] [email protected] 2 points 3 weeks ago

Yes, you put the app in /opt, no not in /bin or /usr/bin

 

I have just finished two new features for our transcription service:

One-click summaries: Get the gist of your transcripts in seconds. Translate transcripts in 50+ languages: Reach global audiences with ease.

9 minutes free. Have fun!

#SpeechToText #Transcription #Translation

[–] [email protected] 5 points 1 month ago

If you're fine with #libreboot too, ask https://mas.to/@libreleah

4
Live Transcription (www.speech-to-text.cloud)
 

I just finished the “Live Transcription from the microphone” feature.

9 minutes are free.

Have fun!

[–] [email protected] 12 points 3 months ago (1 children)

This comment shows why I like Lemmy more than Reddit. Nuanced, acknowledging when the other person has a point without just yelling at each other.

 

Bazzite comes ready to rock with Steam and Lutris pre-installed, HDR support, BORE CPU scheduler for smooth and responsive gameplay, and numerous community-developed tools for your gaming needs.

 

CRISPR has become a key method for genome modification. Researchers are now proposing a potentially even better approach.

 

You type "Once upon a time!!!!!!!!!!" and those exclamation marks are rendered to show the LLM generated text, using a tiny 30MB model

via https://simonwillison.net/2024/Jun/23/llama-ttf/

[–] [email protected] 3 points 4 months ago

Thanks for the clarification.

[–] [email protected] 21 points 4 months ago

They use stem cells derived from human skin

 

Large Language Models made from cells incoming...

[–] [email protected] 0 points 8 months ago (2 children)

Kind of ironic how they use an AI generated article image

 

Hello Entrepreneur community,

I'm Martin, and I recently launched a new tool that I'd like to share with the community. It's a website for speech-to-text transcription that allows users to upload an audio file and receive a transcription of the spoken text.

The platform is built using Python, FastAPI, and Traefik, and offers the following features:

  • Speech recognition using Whisper
  • High accuracy transcriptions
  • HTTP routing and SSL termination handled by Traefik
  • Web framework provided by FastAPI

I believe this tool can be useful for a variety of applications, such as transcribing podcasts, interviews, or any other audio content. The accuracy of the transcriptions is quite high (WER score of 4.5), and I am constantly working on improving it even further.

I am looking for feedback on the platform, as well as any potential use cases or applications that you can think of. I am also interested in collaborations or partnerships, so please don't hesitate to reach out if you are interested in working together.

If this tool has piqued your interest and you would like to learn more, I encourage you to check it out and leave a comment with your thoughts. Your feedback will help me improve the platform moving forward.

Thank you for taking the time to check out my project.