this post was submitted on 02 Feb 2025
212 points (96.9% liked)

United States | News & Politics

2211 readers
992 users here now

Welcome to [email protected], where you can share and converse about the different things happening all over/about the United States.

If you’re interested in participating, please subscribe.

Rules

Be respectful and civil. No racism/bigotry/hateful speech.

Post anything related to the United States.

founded 2 years ago
MODERATORS
you are viewing a single comment's thread
view the rest of the comments
[–] [email protected] 1 points 1 day ago

There are finetunes of Llama, Qwen, etc., based on DeepSeek that implement the same pre-response thinking logic, but they are ultimately still the smaller models with some tuning. If you want to run locally and don't have tens of thousands to throw at datacenter-scale GPUs, those are your best option, but they differ from what you'd get in the Deepseek app.