this post was submitted on 19 Apr 2025
6 points (87.5% liked)
Large Language Models
208 readers
3 users here now
A place to discuss large language models.
Rules
- Please tag [not libre software] and [never on-device] services as such (those not green in the License column here).
- Be useful to others
Resources
github.com/ollama/ollama
github.com/open-webui/open-webui
github.com/Aider-AI/aider
wikipedia.org/wiki/List_of_large_language_models
founded 2 years ago
MODERATORS
you are viewing a single comment's thread
view the rest of the comments
view the rest of the comments
What I'm wondering is, is there a standard format for instructing models to give outputs using the tool? They're specifically trained to be better at doing this right
Ah for training a new model from scratch? Yes there is a specific format, you can look at the ollama source code or any of the big models that accept tool use like llama4 for the format both to and from a model. However unless you're secretly a billionaire I doubt you could compete with these pertained models in tool calling.
Ollama's model list on their website has a filter for tool using models. To be honest all open source models suck at tool use compared to the big players, openai, anthropic, google. To be fair I don't have any hardware capable of running deepseeks newest models so I haven't tested them for tool use.
No I meant like, for prompting tool supporting models to be aware of the functions you are making available to it. I've tried arbitrary prompts to tell it to do this and it sort of works but yeah the models I've tried don't seem very good at that, was mainly wondering if using a specific format in the prompt would improve performance
I think what you are looking for is MCP - Model Context Protocol. It's an effort to standardize what you are talking about.