LocalLLaMA

2545 readers

14 users here now

Community to discuss about LLaMA, the large language model created by Meta AI.

This is intended to be a replacement for r/LocalLLaMA on Reddit.

founded 2 years ago

MODERATORS

submitted 2 years ago by [email protected] to c/localllama

16 comments fedilink hide all child comments

For example, does a 13B parameter model at 2_K quantiation perform worse than a 7B parameter model at 8bit or 16bit?

you are viewing a single comment's thread
view the rest of the comments

[–] noneabove1182 2 points 2 years ago

These are good sources, to add one more, the GPTQ paper talks a lot about perplexity at several quantization and model sizes: