grahamsz

joined 11 months ago

Looking for a self-hosted chatgpt-like tool with an api in c/[email protected]

[–] [email protected] 1 points 10 months ago

I can run VMWare's Open LLama 7B v2 Open Instruct on my laptop comfortably (though I have 64GB ram and 16GB VRAM) and my sense is that's it's probably somewhere between GPT2 and GPT3 in inference quality. It is, however, very slow. Even with my comparatively strong hardware, it's slow enough that I wouldn't want to use it in an interactive context (though it may be useful for background processing)

permalink
fedilink
source
context