Throwaway4669332255

joined 1 year ago
[–] [email protected] 5 points 4 weeks ago (14 children)

How does the Nemo 12B compare to the Llama 3.1 8B?

[–] [email protected] 2 points 5 months ago

Apparently I am an idiot and read the wrong paper. The previous paper described its results as "comparable with the 8-bit models".

https://huggingface.co/papers/2310.11453

[–] [email protected] 1 points 5 months ago (2 children)

They said theirs is "comparable with the 8-bit models". It's all tradeoffs. It isn't clear to me where you should allocate your compute/memory budget. I've noticed that full 7B 16-bit models often produce better results for me than some much larger quantized models. It will be interesting to find the sweet spot.
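For the memory side of that budget, here is a rough back-of-the-envelope sketch. It assumes weight storage is simply parameters times bits per weight, and it ignores the KV cache, activations, and the per-block scale overhead that real quantization formats add. The function names are made up for illustration and don't come from any library.

```python
def weight_gib(params_billion: float, bits_per_weight: float) -> float:
    """Approximate size of the weights alone, in GiB (assumption: no format overhead)."""
    total_bits = params_billion * 1e9 * bits_per_weight
    return total_bits / 8 / 2**30


def params_for_budget(budget_gib: float, bits_per_weight: float) -> float:
    """Parameter count (in billions) that fits in a given weight budget."""
    return budget_gib * 2**30 * 8 / bits_per_weight / 1e9


# Use a 7B model at 16 bits as the reference memory budget.
budget = weight_gib(7, 16)

# Show how many parameters the same budget buys at lower precision.
for bits in (16, 8, 4):
    fit = params_for_budget(budget, bits)
    print(f"{bits:>2}-bit: ~{fit:.0f}B parameters in {budget:.1f} GiB")
```

So the roughly 13 GiB that a 16-bit 7B model needs would hold about 14B parameters at 8 bits or 28B at 4 bits. That only says what fits, not which one gives better output.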

[–] [email protected] 2 points 5 months ago (4 children)

So are more bits less important than more parameters? Would a higher parameter count or a higher bit count matter more if the models ended up the same size?

[–] [email protected] 1 points 7 months ago

I'm so glad I work for a medium-small company. We moved to a smaller office and are only required to go in twice a month.

18
Spaghetti Again (www.youtube.com)
[–] [email protected] 1 points 7 months ago

Thank you! I had no idea this existed.

[–] [email protected] 7 points 7 months ago

He's opinionated but a pretty good science communicator.

[–] [email protected] 81 points 8 months ago (8 children)

Sony chose not to offer refunds. Sony knew the contract when they agreed to sell the content. When something gets pulled from Steam, I can still download and install it.

[–] [email protected] 50 points 8 months ago

I got banned because I said "lemmy dot world" in reply to someone who said they wished reddit had an alternative.

[–] [email protected] 32 points 8 months ago (5 children)

It's a disturbing trend and more reason to own physical copies.
