this post was submitted on 20 Jan 2025
68 points (100.0% liked)

TechTakes

1591 readers
73 users here now

Big brain tech dude got yet another clueless take over at HackerNews etc? Here's the place to vent. Orange site, VC foolishness, all welcome.

This is not debate club. Unless it’s amusing debate.

For actually-good tech, you want our NotAwfulTech community

founded 2 years ago
MODERATORS
you are viewing a single comment's thread
view the rest of the comments
[–] [email protected] 16 points 2 weeks ago (1 children)

I fucking knew it!!! I don't even know why I feel so vindicated for calling out such an obvious fraud tbh. anyone, besides possibly a HN poster, could have seen it coming

[–] Naz 1 points 2 weeks ago

You were right.

I make open models from scratch and I've tested the corporate benchmark banks and some of the results they've gotten are extremely sus.

After the Volkswagen Dieselgate Scandal, I've taken metrics not reported by independent auditors with a chonking peanut scooper of salt.

For the curious:

My best results so far have been like 74% on HumanEval with a 405B, zero-shot induction.