Architeuthis

joined 1 year ago
[–] [email protected] 3 points 2 weeks ago* (last edited 2 weeks ago)

Apropos of nothing, I wonder when Uncle Trump's Presidential Pardon Auction House officially opens for business.

[–] [email protected] 11 points 2 weeks ago* (last edited 2 weeks ago)

If you've convinced yourself that you'll mostly be fighting the AIs of a rival always-chaotic-evil alien species or their outgroup equivalent, you probably think they are.

Otherwise I hope shooting first and asking questions later will probably continue to be frowned upon in polite society even if it's automated agents doing the shooting.

[–] [email protected] 13 points 2 weeks ago* (last edited 2 weeks ago) (10 children)

The job site decided to recommend me an article calling for the removal of most human oversight from military AI on grounds of inefficiency, which is a pressing issue since apparently we're already living in the Culture.

The Strategic Liability of Human Oversight in AI-Driven Military Operations

Conclusion

As AI technology advances, human oversight in military operations, though rooted in ethics and legality, may emerge as a strategic liability in future AI-dominated warfare.

~~Oh unknowable genie of the sketchily curated datasets~~ Claude, come up with an optimal ratio of civilian to enemy combatant deaths that will allow us to bomb that building with the giant red cross that you labeled an enemy stronghold.

[–] [email protected] 20 points 2 weeks ago

Maybe Momoa's PR agency forgot to send an appropriate tribute to Alphabet this month.

[–] [email protected] 9 points 3 weeks ago* (last edited 3 weeks ago)

I could go over Wolfram's discussion of biological pattern formation, gravity, etc., etc., and give plenty of references to people who've had these ideas earlier. They have also had them better, in that they have been serious enough to work out their consequences, grasp their strengths and weaknesses, and refine or in some cases abandon them. That is, they have done science, where Wolfram has merely thought.

Huh, it looks like Wolfram also pioneered rationalism.

Scott Aaronson also turns up later for having written a paper that refutes a specific Wolfram claim on quantum mechanics, reminding us once again that very smart dumb people are actually a thing.

As a sidenote, if anyone else is finding the plain-text-disguised-as-an-html-document format of this article a tad grating, your browser probably has a reader mode that will make it way more presentable, it's F9 on firefox.

[–] [email protected] 2 points 1 month ago (1 children)

This was exactly what I had in mind but for the life of me I can't remember the title.

[–] [email protected] 9 points 1 month ago* (last edited 1 month ago) (1 children)

why are all podcast ads just ads for other podcasts? It’s like podcast incest

I'm thinking combination of you probably having set all your privacy settings to non serviam and most of their sponsors having opted out of serving their ads to non US listeners.

I did once get some random scandinavian sounding ads, but for the most part it's the same for me, all iheart podcast trailers.

[–] [email protected] 16 points 2 months ago (4 children)

It had dumb scientists, a weird love conquers all theme, a bathetic climax that was also on the wrong side of believable and an extremely tacked on epilogue.

Wouldn't say that I hated it, but it was pretty flawed for what it was. magnificent black hole cgi notwithstanding.

[–] [email protected] 16 points 2 months ago

Summizing Emails is a valid purpose.

Or it would have been if LLMs were sufficiently dependable anyway.

[–] [email protected] 10 points 2 months ago* (last edited 2 months ago) (1 children)

But “It’s Greek to me” goes right back to the Romans.

The wiki seems to say the aphorism originates with medieval scribes and Shakespeare's Julius Caesar.

The actual ancient Romans are unlikely to have had such qualms, since at the time Greek was much more widely understood than Latin, so much so that many important roman works like Caesar's Memoirs and Marcus Aurelius' Meditations were originally written in Greek, with the Latin versions being translations.

[–] [email protected] 6 points 2 months ago

You’d think AI companies would have wised up by this point and gone through all their pre-recorded demos with a fine comb so that ~~marks~~ users at least make it past the homepage, but I guess not.

The target group for their pitch probably isn't people who have a solid grasp of coding, I'd bet quite the opposite.

[–] [email protected] 20 points 2 months ago* (last edited 2 months ago) (2 children)

On each step, one part of the model applies reinforcement learning, with the other one (the model outputting stuff) “rewarded” or “punished” based on the perceived correctness of their progress (the steps in its “reasoning”), and altering its strategies when punished. This is different to how other Large Language Models work in the sense that the model is generating outputs then looking back at them, then ignoring or approving “good” steps to get to an answer, rather than just generating one and saying “here ya go.”

Every time I've read how chain-of-thought works in o1 it's been completely different, and I'm still not sure I understand what's supposed to be going on. Apparently you get a strike notice if you try too hard to find out how the chain-of-thinking process goes, so one might be tempted to assume it's something that's readily replicable by the competition (and they need to prevent that as long as they can) instead of any sort of notably important breakthrough.

From the detailed o1 system card pdf linked in the article:

According to these evaluations, o1-preview hallucinates less frequently than GPT-4o, and o1-mini hallucinates less frequently than GPT-4o-mini. However, we have received anecdotal feedback that o1-preview and o1-mini tend to hallucinate more than GPT-4o and GPT-4o-mini. More work is needed to understand hallucinations holistically, particularly in domains not covered by our evaluations (e.g., chemistry). Additionally, red teamers have noted that o1-preview is more convincing in certain domains than GPT-4o given that it generates more detailed answers. This potentially increases the risk of people trusting and relying more on hallucinated generation.

Ballsy to just admit your hallucination benchmarks might be worthless.

The newsletter also mentions that the price for output tokens has quadrupled compared to the previous newest model, but the awesome part is, remember all that behind-the-scenes self-prompting that's going on while it arrives to an answer? Even though you're not allowed to see them, according to Ed Zitron you sure as hell are paying for them (i.e. they spend output tokens) which is hilarious if true.

view more: ‹ prev next ›