

Also Martin Hairer is incredibly based besides having a big noggin. He gave this nice talk 2 months ago if any peeps want to see what he thinks comes next for math.
Your boi Ric, heir to the Big Muffin 69 family fortune
Recently left my job as a freelance paper boi to pursue my BFA (degree in Big Foot Alignment). Currently imagining positive futures with Big Foot governance in a post-Big Foot society.




This was a very nice problem set. Some were minor alterations to thms in literature but ranged up to problems that were quite involved. It appears that OAI got about 5 (possibly 6) of them but even then, this was accomplished with expert feedback to the model, which is quite different from the models just 1 shotting them on their own.
But I think this is what makes it so well done! A 0/10 or a 10/10 ofc gives very little info, a middling score that they admit they put a shit ton of effort into and tried to coax the right answers out of the models via hints says a lot about how much these systems can currently help prove lemmata.
Side note: I asked a FB friend of mine at one of the math + ai startups if they attempted the problems and he said “they had more pressing issues this week they couldn’t be pulled away from” (no comment, :P I want to stay friends with them)
The lack of similar attempts released by big companies like Google or Anth or X should also be a big red flag that their results weren’t up to snuff enough to even release.


“(((We’re))) never beating the allegations, are we?” -my wife



Gentlemen, it’s been an honour sneering w/ you, but I think this is the top 🫡 . Nothing’s gonna surpass this (at least until FTX 2 drops)


hits blunt
What if we make an ai too based?


On one hand as a poor grad student in the past, I could imagine working for a truly repugnant corp. but like if you’ve already made millions from your stock options, wtf are you doing. Idk, i really thought they’d have some shame over it, but they said shit like “our customers really like our deliverables” and i just fucking left with my wife


I have family working there, who told me during the holidays, “Current leadership makes me uncomfortable, but money is good”
Every impression I had of them completely shattered, cannot fathom that level of selling out exists in people I thought I knew.
As a bonus, their former partner was a former employee who became a whistleblower and has now gone full howard hughes


Without doxxing, my job has a contract with nvidia and my boss said we are doing it to make agi. Can i build a little of a torment nexus as a treat? Ty and bless


Shit like this ^ makes me feel insane when otherwise reputable experts start talking about llms taking over


In b4 METR drops the next shoddy study and the promptfondlers go wild


Man, it just feels embarrassing at this point. Like I couldn’t fathom writing this shit. It’s 2026, we have ai capable of getting imo gold, acing the putnam, winning coding competitions… but at this point it should be extremely obvious these systems are completely devoid of agency?? They just sit there kek it’s like being worried about stockfish going rogue



If trump gets back in office, Scott will be dead within the year.


Man I remember an ep of this from when i was little like a fever dream


Well to be fair, he thinks your estimate is too low :( (as do i)


Fuck bro that sucks


Another massive win for Dan H. Safety advisor to elon and xai


Ohoho, a beautiful lw begging post on this of all days?

AcerFur (who is quoted in the article) tried them himself and said he got similar answers with a couple guiding prompts on gpt 5.3 and that he was “disappointed”
That said, AcerFur is kind of the goat at this kind of thing 🦊==🐐