Your boi Ric, heir to the Big Muffin 69 family fortune

Recently left my job as a freelance paper boi to pursue my BFA (degree in Big Foot Alignment). Currently imagining positive futures with Big Foot governance in a post-Big Foot society.

  • 1 Post
  • 107 Comments
Joined 8 months ago
cake
Cake day: July 1st, 2025

help-circle


  • This was a very nice problem set. Some were minor alterations to thms in literature but ranged up to problems that were quite involved. It appears that OAI got about 5 (possibly 6) of them but even then, this was accomplished with expert feedback to the model, which is quite different from the models just 1 shotting them on their own.

    But I think this is what makes it so well done! A 0/10 or a 10/10 ofc gives very little info, a middling score that they admit they put a shit ton of effort into and tried to coax the right answers out of the models via hints says a lot about how much these systems can currently help prove lemmata.

    Side note: I asked a FB friend of mine at one of the math + ai startups if they attempted the problems and he said “they had more pressing issues this week they couldnt be pulled away from” (no comment, :P I want to stay friends with them)

    The lack of similar attempts being released by big companies like Google or Anth or X also should be a big red flag that their attempts were not up to snuff of even attempting.