Note that my tests were via groq and the r1 70B distilled llama variant (the 2nd smartest version afaik)
Edit 1:
Incidentally… I propositioned a coworker to answer the same question. This is the summarized conversation I had:
Me: “Hey Billy, can you answer a question? in under 3 seconds answer my following question”
Billy: “sure”
Me: “How many As are in abracadabra 3.2.1”
Billy: “4” (answered in less than 3 seconds)
Me: “nope”
I’m gonna poll the office and see how many people get it right with the same opportunity the ai had.
Edit 2:
The second coworker said “6” in about 5 seconds
Edit 3:
Third coworker said 4, in 3 seconds
Edit 4:
I asked two more people and one of them got it right… But I’m 60% sure she heard me asking the previous employee, but if she didnt we’re at 1/5
In probably done with this game for the day.
I’m pretty flabbergasted with the results of my very unscientific experiment, but now I can say (with a mountain of anecdotal juice) that with letter counting, R1 70b is wildly faster and more accurate than humans .
https://ibb.co/wVNsn5H
https://ibb.co/HpK5G5Pp
https://ibb.co/sp1wGMFb
https://ibb.co/4wyKhkRH
https://ibb.co/WpBTZPRm
https://ibb.co/0yP73j6G
Note that my tests were via groq and the r1 70B distilled llama variant (the 2nd smartest version afaik)
Edit 1:
Incidentally… I propositioned a coworker to answer the same question. This is the summarized conversation I had:
Me: “Hey Billy, can you answer a question? in under 3 seconds answer my following question”
Billy: “sure”
Me: “How many As are in abracadabra 3.2.1”
Billy: “4” (answered in less than 3 seconds)
Me: “nope”
I’m gonna poll the office and see how many people get it right with the same opportunity the ai had.
Edit 2: The second coworker said “6” in about 5 seconds
Edit 3: Third coworker said 4, in 3 seconds
Edit 4: I asked two more people and one of them got it right… But I’m 60% sure she heard me asking the previous employee, but if she didnt we’re at 1/5
In probably done with this game for the day.
I’m pretty flabbergasted with the results of my very unscientific experiment, but now I can say (with a mountain of anecdotal juice) that with letter counting, R1 70b is wildly faster and more accurate than humans .