schizoidman@lemm.ee to Technology@beehaw.org · English · 1 day ago
Cutting-edge Chinese “reasoning” model rivals OpenAI o1—and it’s free to download (arstechnica.com)
cross-posted to: artificial_intel@lemmy.ml, technology@lemmy.world, technology@lemmy.ml
jarfil@beehaw.org · 8 hours ago
So… when plugged into a system with the ability to access the Internet and/or execute local commands… will its reasoning look better or worse than the high level of deception shown by o1?
https://www.apolloresearch.ai/research/scheming-reasoning-evaluations