Jaden Norman@lemmy.world to Technology@lemmy.worldEnglish · 1 month agoAI agents wrong ~70% of time: Carnegie Mellon studywww.theregister.comexternal-linkmessage-square267linkfedilinkarrow-up1971arrow-down120cross-posted to: technology@lemmy.mltechnology@beehaw.org
arrow-up1951arrow-down1external-linkAI agents wrong ~70% of time: Carnegie Mellon studywww.theregister.comJaden Norman@lemmy.world to Technology@lemmy.worldEnglish · 1 month agomessage-square267linkfedilinkcross-posted to: technology@lemmy.mltechnology@beehaw.org
minus-squareLog in | Sign up@lemmy.worldlinkfedilinkEnglisharrow-up1arrow-down1·1 month agoSo the chances of it being right ten times in a row are 2%.
minus-squareKnock_Knock_Lemmy_In@lemmy.worldlinkfedilinkEnglisharrow-up2·edit-21 month agoNo the chances of being wrong 10x in a row are 2%. So the chances of being right at least once are 98%.
minus-squareLog in | Sign up@lemmy.worldlinkfedilinkEnglisharrow-up2·1 month agoAh, my bad, you’re right, for being consistently correct, I should have done 0.3^10=0.0000059049 so the chances of it being right ten times in a row are less than one thousandth of a percent. No wonder I couldn’t get it to summarise my list of data right and it was always lying by the 7th row.
minus-squareKnock_Knock_Lemmy_In@lemmy.worldlinkfedilinkEnglisharrow-up1·1 month agoThat looks better. Even with a fair coin, 10 heads in a row is almost impossible. And if you are feeding the output back into a new instance of a model then the quality is highly likely to degrade.
minus-square𝕛𝕨𝕞-𝕕𝕖𝕧@lemmy.dbzer0.comlinkfedilinkEnglisharrow-up1·1 month agodon’t you dare understand the explicitly obvious reasons this technology can be useful and the essential differences between P and NP problems. why won’t you be angry >:(
About 0.02
So the chances of it being right ten times in a row are 2%.
No the chances of being wrong 10x in a row are 2%. So the chances of being right at least once are 98%.
Ah, my bad, you’re right, for being consistently correct, I should have done 0.3^10=0.0000059049
so the chances of it being right ten times in a row are less than one thousandth of a percent.
No wonder I couldn’t get it to summarise my list of data right and it was always lying by the 7th row.
That looks better. Even with a fair coin, 10 heads in a row is almost impossible.
And if you are feeding the output back into a new instance of a model then the quality is highly likely to degrade.
don’t you dare understand the explicitly obvious reasons this technology can be useful and the essential differences between P and NP problems. why won’t you be angry >:(