ylai@lemmy.ml to AI@lemmy.mlEnglish · 2 years agoChatGPT gets code questions wrong 52% of the timewww.theregister.comexternal-linkmessage-square10fedilinkarrow-up1164arrow-down15
arrow-up1159arrow-down1external-linkChatGPT gets code questions wrong 52% of the timewww.theregister.comylai@lemmy.ml to AI@lemmy.mlEnglish · 2 years agomessage-square10fedilink
minus-squareSirGolanlinkfedilinkarrow-up5arrow-down3·2 years agoGPT4 with reflexion prompting gets 90% correct (for HumanEval coding benchmark). The paper this is based on is misleading at best.
GPT4 with reflexion prompting gets 90% correct (for HumanEval coding benchmark). The paper this is based on is misleading at best.