Taming AI Bots: Prevent LLMs from entering "bad" states using continuous guidance from the LLM ("is this good? bad?") to avoid bad states. (arxiv.org)
@goosetheM to math • 1 year ago • cross-posted to: mediocreatbest