Skip to content
You are using an unsupported browser. For best results please use the latest versions of Chrome, Edge, Firefox or Safari.

SRI Seminar Series: Owain Evans, “Truthful language models and AI alignment”

In this talk, Evans will present recent work on defining and measuring “truthfulness” in the context of large language models, including their calibration, and their ability to forecast world events. These topics will be considered in relation to the reduction of epistemic harms from AI and the problem of value alignment in the context of artificial general intelligence.