Harmful biases in large language models (LLMs) make AI less trustworthy and less secure. Auditing LLMs for bias can help identify potential solutions and inform better guardrails that make AI safer. In this podcast, Katie Robinson and Violet Turri, researchers in the SEI’s AI Division, discuss their recent work using role-playing game scenarios to identify biases in LLMs.
Publisher Statement
Audiovisual published 2024 via Software Engineering Institute, Carnegie Mellon University