Artificial Intelligence▲ bullishImpact 7/10
DisaBench: A Participatory Evaluation Framework for Disability Harms in Language Models
cs.AI updates on arXiv.org·
✦AI Analysis
DisaBench introduces a new framework to evaluate disability-related harms in language models, addressing gaps in existing safety benchmarks. The initiative aims to enhance understanding and mitigation of subtle harms through a participatory approach involving people with disabilities and experts.
Key Topics
DisaBenchHugging Facelanguage modelsred teaming
Originally reported by cs.AI updates on arXiv.org. Read the full article ↗