Charikleia Moraitaki, Sarah Pan, Skyler Pulling, Gwendolyn Flusche, Kumail Alhamoud, Marzyeh Ghassemi
Create a library of negation-aware evaluation probes for vision-language models (VLMs) across languages. The tool will help developers identify and debug the 'affirmation bias' that current VLMs exhibit when handling negation in images or captions.
Suggested repo: not-a-vision
"Does your VLM struggle with 'not'?"
Estimated effort: 35h
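
A minimal sketch of what a single probe and a per-language metric could look like, assuming the model under test can be wrapped as a function that scores an image-caption pair. The names NegationProbe, affirmation_bias_rate, and negation_blind_score are hypothetical, and the toy scorer is a stand-in for illustration, not a real VLM:

```python
from dataclasses import dataclass
from typing import Callable, Dict, Iterable

# Hypothetical probe format: one image, one caption that is true of it
# (the negated one) and one that is false (the affirmative one).
@dataclass(frozen=True)
class NegationProbe:
    image_id: str
    lang: str
    correct: str
    distractor: str

# Assumed model interface: (image_id, caption) -> similarity score.
ScoreFn = Callable[[str, str], float]


def affirmation_bias_rate(
    probes: Iterable[NegationProbe], score_fn: ScoreFn
) -> Dict[str, float]:
    """Per language, the fraction of probes where the false affirmative
    caption scores at least as high as the true negated caption."""
    totals: Dict[str, int] = {}
    failures: Dict[str, int] = {}
    for p in probes:
        totals[p.lang] = totals.get(p.lang, 0) + 1
        if score_fn(p.image_id, p.distractor) >= score_fn(p.image_id, p.correct):
            failures[p.lang] = failures.get(p.lang, 0) + 1
    return {lang: failures.get(lang, 0) / n for lang, n in totals.items()}


# Toy stand-in scorer (assumption): counts object words and ignores
# everything else, so "no ball" scores exactly like "a ball".
OBJECTS = {"dog", "ball", "hund"}


def negation_blind_score(image_id: str, caption: str) -> float:
    return float(sum(1 for w in caption.lower().split() if w in OBJECTS))


# Illustrative probes for an image showing a dog and no ball.
PROBES = [
    NegationProbe("img1", "en", "a dog with no ball", "a dog with a ball"),
    NegationProbe("img1", "de", "ein Hund ohne Ball", "ein Hund mit Ball"),
]

if __name__ == "__main__":
    print(affirmation_bias_rate(PROBES, negation_blind_score))
```

Ties are counted as failures here, on the assumption that a model unable to separate the two captions has not understood the negation. A real probe set would replace the toy scorer with calls to the VLM being evaluated.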