arXiv7h ago

SaFeR-Steer: Evolving Multi-Turn MLLMs via Synthetic Bootstrapping and Feedback Dynamics

Haolong Hu, Hanyu Li, Tiancheng He, Huahui Yi, An Zhang, Qiankun Li, Kun Wang, Yang Liu, Zhigang Zeng

View original ↗

Analysis

Viral velocity

low

Implementation gapYES

Novelty9/10

Categorypaper

Topics

multimodalsafetyagents

Opportunity Brief

Create an adversarial testing toolkit for multi-turn MLLMs that evolves synthetic dialogues to uncover long-context safety vulnerabilities. Help developers stress-test their agents before deployment.

Suggested repo: steer-test

"Don't get jailbroken on the 10th turn. Stress-test your vision agents."

Estimated effort: 50h