Develop an evaluation suite that tests for 'hidden' model constraints that survive fine-tuning. This tool would help researchers identify alignment artifacts in supposedly uncensored models.