Create an automated counterfactual augmentation pipeline for biological sequence models. This helps tackle shortcut learning in protein/peptide binding tasks.