Create an open-source implementation of the DEMASK predictor to speed up decoding in discrete diffusion language models. This is critical for making parallel decoding viable for production-grade models.