Create an implementation for Discrete Tilt Matching (DTM) to enable likelihood-free RL fine-tuning for masked diffusion LLMs. This bridge allows developers to apply RL to non-autoregressive models easily.