Yuyan Zhou, Kai Syun Hou, Weiyu Chen, James Kwok
View original ↗Implement a parallel decoder for diffusion-based language models that improves inference speed compared to autoregressive baselines. The current gap is a lack of high-performance implementations for non-autoregressive language generation.
Suggested repo: diff-speak
"Unlock parallel decoding for text generation."
Estimated effort: 80h