WeDLM: Reconciling Diffusion LM with Standard Causal Attentiongithub.com/Tencent6 pointssimonpure6 months ago