publications

Publications by category in reverse chronological order. Generated by jekyll-scholar.

2025

  1. arXiv 2025
    MMaDA: Multimodal Large Diffusion Language Models
    Ling Yang*, Ye Tian*, Bowen Li, and 4 more authors
    arXiv preprint arXiv:2505.15809, 2025
  2. arXiv 2025
    HermesFlow: Seamlessly Closing the Gap in Multimodal Understanding and Generation
    Ling Yang, Xinchen Zhang*, Ye Tian, and 4 more authors
    arXiv preprint arXiv:2502.12148, 2025
  3. arXiv 2025
    Diffusion-Sharpening: Fine-tuning Diffusion Models with Denoising Trajectory Sharpening
    Ye Tian, Ling Yang, Xinchen Zhang, and 3 more authors
    arXiv preprint arXiv:2502.12146, 2025

2024

  1. NeurIPS 2024
    VideoTetris: Towards Compositional Text-to-Video Generation
    Ye Tian, Ling Yang, Haotian Yang, and 8 more authors
    Advances in Neural Information Processing Systems, 2024
  2. NeurIPS 2024
    RealCompo: Dynamic Equilibrium between Realism and Compositionality Improves Text-to-Image Diffusion Models
    Xinchen Zhang, Ling Yang*, Yaqi Cai, and 7 more authors
    arXiv e-prints, 2024
  3. ICLR 2024
    VQGraph: Rethinking Graph Representation Space for Bridging GNNs and MLPs
    Ling Yang*, Ye Tian*, Minkai Xu, and 7 more authors
    In The Twelfth International Conference on Learning Representations, 2024