Shangyu Xing

photo.jpg

I am now a Master student (3 year, course and research based) working with Prof. Xinyu Dai at Natural Language Processing Group, School of Artificial Intelligence, Nanjing University. I got my B.Sc. degree in Computer Science, School of Computer Science and Technology, Nanjing University in 2023.

My current research centers on the domain of multimodality and RL for LLMs, with the goal of advancing the capabilities and reliability of intelligent systems in complex, real-world scenarios.

  1. Pretraining and preference alignment of Vision-Language models. This line of work investigates novel strategies for integrating visual and textual modalities during both pretraining and alignment stages. By improving VLMs’ perceptual abilities, cross-modal understanding, and factual consistency, I aim to enhance their robustness and generalization across diverse applications.

  2. Reinforcement learning for reasoning Large Language Models. Leveraging advanced RL techniques such as GRPO, along with new paradigms like concept-based and latent language modeling, this research seeks to enhance LLMs’ reasoning capabilities, enabling them to perform more sophisticated, coherent, and reliable reasoning in tasks that require structured thought, logical inference, and long-term planning.

I am now actively seeking 2026 PhD opportunities. Here is my cv, or a pdf version here. Full publication list is available here or on Google Scholar. Looking forward to your interest and recommendation!

selected publications

  1. ACMMM
    DRIN: Dynamic Relation Interactive Network for Multimodal Entity Linking
    Shangyu Xing*, Fei Zhao*, Zhen Wu, Chunhui Li, and 2 more authors
    In Proceedings of the 31st ACM International Conference on Multimedia, 2023
  2. EMNLP
    EFUF: Efficient Fine-grained Unlearning Framework for Mitigating Hallucinations in Multimodal Large Language Models
    Shangyu Xing, Fei Zhao, Zhen Wu, Tuo An, and 4 more authors
    EMNLP 2024, 2024
  3. ICLR
    AnyPrefer: An Agentic Framework for Preference Data Synthesis
    Yiyang Zhou, Zhaoyang Wang, Tianle Wang, Shangyu Xing, and 12 more authors
    In International Conference on Learning Representations, 2025
  4. preprint
    GePBench: Evaluating Fundamental Geometric Perception for Multimodal Large Language Models
    Shangyu Xing, Changhao Xiang, Yuteng Han, Yifan Yue, and 5 more authors
    submitted to NeurIPS 2025, 2025