Shangyu Xing

I am now a Master student (3 year, course and research based) working with Prof. Xinyu Dai at Natural Language Processing Group, School of Artificial Intelligence, Nanjing University. I got my B.Sc. degree in Computer Science, School of Computer Science and Technology, Nanjing University in 2023.
My current research centers on the domain of multimodality and RL for LLMs, with the goal of advancing the capabilities and reliability of intelligent systems in complex, real-world scenarios.
-
Pretraining and preference alignment of Vision-Language models. This line of work investigates novel strategies for integrating visual and textual modalities during both pretraining and alignment stages. By improving VLMs’ perceptual abilities, cross-modal understanding, and factual consistency, I aim to enhance their robustness and generalization across diverse applications.
-
Reinforcement learning for reasoning Large Language Models. Leveraging advanced RL techniques such as GRPO, along with new paradigms like concept-based and latent language modeling, this research seeks to enhance LLMs’ reasoning capabilities, enabling them to perform more sophisticated, coherent, and reliable reasoning in tasks that require structured thought, logical inference, and long-term planning.
I am now actively seeking 2026 PhD opportunities. Here is my cv, or a pdf version here. Full publication list is available here or on Google Scholar. Looking forward to your interest and recommendation!