I am currently a research scientist at AMD GenAI. I received my PhD degree from Department of Computer and Information Science, University of Pennsylvania, advised by Professor Dan Roth. Before going to Penn, I was a master's student at University of Illinois, Urbana-Champaign.
My research generally focuses on LLM pre-training/post-training, math reasoning, long-context understanding, and retrieval/faithfulness of knowledge.
Email: xdyu AT seas DOT upenn DOT edu
We are hiring full-time research scientists and research interns. Feel free to reach out if you are interested.
Introducing Instella-Math: A Fully Open Language Model with Reasoning Capability
Xiaodong Yu*, Jiang Liu*, Yusheng Su*, Gowtham Ramesh*, Zicheng Liu*, Prakamya Mishra, Sudhanshu Ranjan, Jialian Wu, Ximeng Sun, Ze Wang,
Emad Barsoum
(* Core Contributor)
Link: [link]
Introducing Instella-Long: A Fully Open Language Model with Long-Context Capability
Jialian Wu*, Jiang Liu*, Sudhanshu Ranjan*, Xiaodong Yu*, Gowtham Ramesh*, Prakamya Mishra*, Zicheng Liu*, Yusheng Su, Ximeng Sun, Ze Wang,
Emad Barsoum
(* Core Contributor)
Link: [link]
Introducing Instella: New State-of-the-art Fully Open 3B Language Models
Jiang Liu*, Jialian Wu*, Xiaodong Yu* Prakamya Mishra*, Sudhanshu Ranjan*, Zicheng Liu*, Chaitanya Manem, Yusheng Su, Pratik Prabhanja Brahma,
Gowtham Ramesh, Ximeng Sun, Ze Wang, Emad Barsoum
(* Core Contributor)
Link: [link]
ZeroTuning: Unlocking the Initial Token's Power to Enhance Large Language Models Without Training
Feijiang Han, Xiaodong Yu, Jianheng Tang, Lyle Ungar
Link: [PDF]
Self-Taught Agentic Long-Context Understanding
Yufan Zhuang, Xiaodong Yu, Jialian Wu, Ximeng Sun, Ze Wang, Jiang Liu, Yusheng Su, Jingbo Shang, Zicheng Liu, Emad Barsoum
ACL 2025
Link: [PDF]
Agent Laboratory: Using LLM Agents as Research Assistants
Samuel Schmidgall, Yusheng Su, Ze Wang, Ximeng Sun, Jialian Wu, Xiaodong Yu, Jiang Liu, Zicheng Liu and Emad Barsoum
Link: [PDF]
ReasonAgain: Using Extractable Symbolic Programs to Evaluate Mathematical Reasoning
Xiaodong Yu*, Ben Zhou*, Hao Cheng, Dan Roth
(* Equal Contribution)
Link: [PDF]
Model Tells Itself Where to Attend: Faithfulness Meets Automatic Attention Steering
Qingru Zhang*, Xiaodong Yu*, Chandan Singh, Xiaodong Liu, Liyuan Liu, Jianfeng Gao, Tuo Zhao, Dan Roth, Hao Cheng
(* Equal Contribution)
Link: [PDF]
ReEval: Automatic Hallucination Evaluation for Retrieval-Augmented Large Language Models via Transferable Adversarial Attacks
Xiaodong Yu, Hao Cheng, Xiaodong Liu, Dan Roth, Jianfeng Gao
Conference of the North American Chapter of the Association for Computational Linguistics (NAACL findings), 2024
Link: [PDF]
Event Linking: Grounding Event Mentions to Wikipedia
Xiaodong Yu, Wenpeng Yin, Nitish Gupta, Dan Roth
Conference of the European Chapter of the Association for Computational Linguistics (EACL), 2023
Link: [PDF]
Learning to Decompose: Hypothetical Question Decomposition Based on Comparable Texts
Ben Zhou, Kyle Richardson, Xiaodong Yu, Dan Roth
Conference on Empirical Methods in Natural Language Processing (EMNLP), 2022
Link: [PDF]
Capturing the Content of a Document through Complex Event Identification
Zheng Qi, Elior Sulem, Haoyu Wang, Xiaodong Yu, Dan Roth
The 11th Joint Conference on Lexical and Computational Semantics (*SEM), 2022
Link: [PDF]
Pairwise Representation Learning for Event Coreference
Xiaodong Yu, Wenpeng Yin, Dan Roth
The 11th Joint Conference on Lexical and Computational Semantics (*SEM), 2022
Link: [PDF]
RESIN: A Dockerized Schema-Guided Cross-document Cross-lingual Cross-media Information Extraction and Event Tracking System
Haoyang Wen, Ying Lin, Tuan Manh Lai, Xiaoman Pan, Sha Li, Xudong Lin, Ben Zhou, Manling Li, Haoyu Wang, Hongming Zhang, Xiaodong Yu, Alexander Dong, Zhenhailong Wang, Yi Ren Fung, Piyush Mishra, Qing Lyu, Dídac Surís, Brian Chen, Susan Windisch Brown, Martha Palmer, Chris Callison-Burch, Carl Vondrick, Jiawei Han, Dan Roth, Shih-Fu Chang, Heng Ji
Conference of the North American Chapter of the Association for Computational Linguistics (NAACL Demonstrations), 2021
Link: [PDF]
Design Challenges in Low-resource Cross-lingual Entity Linking
Xingyu Fu*, Weijia Shi*, Xiaodong Yu, Zian Zhao, Dan Roth
Conference on Empirical Methods in Natural Language Processing (EMNLP), 2020
Link: [PDF]
On the Strength of Character Language Models for Multilingual Named Entity Recognition (short paper)
Xiaodong Yu, Stephen Mayhew, Mark Sammons, Dan Roth
Conference on Empirical Methods in Natural Language Processing (EMNLP), 2018
Link: [PDF]
CogCompNLP: Your Swiss Army Knife for NLP
Daniel Khashabi, Mark Sammons, Ben Zhou, Tom Redman, Christos Christodoulopoulos, Vivek Srikumar, Nicholas Rizzolo, Lev Ratinov, Guanheng Luo, Quang Do, Chen-Tse Tsai, Subhro Roy, Stephen Mayhew, Zhili Feng, John Wieting, Xiaodong Yu, Yangqiu Song, Shashank Gupta, Shyam Upadhyay, Naveen Arivazhagan, Qiang Ning, Shaoshi Ling, Dan Roth
11th Language Resources and Evaluation Conference (LREC), 2018
Link: [PDF]
2019/05 - 2024/08: University of Pennsylvania
PhD in Computer and Information Science
Advisor: Professor Dan Roth
2017/08 - 2019/05: University of Illinois Urbana-Champaign
Master of Science in Computer Science
Advisor: Professor Dan Roth
2016/01 - 2017/05: University of Illinois Urbana-Champaign
Bachelor of Science in Computer Science
Advisor: Professor Dan Roth
2013/09 - 2015/12: Shanghai Jiao Tong University
Electrical and Computer Engineering
2024/08 - present: AMD GenAI
Research Scientist
2023/02 - 2024/06: Microsoft Research (Deep Learning Group)
Research Intern
2022/05 - 2022/08: Amazon (Alexa AI-Web Info)
Applied Scientist Intern
2020/05 - 2020/08: Salesforce Research
Research Intern