👋🏻 Hi, I'm Zheyu Zhang (张哲语)!

I am a 2nd-year PhD student at Technical University of Munich, advised by Gjergji Kasneci. I am also a junior member of the Munich Center for Machine Learning.

Prior to that, I studied at LMU Munich, where I obtained my bachelor's degree in computer science and mathematics, and master's degree in computational linguistics.

My research currently focuses on large generative models, including LLM post-training [8], synthetic data generation [3,4,6], knowledge adaptation [1,2,5], and diffusion models [7].

Email / GitHub / Google Scholar / LinkedIn / CV

profile photo

News

Apr '26   🍧 One paper is accepted to ICML 2026! See you in Seoul!

Apr '26   🏖️ Two papers are accepted to ACL 2026! See you in San Diego!

Aug '25   🦀 Three papers are accepted to EMNLP 2025! See you in Suzhou!

May '25   🎻 One paper is accepted to ACL 2025! See you in Vienna!

Dec '24   🥳 I start my PhD journey!!!

Jan '24   🦞 One paper is accepted to EACL 2024 and I will be in Malta as a student volunteer!

Oct '23   🍼 One paper is accepted to BabyLM Challenge @CoNLL 2023!

▾ show all news

Selected Publications

* indicates equal contribution.

[8] Consolidating Rewarded Perturbations for LLM Post-Training
Zheyu Zhang, Shuo Yang, Gjergji Kasneci
Preprint
[7] Active Tabular Augmentation via Policy-Guided Diffusion Inpainting
Zheyu Zhang, Shuo Yang, Bardh Prenkaj, Gjergji Kasneci
ICML 2026
[6] SAGE: Sparse Adaptive Guidance for Dependency-Aware Tabular Data Generation
Shuo Yang*, Zheyu Zhang*, Bardh Prenkaj, Gjergji Kasneci
ACL 2026 (✨Oral)
[5] Where Paths Split: Localized, Calibrated Control of Moral Reasoning in Large Language Models
Chenchen Yuan, Zheyu Zhang, Bardh Prenkaj, Gjergji Kasneci
ACL 2026
[4] Not All Features Deserve Attention: Graph-Guided Dependency Learning for Tabular Data Generation with Language Models
Zheyu Zhang, Shuo Yang, Bardh Prenkaj, Gjergji Kasneci
EMNLP 2025 Findings
[3] Doubling Your Data in Minutes: Ultra-fast Tabular Data Generation via LLM-Induced Dependency Graphs
Shuo Yang, Zheyu Zhang, Bardh Prenkaj, Gjergji Kasneci
EMNLP 2025
[2] mPLM-Sim: Better Cross-Lingual Similarity and Transfer in Multilingual Pretrained Language Models
Peiqin Lin, Chengzhi Hu, Zheyu Zhang, André FT Martins, Hinrich Schütze
EACL 2024 Findings
[1] Baby’s CoThought: Leveraging Large Language Models for Enhanced Reasoning in Compact Models
Zheyu Zhang, Han Yang, Bolei Ma, David Rügamer, Ercong Nie
EMNLP 2023 BabyLM Challenge

Experience

Education

  • Ph.D. in Computer Science, Dec. 2024 - present
    Technical University of Munich
  • M.Sc. in Computational Linguistics, Oct. 2022 - Aug. 2024
    University of Munich (LMU Munich); Thesis advisor: Prof. Hinrich Schütze
  • B.Sc. in Computer Science and Mathematics, Apr. 2020 - Sept. 2022
    University of Munich (LMU Munich); Thesis advisor: Prof. Hinrich Schütze

Academic Service

  • Volunteering
    ACL 2025, EACL 2024
  • Reviewing
    ECML-PKDD 2025, ACL-ARR 2025
Flag Counter