Zhimeng Wang

Zhimeng (知萌) is currently a fourth-year undergraduate at Soochow University, advised by Assoc. Prof. Juntao Li and Prof. Min Zhang (ACL Fellow).
His research focuses on the intersection of MLSys and NLP, including reasoning (inference-time scaling), LLM + RL, and long-context LLMs.
In 2025, he plans to focus on the following topics:
- Long-context capabilities of large models: This includes long-context generation, evaluation, inference acceleration (with a focus on MLSys), and long-video understanding.
- Advancing RL for LLMs: He aims to address broader challenges beyond alignment. Current RLHF is not true reinforcement learning, and methods like Chain-of-Thought, PRM, or Multi-Agent Workflows cannot fully resolve this issue. His goal is to develop genuine reinforcement learning for LLMs.
He is eager to collaborate with talented researchers to explore these areas!
news
Sep 20, 2024 | A paper (CTD framework) has been accepted to EMNLP 2024 🔥 🔥 |
---|---|
latest posts
Jan 25, 2025 | |
---|---|
Oct 02, 2024 | a post with tabs |
May 14, 2024 | Google Gemini updates: Flash 1.5, Gemma 2 and Project Astra |