Zhimeng Wang

pic2.jpg

Zhimeng (知萌) is currently forth-year undergrad at Soochow University, advised by Assoc. Prof. Juntao Li and Prof. Min Zhang (ACL Fellow).

His research focuses on the intersection of MLSys and NLP, including Reasoning(Inference-time Scaling), LLM + RL and Long Context LLM.

In 2025, he plans to focus on the following topics:

  1. Long context capabilities of large models: This includes long-context generation, evaluation, inference acceleration (with a focus on MLSys), and long-video understanding.

  2. Advancing RL for LLMs: He aims to address broader challenges beyond alignment. Current RLHF is not true reinforcement learning; methods like Chain-of-Thought, PRM, or Multi-Agent Workflows cannot fully resolve this issue. His goal is to work on developing genuine reinforcement learning for LLMs.

He is eager to collaborate with talented researchers to explore these areas!

news

Sep 20, 2024 A paper (CTD framework) has been accepted to EMNLP2024 🔥 🔥
Nov 07, 2015 A long announcement with details
Oct 22, 2015 A simple inline announcement.

latest posts

selected publications

  1. Can Quantum-Mechanical Description of Physical Reality Be Considered Complete?
    A. Einstein*†B. Podolsky*, and N. Rosen*
    Phys. Rev., New Jersey. More Information can be found here , May 1935