PhD Student @ Hong Kong University of Science and Technology
Research Interests:
- LLM RL for Agent and Reasoning: PaW, HIVE, AHDAgent, Is PRM Necessary?
- LLM Alignment & RedTeaming: SafeDelta, SICO
I expect to graduate in 2026 and am actively seeking industry positions. Please feel free to reach out via email at nluab AT cse.ust.hk !

Ning Lu*, Baijiong Lin*, Shengcai Liu, Jiahao Wu, Haoze Lv, Yanbin Wei, Lingting Zhu, Shengju Qian, Xin Wang, Ying-Cong Chen, Qi Wang, Ke Tang (* equal contribution)
Under review. 2026
The first policy and world-modeling co-training RL framework for LLM agents.

Haoze Lv*, Ning Lu*, Ziang Zhou, Shengcai Liu (* equal contribution)
Under review. 2026
The first tool-integrated multi-turn agentic framework for automatic algorithm design.

Jiahao Wu*, Ning Lu*, Shengcai Liu, Kun Wang, Yanting Yang, Li Qing, Ke Tang (* equal contribution)
Under review. 2026
The first online policy-verified data selection framework for efficient RL training.

Zhangying Feng*, Qianglong Chen*, Ning Lu, Yongqian Li, Siqi Cheng, Shuangmu Peng, Duyu Tang, Shengcai Liu, Zhirui Zhang (* equal contribution)
Conference on Neural Information Processing Systems (NeurIPS) 2025
Unifying problem solving and solution-process judgment.

Ning Lu, Shengcai Liu, Jiahao Wu, Weiyu Chen, Zhirui Zhang, Yew-Soon Ong, Qi Wang, Ke Tang
International Conference on Machine Learning (ICML) 2025
The first safety-aware post-fine-tuning defense method for LLM alignment.