📝 Publications
A full publication list is available on my google scholar page.
(* denotes equal contribution)
- PGPO: Enhancing Agent Reasoning via Pseudocode-style Planning Guided Preference Optimization
Zouying Cao, Runze Wang, Yifei Yang, Xinbei Ma, Xiaoyong Zhu, Bo Zheng, Hai Zhao.
[ACL-Findings, 2025] ||
|
|
- LESA: Learnable LLM Layer Scaling-Up
Yifei Yang, Zouying Cao, Xinbei Ma, Yao Yao, Libo Qin, Zhi Chen, Hai Zhao.
[ACL-Main, 2025] ||
- Plan-over-Graph: Towards Parallelable LLM Agent Schedule
Shiqi Zhang*, Xinbei Ma*, Zouying Cao, Zhuosheng Zhang, Hai Zhao.
[Preprint, 2025] ||
- KVSharer: Efficient Inference via Layer-Wise Dissimilar KV Cache Sharing
Yifei Yang, Zouying Cao, Qiguang Chen, Libo Qin, Dongjie Yang, Hai Zhao, Zhi Chen.
[Preprint, 2024] ||
- SCANS: Mitigating the Exaggerated Safety for LLMs via Safety-Conscious Activation Steering
Zouying Cao, Yifei Yang, Hai Zhao.
[AAAI Oral, 2025] ||
|
|
- Head-wise Shareable Attention for Large Language Models
Zouying Cao, Yifei Yang, Hai Zhao.
[EMNLP-Findings, 2024] ||
|
|
- LaCo: Large Language Model Pruning via Layer Collapse
Yifei Yang, Zouying Cao, Hai Zhao.
[EMNLP-Findings, 2024] ||
- AutoHall: Automated Hallucination Dataset Generation for Large Language Models
Zouying Cao, Yifei Yang, Hai Zhao.
[Preprint, 2023] ||