News
📌 2026
💼 Joined Alibaba Cloud, Hangzhou, China
2026.02 Technical Expert
📄 Paper accepted at IEEE CCGrid 2026
2026.02 "SD-MoE: Scenario-driven MoE forecasting for intelligent elastic scaling in cloud clusters"
📌 2025
📄 Paper accepted at ICA3PP 2025
2025.09 "AKD: Asymmetric knowledge distillation for time series models in cloud monitoring"
📄 Paper accepted at KSEM 2025
2025.06 "Optimization techniques for large language model inference: A review"
📄 Paper accepted at WASA 2025
2025.03 "Parallelization techniques for large language models: From training to inference"
📌 2024
📄 Paper accepted at IWQoS 2024
2024.09 "Improving QoS with CPU pinning via deep reinforcement learning"
📄 Paper accepted at IWQoS 2024
2024.09 "QoS perception for cloud databases: Necessity, trends, and challenges"
📄 Paper accepted at MMM 2024
2024.01 "MetaVSR: Video super-resolution for arbitrary magnification"
📌 2023
💼 Joined Guangdong Laboratory of Artificial Intelligence and Digital Economy (SZ), Shenzhen, China
2023.08 Associate Researcher
