News & Events

Tackling the Dynamicity in a Production LLM Serving System with SOTA Optimizations via Hybrid Prefill/Decode/Verify Scheduling on Efficient Meta-kernels

2025-04-01

Make way for X-Y Serve! Prof. Yin Shouyi and Prof. Hu Yang from the School of Integrated Circuits, in collaboration with Huawei XiaoYi AI Infra (@HuaweiAPAC), have unveiled a high-performance serving system for large language models. By unifying computations into hardware-friendly kernels, the system achieves a significant performance improvement on Ascend NPUs, reportedly outperforming the A800. The technology is also fully adaptable to GPU architectures and ready for the future.

The Latest Developments

2025.04.01
Tackling the Dynamicity in a Production LLM Serving System with SOTA Optimizations via Hybrid Prefill/Decode/Verify Scheduling on Efficient Meta-kernels
2024.11.05
Challenges and recent advances in HfO₂-based ferroelectric films for non-volatile memory applications
2024.04.15
Interface electronics using 28-nm node CMOS
2023.10.13
Tsinghua University makes breakthrough in system-integrated memristor computing-in-memory chips
2023.06.08
2023 Tsinghua University Alumni IC Forum Held
2023.05.12
Ren Tian-ling's research group developed an intelligent wearable artificial throat for mixed-modality speech recognition and interaction
