Yunhua Fang, Rui Xie, et al. "Accelerating LLM Inference via Dynamic KV Cache Placement in Heterogeneous Memory System." IEEE Computer Architecture Letters, 2025.
Yayue Hou, Hsinyu Tsai, et al. "NORA: Noise-Optimized Rescaling of LLMs on Analog Compute-in-Memory Accelerators." DATE 2025.
Rui Xie, Asad Ul Haq, et al. "Breaking the HBM Bit Cost Barrier: Domain-Specific ECC for AI Inference Infrastructure." IEEE Computer Architecture Letters, 2025.