publications
2025
- DACSAGA: A Memory-Efficient Accelerator for GANN Construction via Harnessing Vertex SimilarityIn Design Automation Conference (DAC), Jun 2025
- MICROHEAT: NPU-NDP HEterogeneous Architecture for Transformer-Empowered Graph Neural NetworksIn IEEE/ACM International Symposium on Microarchitecture (MICRO), Jun 2025
2024
- HPCADifferential-Matching Prefetcher for Indirect Memory AccessIn The 30-th IEEE International Symposium on High-Performance Computer Architecture (HPCA), Jun 2024
2022
- TPDSA Comprehensive Performance Model of Sparse Matrix-Vector Multiplication to Guide Kernel OptimizationIEEE Transactions on Parallel and Distributed Systems, Jun 2022