Publications
Students with † are advised by me; Students with * are co-first authors.
2025
- AxCore: A Quantization-Aware Approximate GEMM Unit for LLM InferenceIn IEEE/ACM International Symposium on Microarchitecture (MICRO) , 2025(to appear)
- X-SET: An Efficient Graph Pattern Matching Accelerator With Order-Aware Parallel Intersection UnitsIn IEEE/ACM International Symposium on Microarchitecture (MICRO) , 2025(to appear)
- OA-LAMA: An Outlier-Adaptive LLM Inference Accelerator with Memory-Aligned Mixed-Precision Group QuantizationIn IEEE/ACM International Conference on Computer-Aided Design (ICCAD) , 2025(to appear)
- Graphitron: A Domain Specific Language for FPGA-Based Graph Processing Accelerator GenerationIn ACM SIGPLAN/SIGBED International Conference on Languages, Compilers, and Tools for Embedded Systems (LCTES) , 2025
- Rethinking Dynamic Networks and Heterogeneous Computing with Automatic ParallelizationIn The 9th Asia-Pacific Workshop on Networking (APNet) , 2025(to appear)
2023
2022
2021
- Skew-oblivious data routing for data intensive applications on FPGAs with HLSIn ACM/IEEE Design Automation Conference (DAC) , 2021
2020
2019
- A survey on graph processing accelerators: Challenges and opportunitiesJournal of Computer Science and Technology, 2019