I am a first-year Ph.D. student at MIT EECS, advised by Prof. Song Han.

Previously, I received my B.Eng. in Computer Science from Shanghai Jiao Tong University (ACM Honors Class). During my junior year, I also had a wonderful time as an undergraduate researcher advised by Prof. Jingwen Leng at SJTU EPCC Lab.

My research interests lie in Efficient Algorithms and Systems for Large Language Models.






News


Publications


* indicates equal contribution

  1. Quest: Query-Aware Sparsity for Efficient Long-Context LLM Inference Jiaming Tang*, Yilong Zhao*, Kan Zhu, Guangxuan Xiao, Baris Kasikci, and Song Han ICML 2024 / Abstract / Code
  2. AWQ: Activation-aware Weight Quantization for LLM Compression and Acceleration Ji Lin*, Jiaming Tang*, Haotian Tang, Shang Yang, Wei-ming Chen, Wei-chen Wang, Guangxuan Xiao, Xingyu Dang, Chuang Gan, and Song Han MLSys 2024 / Best Paper Award / Abstract / Code
  1. OliVe: Accelerating Large Language Models via Hardware-friendly Outlier-Victim Pair Quantization Cong Guo*, Jiaming Tang*, Weiming Hu, Jingwen Leng, Chen Zhang, Fan Yang, Yunxin Liu, Minyi Guo, and Yuhao Zhu ISCA 2023 / Abstract / Code