I am an undergraduate at ACM Honors Class, Shanghai Jiao Tong University. Currently, I am fortunate to work with Prof. Song Han at MIT HAN Lab as a research intern.

During my junior year, I also had a wonderful time as an undergraduate researcher advised by Prof. Jingwen Leng at SJTU EPCC Lab.

My research interests lie in Efficient Systems and Algorithms for Large Language Models.






News


Publications


* indicates equal contribution

  1. AWQ: Activation-aware Weight Quantization for LLM Compression and Acceleration Ji Lin*, Jiaming Tang*, Haotian Tang, Shang Yang, Xingyu Dang, and Song Han arXiv / Abstract / Code
  2. OliVe: Accelerating Large Language Models via Hardware-friendly Outlier-Victim Pair Quantization Cong Guo*, Jiaming Tang*, Weiming Hu, Jingwen Leng, Chen Zhang, Fan Yang, Yunxin Liu, Minyi Guo, and Yuhao Zhu ISCA2023 / Abstract / Code