About me

What’s up! I’m Jason, a 3rd-year undergrad studying computer engineering at UC San Diego. I am currently working under the supervision of Professor Hao Zhang @ Hao AI Lab and Professor Tajana Šimunić Rosing @ SeeLab. I am currently involved with a several cutting-edge LLM research, more specifically on LLM evaluation, compression, and KV-caches.

Why Research?

My Values and Attributes: Foresight and initiative are two key attributes that I practice every waking second. I channel my foresight into choosing the right problems to solve, and I fuel my initiative by understanding the importance of my work.

My Community: A big part of what makes research so enjoyable for me is belonging to a wonderful research community and having friendships with my smart PhD colleagues. If you’d like to chat, feel free to connect with me on 𝕏.

Projects and Publications

TinyAgentQuantization-aware Model Compression and Adaptation for On-device LLM Agent Deployment

  • We introduce a novel LLM edge deployment solution to automatically fine-tune and compress domain-specific LLMs with up to 8x memory saving and inference speedup with minimal performance loss. This work was presented at the ICML 2024 ES-FoMo-II workshop.

Awards

Best Poster Award - PRISM Annual Review

  • I was awarded the best poster award for PRISM Annual Review for Theme 1 - Systems & Software. November 8, 2024.