Publications

Tiny Agent

Published in ICML, 2024

TinyAgent: Quantization-aware Model Compression and Adaptation for On-device LLM Agent Deployment