Tiny Agent
Published in ICML, 2024
TinyAgent: Quantization-aware Model Compression and Adaptation for On-device LLM Agent Deployment
Published in ICML, 2024
TinyAgent: Quantization-aware Model Compression and Adaptation for On-device LLM Agent Deployment