FrostNet: Towards Quantization-Aware Network Architecture Search
Updated May 3, 2024 - Python
Post-training quantization with TensorFlow and model compilation with TVM
Fine-tune large language models and use them for text-related tasks. This repository provides a straightforward approach to fine-tuning models like Gemma, Llama, and Mistral for various NLP tasks. It includes training, fine-tuning, and inference pipelines.