Qualcomm Innovation Center (QuIC) is at the forefront of enabling low-power inference at the edge through its pioneering model-efficiency research. QuIC has a mission to help migrate the ecosystem toward fixed-point inference. With this goal, QuIC presents the AI Model Efficiency Toolkit (AIMET) - a library that provides advanced quantization and compression techniques for trained neural network models. AIMET enables neural networks to run more efficiently on fixed-point AI hardware accelerators. Quantized inference is significantly faster than floating point inference. For example, models that we’ve run on the Qualcomm® Hexagon™ DSP rather than on the Qualcomm® Kryo™ CPU have resulted in a 5x to 15x speedup. Plus, an 8-bit model also has a 4x smaller memory footprint relative to a 32-bit model. However, often when quantizing a machine learning model (e.g., from 32-bit floating point to an 8-bit fixed point value), the model accuracy is sacrificed.

Features

  • Equalize weight tensors to reduce amplitude variation across channels
  • Tensor-decomposition technique to split a large layer into two smaller ones
  • Corrects shift in layer outputs introduced due to quantization
  • Removes redundant input channels from a layer and reconstructs layer weights
  • Use quantization sim to train the model further to improve accuracy
  • Automatically selects how much to compress each layer in the model

Project Samples

Project Activity

See All Activity >

License

MIT License

Follow AIMET

AIMET Web Site

Other Useful Business Software
MongoDB Atlas runs apps anywhere Icon
MongoDB Atlas runs apps anywhere

Deploy in 115+ regions with the modern database for every enterprise.

MongoDB Atlas gives you the freedom to build and run modern applications anywhere—across AWS, Azure, and Google Cloud. With global availability in over 115 regions, Atlas lets you deploy close to your users, meet compliance needs, and scale with confidence across any geography.
Start Free
Rate This Project
Login To Rate This Project

User Reviews

Be the first to post a review of AIMET!

Additional Project Details

Programming Language

Python

Related Categories

Python Machine Learning Software, Python Neural Network Libraries, Python LLM Inference Tool

Registered

2022-08-19