ludoplex/neural-compressor

Provide unified APIs for SOTA model compression techniques, such as low precision (INT8/INT4/FP4/NF4) quantization, sparsity, pruning, and knowledge distillation on mainstream AI frameworks such as TensorFlow, PyTorch, and ONNX Runtime.

Apache License 2.0

Intel® Neural Compressor is an open-source Python library that provides unified APIs for model compression techniques like quantization, pruning, and distillation across TensorFlow, PyTorch, ONNX Runtime, and MXNet. It's designed for developers and researchers working on optimizing deep learning models for Intel hardware and other platforms, offering features like automatic accuracy-driven quantization and support for popular models from hubs like Hugging Face and Torch Vision.
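
To make the idea of accuracy-driven INT8 quantization concrete, here is a minimal sketch in plain Python. It does not use or reproduce the library's API; the function names (quantize_int8, dequantize, quantize_if_accurate) and the max_error tolerance are hypothetical, chosen only for illustration. It assumes symmetric per-tensor quantization and uses reconstruction error as a stand-in for a real accuracy metric.

```python
def quantize_int8(values):
    """Symmetric per-tensor INT8 quantization: returns (integer codes, scale)."""
    max_abs = max((abs(v) for v in values), default=0.0)
    # Map the largest magnitude to 127; fall back to 1.0 for an all-zero input.
    scale = max_abs / 127.0 if max_abs > 0 else 1.0
    codes = [max(-127, min(127, round(v / scale))) for v in values]
    return codes, scale


def dequantize(codes, scale):
    """Map integer codes back to approximate floating-point values."""
    return [c * scale for c in codes]


def quantize_if_accurate(values, max_error):
    """Toy analogue of accuracy-driven tuning (an assumption, not the library's
    algorithm): keep the INT8 result only if the worst-case reconstruction
    error stays within max_error, otherwise return None to signal a fallback
    to higher precision."""
    codes, scale = quantize_int8(values)
    restored = dequantize(codes, scale)
    worst = max((abs(a - b) for a, b in zip(values, restored)), default=0.0)
    return (codes, scale) if worst <= max_error else None


weights = [0.5, -1.27, 0.02, 1.0]
codes, scale = quantize_int8(weights)
accepted = quantize_if_accurate(weights, max_error=0.01)
rejected = quantize_if_accurate(weights + [0.004], max_error=0.001)
```

With these inputs the scale is roughly 0.01, so 0.004 rounds to code 0 and its reconstruction error of 0.004 exceeds the tighter tolerance; the second call therefore returns None, while the first is accepted.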

Top contributors

chensuyue: 281 contributions
guomingz: 276 contributions
mengniwang95: 227 contributions
PenghuiCheng: 202 contributions
xin3he: 165 contributions
ftian1: 161 contributions
lvliang-intel: 133 contributions
ClarkChin08: 114 contributions
changwangss: 102 contributions
zehao-intel: 95 contributions

Recent events

Kivach runs on the Obyte network, so all donations can be tracked.

No events yet