powerfulmoves

powerfulmoves/pmoves-ktransformers

A Flexible Framework for Experiencing Cutting-edge LLM Inference Optimizations

0
0
Apache License 2.0

KTransformers is a Python framework that enhances Hugging Face Transformers with advanced kernel optimizations and placement/parallelism strategies, enabling cutting-edge LLM inference optimizations for local deployments. It provides a flexible, extensible platform for experimenting with optimizations like CPU/GPU offloading of quantized models, supporting features such as MoE offloading, sparse attention, and integration with kernels from Llamafile and Marlin.

Total donated
Undistributed
Share with your subscribers:

Recipients

How the donated funds are distributed

Support the dependencies

Support the repos that depend on this repository

Top contributors

Atream's profile
Atream
211 contributions
Azure-Tang's profile
Azure-Tang
112 contributions
KMSorSMS's profile
KMSorSMS
109 contributions
qiyuxinlin's profile
qiyuxinlin
94 contributions
UnicornChan's profile
UnicornChan
54 contributions
SkqLiao's profile
SkqLiao
37 contributions
ovowei's profile
ovowei
24 contributions
aubreyli's profile
aubreyli
16 contributions
ceerRep's profile
ceerRep
13 contributions
chenht2022's profile
chenht2022
11 contributions

Recent events

Kivach works on the Obyte network, and therefore you can track all donations.

No events yet