fishke22

fishke22/ktransformers

體驗尖端 LLM 推理優化的靈活框架

0
0
Apache License 2.0

KTransformers is a flexible Python framework designed to enhance the Hugging Face Transformers experience by providing advanced kernel optimizations and placement/parallelism strategies for LLM inference. It offers a user-friendly injection system that allows researchers to easily replace original modules with optimized variants, supporting features like CPU/GPU offloading of quantized models and integration with tools like Llamafile and Marlin.

Total donated
Undistributed
Share with your subscribers:

Recipients

How the donated funds are distributed

Support the dependencies

Top contributors

Azure-Tang's profile
Azure-Tang
58 contributions
UnicornChan's profile
UnicornChan
54 contributions
Atream's profile
Atream
53 contributions
KMSorSMS's profile
KMSorSMS
53 contributions
qiyuxinlin's profile
qiyuxinlin
8 contributions
chenht2022's profile
chenht2022
6 contributions
james0zan's profile
james0zan
3 contributions
ErvinXie's profile
ErvinXie
3 contributions
sayap's profile
sayap
2 contributions
hrz6976's profile
hrz6976
2 contributions

Recent events

Kivach works on the Obyte network, and therefore you can track all donations.

No events yet