KTransformers is a flexible Python framework that enhances Hugging Face Transformers with advanced kernel optimizations and placement/parallelism strategies for local LLM inference. It provides a template-based injection system that allows users to easily swap in optimized modules, enabling faster inference on resource-constrained devices through GPU/CPU offloading and support for quantized models.
How the donated funds are distributed
Kivach works on the Obyte network, and therefore you can track all donations.