Description:
BitNet is the official inference framework for 1-bit LLMs, providing optimized CPU/GPU kernels for fast, lossless 1-bit model inference. It includes demo models, build scripts and integration with llama.cpp for efficient 1-bit LLM inference.