Developer Guide Overview¶
Welcome to the Developer Guide for DeepExtension — a flexible, extensible platform for building domain-specific AI models. This guide is designed to help developers, engineers, and technically curious users understand how to extend and customize DeepExtension for real-world usage.
Whether you're deploying on a local machine or preparing your own training logic, this section provides everything you need to set up, integrate, and build with confidence — even if you're new to LLM fine-tuning.
What You'll Learn¶
This guide will walk you through:
-
Supported Hardware & Performance Notes
Understand which hardware configurations work best for inference, training, and deployment — and how they affect speed and capacity. -
Installation Guide
How to set up DeepExtension from scratch on supported platforms like Linux or Windows via WSL (CUDA-enabled) and macOS. -
Base Model Management
Learn how to register and organize local base models that serve as the foundation for all fine-tuning activities. -
Implementing Your Own Training (CUDA(Linux or Windows via WSL))
Extend DeepExtension by integrating your own training script (LoRA, SFT, GRPO, etc.) using our built-in templates. -
Implementing Your Own Training (macOS + MLX)
If you're developing on macOS, we offer support for lightweight training using Apple Silicon and MLX. You'll learn how to integrate and test custom MLX-based training workflows.
Who This Is For¶
This section is for:
- Developers with Python experience who want to extend model training logic
- MLOps engineers managing training environments and hardware
- AI researchers prototyping domain-specific models
- Technical users seeking to customize DeepExtension for internal workflows
You do not need to be an AI expert to follow along — we provide templates, configuration tips, and clear entry points so you can build progressively.
Ready to Start?¶
Choose a chapter below to dive in: