Zum Inhalt

Complete Models

Complete Models are fully materialized models created by merging trained adapters into their corresponding base models. Once Complete, these models are self-contained, versioned snapshots that can be independently deployed, evaluated, or reused.


Overview

On the Complete Models page, you will find a list of all merged models that were successfully Complete through the Customized Models interface.

Each model entry includes:

  • Auto-Generated Name: Created using the following format:
[complete_model_name] = [customized_model_name]_s_[first4ofSavingUUID]
  • Model Card: A structured summary of technical details, configurations, and metadata from both the base model and the training process

Deploy a Complete Model

To make a Complete model available for real-time use:

  1. Click "Deploy" on the desired Complete model
  2. Provide any required extra parameters (e.g., quantization configuration, deployment environment)
  3. DeepExtension will forward deployment requests to your configured LLM deployment tool
  4. Upon successful deployment, a new entry will appear under Live Models

Note:

  • Deployment requires prior integration with an external deployment backend. DeepExtension itself does not serve models.
  • To deploy via Ollama, a valid Deployment Template File is required. See Base Models for more details.
  • The Deployment Environment must be correctly configured. See Deployment Tool Configuration.
  • Currently, only the following quantization modes are supported: no_quantization, q8_0, q4_K_M, and q4_K_S — as these are the only options supported by the Ollama API.

Delete a Complete Model

To remove a Complete model:

  • Click "Delete" on the selected model
  • This action will permanently delete the model directory from your local storage

If the model was already deployed via the Deploy button, the deployment entry in Live Models will remain unaffected.


DeepExtension — From training artifacts to deployable intelligence, all in one flow