Raymond Maarloeve
LoadModelDTO Class Reference

Represents a request to load a model into the system.

Public Attributes

string model_id
 The unique identifier of the model to be loaded.
 
string model_path
 The path to the model file to be loaded.
 
int n_ctx
 Context size for the model, which determines how many tokens can be processed at once.
 
int n_parts
 Number of parts to split the model into for loading.
 
int seed
 Seed for random number generation, used for reproducibility in model loading.
 
bool f16_kv
 Whether to store the key/value cache in 16-bit floating point (half precision) rather than 32-bit, which can reduce memory usage.
 
int n_gpu_layers
 Number of model layers to offload to the GPU, which can significantly speed up processing: -1 offloads all layers, 0 runs on the CPU only, and a positive integer offloads that many layers.
 

Detailed Description

Represents a request to load a model into the system.
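
As an illustration of how the attributes listed above fit together, the following minimal sketch builds a request and serializes it. It rests on several assumptions: the Python dataclass is a hypothetical mirror of the documented fields and not the project's own declaration, JSON is assumed as the transport format (this page does not say how the DTO is sent), and every value shown is a placeholder, not a project default.

```python
import json
from dataclasses import dataclass, asdict


@dataclass
class LoadModelDTO:
    """Hypothetical Python mirror of the attributes documented on this page."""
    model_id: str      # unique identifier of the model
    model_path: str    # path to the model file
    n_ctx: int         # context size, in tokens
    n_parts: int       # number of parts to split the model into
    seed: int          # seed for random number generation
    f16_kv: bool       # see the f16_kv member documentation
    n_gpu_layers: int  # see the n_gpu_layers member documentation


# Placeholder values chosen only for illustration.
request = LoadModelDTO(
    model_id="example-model",
    model_path="models/example.gguf",
    n_ctx=2048,
    n_parts=1,
    seed=42,
    f16_kv=True,
    n_gpu_layers=0,  # 0 keeps the model on the CPU
)

# Assumed wire format: a flat JSON object keyed by attribute name.
payload = json.dumps(asdict(request), indent=2)
print(payload)
```

Keeping the field names identical to the documented attribute names means the serialized keys match this reference one to one, which makes a mismatch between client and server easier to spot.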

Member Data Documentation

◆ f16_kv

bool LoadModelDTO.f16_kv

Whether to store the key/value cache in 16-bit floating point (half precision) rather than 32-bit, which can reduce memory usage.

◆ model_id

string LoadModelDTO.model_id

The unique identifier of the model to be loaded.

◆ model_path

string LoadModelDTO.model_path

The path to the model file to be loaded.

◆ n_ctx

int LoadModelDTO.n_ctx

Context size for the model, which determines how many tokens can be processed at once.

◆ n_gpu_layers

int LoadModelDTO.n_gpu_layers

Number of model layers to offload to the GPU, which can significantly speed up processing: -1 offloads all layers, 0 runs on the CPU only, and a positive integer offloads that many layers.

◆ n_parts

int LoadModelDTO.n_parts

Number of parts to split the model into for loading.

◆ seed

int LoadModelDTO.seed

Seed for random number generation, used for reproducibility in model loading.


The documentation for this class was generated from the following file: