com.gencior.triton.core.pojo.TritonModelConfig

public final class TritonModelConfig extends Object

Encapsulates configuration information for a Triton model.

This class represents the runtime configuration of a deployed model including its platform, backend, runtime environment, batching capabilities, and associated model files. This configuration is read from Triton's model configuration files and defines how the model is executed and what data formats it expects.

This is an immutable object that wraps the gRPC message ModelConfig.

Since:: 1.0.0
Author:: sachachoumiloff

Method Summary

Modifier and Type

Method

Description

static TritonModelConfig

fromProto(ModelConfigOuterClass.ModelConfig proto)

String

getBackend()

Returns the backend implementation name.

Map<String,String>

getCcModelFilenames()

Returns an unmodifiable map of compute capability to model filename mappings.

String

getDefaultModelFilename()

Returns the default model filename.

int

getMaxBatchSize()

Returns the maximum batch size supported by the model, or 0 if batching is disabled.

Map<String,String>

getMetricTags()

Returns an unmodifiable map of custom metric tags for this model.

String

getName()

Returns the name of the model.

String

getPlatform()

Returns the platform type (e.g., "tensorflow_savedmodel", "tensorrt_plan").

String

getRuntime()

Returns the runtime environment name.

String

toString()

Methods inherited from class java.lang.Object
clone, equals, finalize, getClass, hashCode, notify, notifyAll, wait, wait, wait

Method Details
- fromProto
  
  public static TritonModelConfig fromProto(ModelConfigOuterClass.ModelConfig proto)
- getName
  
  public String getName()
  
  Returns the name of the model.
- getPlatform
  
  public String getPlatform()
  
  Returns the platform type (e.g., "tensorflow_savedmodel", "tensorrt_plan").
- getBackend
  
  public String getBackend()
  
  Returns the backend implementation name.
- getRuntime
  
  public String getRuntime()
  
  Returns the runtime environment name.
- getMaxBatchSize
  
  public int getMaxBatchSize()
  
  Returns the maximum batch size supported by the model, or 0 if batching is disabled.
- getDefaultModelFilename
  
  public String getDefaultModelFilename()
  
  Returns the default model filename.
- getCcModelFilenames
  
  public Map<String,String> getCcModelFilenames()
  
  Returns an unmodifiable map of compute capability to model filename mappings.
- getMetricTags
  
  public Map<String,String> getMetricTags()
  
  Returns an unmodifiable map of custom metric tags for this model.
- toString
  
  public String toString()
  
  Overrides:
  
  toString in class Object

Class TritonModelConfig

Method Summary

Methods inherited from class java.lang.Object

Method Details

fromProto

getName

getPlatform

getBackend

getRuntime

getMaxBatchSize

getDefaultModelFilename

getCcModelFilenames

getMetricTags

toString