Architecture

The diagram below illustrates the architecture of the Alauda AI platform.

architecture

Component Description

Components in Alauda Container Platform Layer

ComponentDescriptionTypeLicense
GPU (Alauda Build of Nvidia GPU Device Plugin)Provides GPU resources for AI workloadsOpen sourceApache Version 2.0
HAMi (Alauda Build of Hami, Alauda Build of Hami-WebUI)GPU resource slicing, sharing and schedulingOpen sourceApache Version 2.0
Alauda Build of DCGM-ExporterGPU monitoringOpen sourceApache Version 2.0
Alauda Build of NPU OperatorProvides NPU resources for AI workloadsOpen sourceApache Version 2.0
Alauda Build of Node Feature DiscoveryDetects hardware features of cluster nodesOpen sourceApache Version 2.0
DRA (Alauda build of NVIDIA DRA Driver for GPUs)Dynamic Resource Allocation for GPU sharingOpen sourceApache Version 2.0
Volcano (Alauda support for Volcano)Batch job scheduling for AI workloadsOpen sourceApache Version 2.0
Kueue (Alauda Build of Kueue)Job scheduling for AI workloadsOpen sourceApache Version 2.0
Milvus (Alauda Build of Milvus)Vector database for embedding storage and retrievalOpen sourceApache Version 2.0
PGVector (Alauda support for PostgreSQL)PostgreSQL extension for vector similarity searchOpen sourceThe PostgreSQL License

Components in AI Platform Layer

ComponentDescriptionTypeLicense
Model Catalog (Alauda AI/Alauda AI Essentials)Centralized repository for managing AI models and their metadataProprietaryCommercial
Model Registry (Alauda support for Kubeflow Model Registry)Keep track of AI model versions and metadata for each namespaceOpen sourceApache Version 2.0
Datasets (Alauda AI/Alauda AI Essentials)Centralized repository for managing datasets and their metadataProprietaryCommercial
Labeling (Alauda support for Label Studio)Data labeling tool for creating labeled datasetsOpen sourceApache Version 2.0
Feature Store (Alauda support for FeatureForm)Centralized repository for managing and serving machine learning featuresOpen sourceMozilla Public License (MPL)
Workbench (Alauda AI Workbench)Web-based interface for managing AI projects, including model training and inferenceProprietaryCommercial
Training Jobs (Alauda support for Kubeflow Trainer v2)Kubernetes-native training job managementOpen sourceApache Version 2.0
Kubeflow Pipelines (Alauda support for Kubeflow Base & Alauda support for Kubeflow Pipeline)Workflow orchestration for AI pipelinesOpen sourceApache Version 2.0
Guardrails (Coming soon)AI safety and governance frameworkOpen sourceApache Version 2.0
Drift & Bias Detection (Alauda support for Evidently)Monitoring for model performance degradation and biasOpen sourceApache Version 2.0
Experiment Tracking (Alauda support for MLFlow)Tracking and comparing machine learning experimentsOpen sourceApache Version 2.0

Components in GenAI Platform Layer

ComponentDescriptionTypeLicense
Kserve (Alauda AI Model Serving/Alauda Generative AI)Kubernetes-native model serving frameworkOpen sourceApache Version 2.0
vLLM (Alauda AI Model Serving/Alauda Generative AI)High-performance model inference engine for large language modelsOpen sourceApache Version 2.0
llm-d (Alauda Generative AI)Distributed inference engine for large language modelsOpen sourceApache Version 2.0
Model as a Service (Alauda build of Envoy AI Gateway)API gateway for serving AI models as a serviceOpen sourceApache Version 2.0
Fine-tuningTools integrated with the workbench for fine-tuning large language models, e.g. transformers, accelerate, llama-factory etc.Open source-
Training (Alauda support for Kubeflow Trainer v2)Kubernetes-native training job managementOpen sourceApache Version 2.0
Model QuantizationTools integrated with the workbench for model quantization, e.g. llm-compressor etc.Open source-
EvaluationTools integrated with the workbench for evaluating model performance, e.g. lm-evaluation-harness etc.Open source-
Llama Stack (Alauda build of Llama Stack)Framework for building applications with large language modelsOpen sourceMIT
LangchainTools integrated with the workbench for building LLM applications using LangchainOpen sourceMIT
Dify (Alauda support for Dify)Platform for building AI assistants and chatbotsOpen sourceApache Version 2.0 (modified)
MCP ServersCan integrate with various MCP servers--
Agent Tracing (Alauda support for MLflow)Tracing and monitoring for AI agentsOpen sourceApache Version 2.0
Agent EvaluationTools integrated with the workbench for evaluating AI agents, e.g. RAGAS etc.Open source-