For Platform Engineers & Developers
Documentation for teams who install, configure, and maintain the KFP Operator platform
Documentation for teams who install, configure, maintain, and extend the KFP Operator platform.
Who This Is For
This documentation is for you if you:
- Deploy and configure the KFP Operator in Kubernetes clusters
- Manage platform services and infrastructure
- Implement security, RBAC, and compliance policies
- Set up monitoring, logging, and alerting
- Develop extensions or contribute to the operator
- Set up multi-tenant platforms for ML teams
What You’ll Learn
- Installation & deployment in various environments
- Advanced operator configuration and tuning
- System architecture and internals
- Enterprise security patterns and RBAC
- Production monitoring and maintenance
- Development and platform extension
Implementation Path
1. Installation & Setup
Get the operator running in your cluster:
2. Security & RBAC
Implement enterprise security and access control:
3. Operations & Monitoring
Set up production monitoring and maintenance:
4. Advanced Topics
Master advanced features and customization:
Prerequisites
Required Knowledge
- Kubernetes cluster administration
- Helm charts and package management
- Docker and container registries
- YAML and configuration management
Technical Requirements
- Kubernetes cluster admin access (v1.21+)
- Helm 3.x for installing and managing the operator
- kubectl configured with cluster admin permissions
- Container registry for storing operator and pipeline images
Recommended Experience
- Kubernetes operators and controllers
- GitOps workflows and tools
- Prometheus, Grafana, and observability
- Continuous integration and deployment
Quick Deployment Checklist
Deployment Scenarios
- Development: Single-node clusters (minikube, kind), minimal resources
- Production: Multi-node HA clusters, advanced security, monitoring
- Cloud: Managed K8s services (GKE, EKS, AKS), auto-scaling
- GitOps: ArgoCD/Flux deployment, Infrastructure as Code
Architecture Overview
Key components:
- Controller Manager: Core operator logic and reconciliation
- Admission Webhooks: Validation and mutation of resources
- Provider Services: Abstraction layer for ML platforms
- Argo Integration: Workflow execution and management
- Event System: Pipeline event processing and distribution
Security Considerations
Access Control
- RBAC for operator components
- Service account management
- Network policies and segmentation
- Secret and credential management
Data Security
- Encryption at rest and in transit
- Secure artifact storage
- Pipeline data isolation
- Audit logging and compliance
Threat Mitigation
- Container image scanning
- Runtime security monitoring
- Vulnerability management
- Incident response procedures
Getting Help
Kubernetes Ecosystem
Next: Installation Guide.
Complete guide to install and configure the KFP Operator in your Kubernetes cluster
Safe upgrade procedures and best practices for the KFP Operator platform
Advanced configuration options for the KFP Operator platform
Deep dive into KFP Operator architecture and system design
Complete technical reference for the KFP Operator event system and schemas