Getting Started

Quick start guide for ML engineers to deploy their first pipeline with the KFP Operator

Getting Started with ML Pipelines

This guide will help you deploy your first ML pipeline using the KFP Operator.

What You’ll Do

Deploy a pipeline using Kubernetes resources
Run the pipeline with custom parameters
Set up automated scheduling
Monitor pipeline status

Prerequisites

Access to a Kubernetes cluster with KFP Operator installed
kubectl configured to access your cluster
Basic familiarity with Kubernetes concepts (pods, services)
Container registry access (Docker Hub, GCR, etc.) for custom images

Need the operator installed? Check with your platform team or see the Platform Engineers documentation for installation instructions.

Core Concepts

The KFP Operator extends Kubernetes with custom resources for ML pipelines:

Pipeline: Defines a reusable ML pipeline template
Run: Represents a single execution of a pipeline
RunConfiguration: Configures automated pipeline execution
Provider: Manages connections to ML orchestration platforms

Instead of uploading pipelines through a UI, you define them as Kubernetes manifests:

apiVersion: pipelines.kubeflow.org/v1beta1
kind: Pipeline
metadata:
  name: my-training-pipeline
spec:
  provider: provider-namespace/provider-name
  image: "my-registry/ml-pipeline:v1.0.0"
  framework:
    name: tfx
    parameters:
      pipeline: my_pipeline.create_components

Quick Start Tutorial

Step 1: Verify Your Environment

First, check that the KFP Operator is running in your cluster:

# Check if the operator is installed
kubectl get pods -n kfp-operator-system

# Verify Custom Resource Definitions are available
kubectl get crd | grep pipelines.kubeflow.org

You should see the operator pods in the Running state and CRDs for the resource kinds described above, such as Pipeline, Run, RunConfiguration, and Provider.
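For scripted checks, the two commands above can be wrapped in small shell helpers. This is a minimal sketch, assuming the namespace and API group shown in this step; the function names operator_pods_running and operator_crd_count are made up for illustration.

```shell
#!/bin/sh
# Namespace and API group as used in this guide; adjust if your install differs.
NAMESPACE="kfp-operator-system"
API_GROUP="pipelines.kubeflow.org"

# Succeeds only if at least one pod exists and every pod reports STATUS=Running.
# Relies on the default "kubectl get pods" columns: NAME READY STATUS RESTARTS AGE.
operator_pods_running() {
  kubectl get pods -n "$NAMESPACE" --no-headers 2>/dev/null |
    awk '{ n++; if ($3 != "Running") bad++ } END { exit (n > 0 && bad == 0) ? 0 : 1 }'
}

# Prints how many CRD names end with the operator's API group.
operator_crd_count() {
  kubectl get crd -o name 2>/dev/null | grep -c "\.${API_GROUP}\$"
}
```

Call operator_pods_running in an if statement and compare operator_crd_count against the number of resource kinds you expect.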

Step 2: Check Available Providers

Providers connect the operator to ML orchestration platforms:

# List available providers
kubectl get providers

# Example output:
# NAME                STATUS   TYPE   AGE
# kubeflow-provider   Ready    kfp    5m

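If no providers are listed, they may live in a different namespace (try kubectl get providers --all-namespaces). If none exist anywhere, contact your platform team to set one up.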
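The manifests in this guide refer to a provider as provider-namespace/provider-name. The sketch below prints providers in that form so the value can be copied into spec.provider. It assumes Provider is a namespaced resource, which the placeholder suggests but this guide does not state; the function names are illustrative.

```shell
#!/bin/sh
# Prints one "<namespace>/<name>" line per Provider across all namespaces.
list_provider_refs() {
  kubectl get providers --all-namespaces \
    -o jsonpath='{range .items[*]}{.metadata.namespace}/{.metadata.name}{"\n"}{end}'
}

# Succeeds if the given "<namespace>/<name>" reference matches an existing Provider.
provider_ref_exists() {
  list_provider_refs | grep -qxF "$1"
}
```

For example, provider_ref_exists "my-namespace/kubeflow-provider" returns success only when that exact reference is present.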

Step 3: Create Your First Pipeline

Create a file called my-first-pipeline.yaml:

apiVersion: pipelines.kubeflow.org/v1beta1
kind: Pipeline
metadata:
  name: penguin-training
  namespace: default
spec:
  provider: provider-namespace/provider-name  # Replace with a provider from Step 2
  image: "gcr.io/kfp-operator/penguin-pipeline:latest"
  framework:
    name: tfx
    parameters:
      pipeline: penguin_pipeline.create_components
  env:
    - name: DATA_ROOT
      value: "gs://kfp-operator-examples/penguin-data"
    - name: MODEL_ROOT
      value: "gs://my-bucket/models"  # Replace with your bucket

Deploy the pipeline:

kubectl apply -f my-first-pipeline.yaml

Check the pipeline status:

kubectl get pipelines
# NAME              STATUS   PROVIDER            AGE
# penguin-training  Ready    kubeflow-provider   30s
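The overview lists running the pipeline with custom parameters. One way to start a single run is a Run resource. The sketch below assumes that a Run's spec takes the same provider, pipeline, and parameters fields as the spec.run block of the RunConfiguration shown in the next step. The file name penguin-run.yaml, the resource name, and the plural runs are illustrative assumptions, so check the API Reference before relying on them.

```shell
#!/bin/sh
# Write a one-off Run manifest (field layout assumed to mirror spec.run of a RunConfiguration).
cat > penguin-run.yaml <<'EOF'
apiVersion: pipelines.kubeflow.org/v1beta1
kind: Run
metadata:
  name: penguin-training-run-1   # hypothetical name
  namespace: default
spec:
  provider: provider-namespace/provider-name   # replace with your provider
  pipeline: penguin-training
  parameters:
    - name: num_epochs
      value: "10"
EOF

# Then, against a cluster with the operator installed:
#   kubectl apply -f penguin-run.yaml
#   kubectl get runs
```

Parameter values are quoted strings here, matching the style of the RunConfiguration example in this guide.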

Step 4: Set Up Automated Execution

Create a file called run-configuration.yaml to schedule the pipeline automatically:

apiVersion: pipelines.kubeflow.org/v1beta1
kind: RunConfiguration
metadata:
  name: daily-training
  namespace: default
spec:
  run:
    provider: provider-namespace/provider-name  # Replace with a provider from Step 2
    pipeline: penguin-training
    parameters:
      - name: num_epochs
        value: "10"
      - name: learning_rate
        value: "0.001"
  triggers:
    schedules:
      - cronExpression: "0 2 * * *"  # Daily at 2 AM
        startTime: "2024-01-01T00:00:00Z"  # Example window; adjust to dates that suit you
        endTime: "2024-12-31T23:59:59Z"
    onChange:
      - pipeline  # Also trigger a run when the pipeline changes

Apply the configuration:

kubectl apply -f run-configuration.yaml

# Verify it's scheduled
kubectl get runconfigurations
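To make the cronExpression easier to read, the helper below labels each field. It is a small sketch that assumes the standard five-field cron layout (minute, hour, day of month, month, day of week) implied by the "Daily at 2 AM" comment. Some schedulers accept an extra leading seconds field, so check what your provider expects. The function name describe_cron is made up for illustration.

```shell
#!/bin/sh
# Splits a five-field cron expression and prints each field with a label.
# Reading from a here-document avoids the shell expanding "*" into file names.
describe_cron() {
  read -r minute hour day_of_month month day_of_week extra <<EOF
$1
EOF
  if [ -z "$day_of_week" ] || [ -n "$extra" ]; then
    echo "expected exactly 5 fields: $1" >&2
    return 1
  fi
  printf 'minute=%s hour=%s day_of_month=%s month=%s day_of_week=%s\n' \
    "$minute" "$hour" "$day_of_month" "$month" "$day_of_week"
}

describe_cron "0 2 * * *"
# prints: minute=0 hour=2 day_of_month=* month=* day_of_week=*
```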

Complete!

You’ve successfully:

Deployed a pipeline using Kubernetes resources
Set up automated daily training
Executed a pipeline run with custom parameters
Learned how to monitor pipeline execution

Understanding What Happened

When you created the Pipeline resource, the KFP Operator:

Validated your pipeline specification
Compiled the pipeline for your target platform
Registered it with your ML orchestration platform (Kubeflow Pipelines)
Updated the status to show it’s ready

When you created the RunConfiguration resource, the operator:

Created a RunSchedule to schedule the workflow for future runs
Created a Run resource to execute the pipeline
Monitored execution and updated status
Published events for downstream automation
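The status updates described above can be followed from a script. The sketch below polls the STATUS column that kubectl get prints for these resources and lists related Kubernetes events. It assumes STATUS is the second column, as in the example output earlier in this guide, and that the operator records events against the resource by name. The function names are illustrative.

```shell
#!/bin/sh
# Prints the STATUS column for one resource.
# usage: resource_status <kind> <name> [namespace]
resource_status() {
  kubectl get "$1" "$2" -n "${3:-default}" --no-headers 2>/dev/null | awk '{ print $2 }'
}

# Polls until the resource reports the wanted status or the attempts run out.
# usage: wait_for_status <kind> <name> <status> [attempts] [delay-seconds]
wait_for_status() {
  attempts=${4:-30}
  delay=${5:-2}
  i=0
  while [ "$i" -lt "$attempts" ]; do
    [ "$(resource_status "$1" "$2")" = "$3" ] && return 0
    i=$((i + 1))
    sleep "$delay"
  done
  return 1
}

# Lists events that reference the named object, oldest first.
# usage: recent_events <name> [namespace]
recent_events() {
  kubectl get events -n "${2:-default}" \
    --field-selector "involvedObject.name=$1" --sort-by=.lastTimestamp
}
```

For example, wait_for_status pipelines penguin-training Ready blocks for up to about a minute with the defaults, then reports failure through its exit code.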

Key Benefits

Version Control: Your pipeline definitions are now in YAML files and can be versioned in Git
Reproducible: The same pipeline definition works across environments
Automated: Set-and-forget scheduling with RunConfiguration
Observable: Full visibility through kubectl and Kubernetes events

What’s Next?

Build Custom Pipelines: Create your own TFX pipelines
Pipeline Dependencies: Chain multiple pipelines together
Best Practices: Production ML engineering patterns
API Reference: Complete Custom Resource specifications
Troubleshooting: Debug common pipeline issues

Need Help?

If you encounter issues:

Check Troubleshooting for common problems
Review pipeline logs: kubectl logs -l workflows.argoproj.io/workflow=<workflow-name>
Check operator logs: kubectl logs -n kfp-operator-system deployment/kfp-operator-controller-manager
Ask for help: GitHub Discussions

Next: Training Pipeline Tutorial.
