Kubeflow vs Airflow Forces a Hard ML Pipeline Choice

Choosing between Kubeflow vs Airflow ML pipelines is not just a tooling preference—it affects how your team authors workflows, runs containers, schedules retraining, tracks artifacts, debugs failures, and maintains production infrastructure. Both Kubeflow and Apache Airflow can orchestrate machine learning pipelines, but the source data consistently shows they were built around different assumptions: Kubeflow is ML-native and Kubernetes-first, while Airflow is a mature, general-purpose workflow orchestrator widely used by data teams.

This analysis breaks down where each tool fits, where it creates operational friction, and when using both may be the most practical architecture.

Why ML Pipeline Orchestration Is Different From Data Orchestration

Traditional data orchestration usually focuses on moving, transforming, validating, and loading data on a schedule. Machine learning orchestration adds more moving parts: feature computation, model training, evaluation, hyperparameter tuning, artifact tracking, model registration, deployment, and sometimes serving.

A production ML pipeline often needs to run steps in a strict order:

Ingest or validate data
Compute features
Train a model
Evaluate the model
Apply a quality gate
Register or deploy the model

That is why the orchestrator matters. The source data describes every production ML system as needing something that runs data ingestion, feature computation, training, evaluation, and deployment steps in the right order, while also handling failures and providing visibility.

Key insight: Airflow can orchestrate ML workflows, but it is not ML-specific. Kubeflow Pipelines was built specifically for ML workflows on Kubernetes, with native concepts for datasets, models, metrics, artifacts, and containerized pipeline steps.

Data orchestration vs ML orchestration

Requirement	Traditional Data Pipeline	ML Pipeline
Main objective	Move and transform data	Train, evaluate, and operationalize models
Common steps	ETL, reporting, API actions	Validation, feature engineering, training, evaluation, serving
Artifact needs	Logs, transformed data	Datasets, models, metrics, experiment outputs
Compute profile	Often batch-oriented	May require GPUs, containers, and large-scale training
Reproducibility needs	Important	Critical for model comparison and retraining
Native fit from sources	Airflow	Kubeflow Pipelines

Airflow is described as strong for data engineering and ETL processes, automated reporting, and general workflow orchestration. Kubeflow, by contrast, is described as an end-to-end machine learning stack orchestration toolkit for deploying, scaling, and managing large-scale ML systems on Kubernetes.

That distinction is the foundation of the Kubeflow vs Airflow ML pipelines decision.

Kubeflow and Airflow: Core Concepts

At a high level, both tools let teams define workflows as directed dependency graphs. The similarities largely stop there.

What Kubeflow is

Kubeflow is an open-source ML platform designed to simplify deployment, orchestration, and scaling of machine learning workflows on Kubernetes. It provides tools for model training, serving, monitoring, notebooks, and pipeline execution in a cloud-native environment.

Source data highlights these Kubeflow capabilities:

Kubeflow Pipelines: Builds and deploys portable, scalable ML workflows based on Docker containers.
Central dashboard: Gives access to installed Kubeflow components in a cluster and supports multi-user isolation.
Notebook support: Users can launch Jupyter notebook servers, RStudio, or VSCode from the dashboard in Kubeflow v1.3+.
Framework compatibility: Works with Scikit-learn, TensorFlow, PyTorch, MXNet, XGBoost, and related libraries.
TensorBoard integration: Helps visualize ML training.
Katib: Supports hyperparameter tuning by running pipelines with different hyperparameters.
KFServing: Supports model serving, including multi-model serving.

Kubeflow’s model is Kubernetes-native: pipeline stages are converted into Kubernetes jobs, and workflows run as containerized components.

What Airflow is

Apache Airflow is an open-source platform for programmatically authoring, scheduling, and monitoring workflows. It represents workflows as Directed Acyclic Graphs, or DAGs, where each node is a task and edges represent dependencies.

Source data highlights these Airflow capabilities:

Scheduler: Monitors DAGs and tasks, triggers scheduled workflows, and submits tasks to executors.
Executors: Run task instances and can be swapped depending on installation requirements.
Webserver/UI: Shows task status, logs, and workflow state.
Python authoring: Workflows are created using Python.
Jinja templating: Supports parameterized scripts.
Advanced scheduling semantics: Allows regular pipeline execution.
Integrations: Includes operators and providers for Google Cloud Platform, Amazon Web Services, Azure, Databricks, and other third-party platforms.
Notifications: Can send email or Slack notifications when processes complete or fail.

Airflow is widely used by data teams and is described in the source data as having a broader and more established user base than Kubeflow.

Quick comparison: Kubeflow vs Airflow ML pipelines

Dimension	Kubeflow	Apache Airflow
Primary use	Machine learning pipelines and workflows	General-purpose data pipelines and workflow orchestration
Core purpose	End-to-end ML workflow management	Scheduling, orchestrating, and managing workflows
Execution model	Kubernetes pods / Kubernetes jobs	Workers such as Celery or Kubernetes-based execution
Kubernetes requirement	Designed to run primarily on Kubernetes	Does not require Kubernetes, but can run with Kubernetes tooling
Container isolation	Per-step container isolation	Not container-isolated by default; KubernetesExecutor or KubernetesPodOperator can be used
GPU support	Native GPU scheduling in Kubeflow Pipelines examples	Manual Kubernetes configuration in Airflow examples
ML artifact tracking	Built-in datasets, models, and metrics in Kubeflow Pipelines	External tooling needed, such as MLflow according to source data
Caching	Built-in input-based caching	Limited; no built-in pipeline-step caching cited in source data
UI	Pipeline, artifacts, dashboard	DAGs, logs, task status
Setup complexity	High; requires Kubernetes	Medium to high depending on deployment
Best fit	Kubernetes-native ML teams	Data teams adding ML to existing orchestration

Pipeline Authoring and Developer Experience

Pipeline authoring is one of the biggest practical differences between Kubeflow and Airflow.

Airflow authoring: Python DAGs and operators

Airflow pipelines are written in Python as DAGs. The source data emphasizes that Airflow is easy to use for people familiar with Python, and that users can create workflows using Python features such as date formats, loops, and task definitions.

A typical Airflow ML DAG can use the KubernetesPodOperator to run ML pipeline stages in containers:

with DAG(
    dag_id="ml_training_pipeline",
    description="Nightly model retraining pipeline",
    schedule_interval="0 2 * * *",
    catchup=False,
    tags=["ml", "training"],
) as dag:
    validate_data = KubernetesPodOperator(
        task_id="validate_data",
        image="registry.company.com/ml/data-validator:latest",
        cmds=["python", "validate.py"],
        namespace="ml-pipelines",
        get_logs=True,
    )

    train_model = KubernetesPodOperator(
        task_id="train_model",
        image="registry.company.com/ml/trainer:latest",
        cmds=["python", "train.py"],
        namespace="ml-pipelines",
        resources={
            "request_memory": "8Gi",
            "request_cpu": "4",
            "limit_gpu": "1",
        },
        node_selector={"gpu": "true"},
    )

    validate_data >> train_model

This model is familiar to data engineers. You define tasks, connect dependencies, schedule runs, and inspect logs in the Airflow UI.

However, source data also identifies ML-specific limitations:

Artifact tracking: Airflow has no native ML artifact tracking; external tools such as MLflow are needed.
Caching: Airflow has no built-in caching of pipeline steps in the cited comparison.
Data passing: XCom has size limits, so it is not suited for large datasets or model artifacts.
Container isolation: Airflow is not container-isolated by default.

Kubeflow authoring: typed ML components

Kubeflow Pipelines uses Python components with typed inputs and outputs such as Dataset, Model, and Metrics. Each step runs in its own container with its own dependencies.

from kfp import dsl
from kfp.dsl import Input, Output, Dataset, Model, Metrics

@dsl.component(
    base_image="python:3.11-slim",
    packages_to_install=["pandas", "scikit-learn"]
)
def compute_features(data: Input[Dataset], features: Output[Dataset]):
    import pandas as pd
    df = pd.read_parquet(data.path)
    # Feature engineering...
    df.to_parquet(features.path)

@dsl.component(base_image="nvcr.io/nvidia/pytorch:24.01-py3")
def train_model(
    features: Input[Dataset],
    model: Output[Model],
    metrics: Output[Metrics]
):
    import pandas as pd
    import joblib
    from sklearn.ensemble import GradientBoostingClassifier

    df = pd.read_parquet(features.path)
    X, y = df.drop("target", axis=1), df["target"]

    clf = GradientBoostingClassifier(n_estimators=200, max_depth=8)
    clf.fit(X, y)

    joblib.dump(clf, model.path)
    metrics.log_metric("train_accuracy", clf.score(X, y))

Kubeflow’s authoring model is more ML-specific. The source data calls out typed inputs and outputs as a way to prevent wiring errors, while built-in artifact tracking captures datasets, models, and metrics.

Practical takeaway: Airflow feels natural for data engineering teams that already think in DAGs and schedules. Kubeflow feels more natural when each ML step needs its own container, dependency set, GPU requirements, typed artifacts, and ML metadata.

Developer experience comparison

Developer Experience Area	Kubeflow	Airflow
Authoring language	Python via Kubeflow Pipelines SDK	Python DAGs
Workflow abstraction	ML pipeline components	DAG tasks
Dependency isolation	Per-step containers	Requires Kubernetes-based execution for container isolation
ML types	Dataset, Model, Metrics	General task outputs; XCom has size limits
Parameterization	Pipeline components and inputs	Jinja templates and Python
Learning curve	Steeper, due to ML-specific features and Kubernetes	Easier for Python/data engineering teams

Kubernetes-Native Workloads and Infrastructure Fit

Infrastructure fit is often the deciding factor in Kubeflow vs Airflow ML pipelines.

Kubeflow is Kubernetes-first

Kubeflow is designed to run primarily on Kubernetes. The source data repeatedly describes Kubeflow as Kubernetes-based and cloud-native. It works by arranging ML components on Kubernetes and converting stages in the data science process into Kubernetes jobs.

This has meaningful advantages:

Scalability: Kubeflow leverages Kubernetes for scaling.
Container isolation: Every Kubeflow Pipelines step runs in its own container.
GPU scheduling: Kubeflow Pipelines supports native GPU scheduling.
Model serving: KFServing provides serverless inferencing on Kubernetes and interfaces for frameworks such as PyTorch, TensorFlow, and XGBoost.
Multi-model serving: KFServing can serve several models at once, though source data warns this can quickly use available cluster resources as query volume increases.

But the trade-off is operational weight. Kubeflow requires a Kubernetes cluster and is described as having heavier infrastructure overhead and a steeper learning curve.

Airflow can run with or without Kubernetes

Airflow does not require Kubernetes. It can run using different executors, and source data notes that Kubernetes support is available when needed through Airflow’s Kubernetes tooling, including the Kubernetes Airflow Operator and KubernetesPodOperator.

This gives Airflow more deployment flexibility:

Non-Kubernetes teams: Can use Airflow without adopting Kubernetes as a prerequisite.
Kubernetes users: Can run containerized ML tasks through KubernetesPodOperator.
Hybrid workloads: Can orchestrate ETL, reports, API actions, notifications, and ML training from the same platform.

However, Airflow’s Kubernetes integration does not make it an ML-native platform. GPU support and container isolation require explicit configuration in the Airflow examples, while Kubeflow treats them as core workflow concepts.

Infrastructure fit comparison

Infrastructure Question	Choose Kubeflow When...	Choose Airflow When...
Is Kubernetes already standard?	Your ML workloads already run on Kubernetes	Kubernetes is optional or only used for some tasks
Do steps need separate containers?	Every step needs isolated dependencies	Isolation is useful but not required by default
Do you need GPU scheduling?	GPU training is central to the pipeline	GPU usage can be manually configured
Do you need model serving support?	You want Kubeflow components such as KFServing	You only need to orchestrate deployment tasks
Do you need broad workflow coverage?	The workflow is mostly ML-specific	The workflow includes ETL, reporting, APIs, and ML

Scheduling, Retraining, and Dependency Management

Airflow’s reputation is built on scheduling. Kubeflow’s strength is ML pipeline execution and reproducibility, especially in Kubernetes environments.

Airflow scheduling strengths

The source data describes Airflow as strong in scheduling and monitoring. Its scheduler runs continuously, monitors DAGs and tasks, triggers scheduled workflows, and submits tasks to executors.

Airflow supports regular pipeline execution through advanced scheduling semantics. In the cited ML example, a nightly retraining pipeline runs at 2 AM daily using:

schedule_interval="0 2 * * *"

Airflow also supports retries, retry delays, and email notifications:

default_args = {
    "owner": "ml-team",
    "email_on_failure": True,
    "email": ["[email protected]"],
    "retries": 2,
    "retry_delay": timedelta(minutes=5),
}

These features make Airflow a strong fit for recurring retraining jobs, especially when retraining depends on upstream data pipelines already managed in Airflow.

Kubeflow dependency strengths

Kubeflow Pipelines focuses more on ML workflow structure. Dependencies are defined through component inputs and outputs. For example, a feature computation step can consume the validated dataset output from a validation step, and the training step can consume the feature dataset output.

This model is especially useful when ML artifacts—not just task completion—define the dependency graph.

Kubeflow also includes built-in caching that can skip steps with unchanged inputs. That matters for iterative ML workflows where validation, feature generation, or training steps may be expensive.

Dependency management comparison

Capability	Kubeflow	Airflow
Regular scheduling	Not emphasized as its core differentiator in source data	Strong scheduling semantics
Retraining workflows	Strong for ML-native training workflows	Strong for scheduled retraining DAGs
Dependency expression	Typed component inputs and outputs	DAG task dependencies
Retries	Supported through pipeline execution patterns, though source data emphasizes Airflow more here	Explicit retries and retry delays in examples
Step caching	Built-in input-based caching	Limited; no built-in pipeline-step caching cited
External event workflows	Not covered in detail by sources	Source data notes scheduled and externally triggered tasks

Decision point: If retraining is mainly a scheduled workflow problem, Airflow is often the simpler fit. If retraining is an ML artifact and reproducibility problem, Kubeflow has stronger native concepts.

Metadata Tracking, Artifacts, and Reproducibility

Metadata and artifacts are where Kubeflow becomes more attractive for ML-specific teams.

Kubeflow’s ML-native metadata model

Kubeflow Pipelines supports ML artifact tracking for datasets, models, and metrics. The cited examples use typed outputs such as:

Dataset
Model
Metrics

A validation step can log row counts and column counts. A training step can output a model and log training accuracy.

stats.log_metric("rows", len(df))
stats.log_metric("columns", len(df.columns))
metrics.log_metric("train_accuracy", clf.score(X, y))

This is useful for reproducibility because pipeline outputs are treated as first-class ML artifacts, not just task logs.

Kubeflow also integrates TensorBoard for visualizing training and includes Katib for hyperparameter tuning. Katib runs pipelines with different hyperparameters to find an optimal ML model, according to the source data.

Airflow needs external ML tracking

Airflow provides logs, task status, DAG views, and monitoring. But the source data clearly states that Airflow has no native ML artifact tracking and typically needs an external tool such as MLflow for ML metrics and model artifacts.

That does not make Airflow unsuitable for ML. It means Airflow is usually the orchestrator, not the ML system of record.

For example, an Airflow task may run a container that trains a model, but the model artifact, metrics, and experiment lineage need to be handled outside Airflow.

Artifact and reproducibility comparison

Area	Kubeflow	Airflow
Dataset tracking	Built-in typed Dataset artifacts	External system needed
Model tracking	Built-in typed Model artifacts	External system needed
Metrics tracking	Built-in Metrics outputs	External system needed
Experiment tracking	Kubeflow supports ML experiment concepts	Not native
Hyperparameter tuning	Katib included in Kubeflow ecosystem	Requires custom implementation or external tooling
Training visualization	TensorBoard integration	Logs and task UI; ML visualization requires external tools

If your commercial evaluation includes auditability, model comparison, and reproducibility, this section may carry more weight than scheduling alone.

Monitoring, Debugging, and Failure Recovery

Both platforms include user interfaces, but they expose different operational views.

Airflow monitoring and debugging

Airflow’s UI provides a full view of DAG status, completed tasks, in-progress tasks, and logs. Source data describes the webserver as a UI that displays job status, allows users to view, trigger, and debug DAGs and tasks, and helps interact with the database and read logs from remote file storage.

Airflow is also known in the source material for:

Monitoring task execution
Viewing logs
Managing workflows
Sending failure notifications
Retrying failed tasks
Visualizing DAGs in production

For teams running hundreds or many more DAGs, the source data describes Airflow as battle-tested at scale, including environments with 1000+ DAGs.

Kubeflow monitoring and debugging

Kubeflow provides a central dashboard for deployed components in a cluster. Kubeflow Pipelines includes a UI to manage jobs, an engine for scheduling multi-step ML workflows, an SDK to define and manipulate pipelines, and notebooks to interact with the system.

Kubeflow’s ML-specific debugging advantages include visibility into:

Pipeline runs
Component-level artifacts
Datasets
Models
Metrics
Training visualization through TensorBoard

For model training problems, Kubeflow’s metadata and artifact visibility can be more relevant than a generic task log. For infrastructure failures, teams still need Kubernetes fluency because pipeline steps run as Kubernetes workloads.

Monitoring comparison

Monitoring Need	Kubeflow	Airflow
DAG/task status	Pipeline UI	Strong DAG UI
Logs	Available through pipeline and Kubernetes context	Strong log visibility in UI
ML artifacts	Built-in artifacts	External tool required
Training visualization	TensorBoard integration	External tooling needed
Failure notifications	Not emphasized in sources	Email and Slack notifications mentioned
Debugging complexity	Requires Kubernetes understanding	Familiar to data engineering teams

Operational warning: Kubeflow may expose richer ML context, but debugging often requires Kubernetes knowledge. Airflow may be easier to operate for general workflow failures, but it does not natively understand ML artifacts.

Operational Complexity and Team Requirements

The right orchestrator depends heavily on team skills.

Kubeflow team requirements

Kubeflow is powerful, but the source data repeatedly flags complexity:

Requires Kubernetes
Steeper learning curve
Heavier infrastructure overhead
Substantial Kubernetes resources
More complex setup and maintenance
Less mature than Airflow for non-ML tasks

Kubeflow works best when the team already has strong Kubernetes and MLOps capabilities. It is especially compelling when your ML workflows need native support for model training, tuning, serving, notebooks, artifacts, and GPU-backed workloads.

However, Kubeflow may be excessive if your main problem is simply scheduling Python scripts or coordinating ETL jobs.

Airflow team requirements

Airflow is generally more accessible to Python-fluent data teams. Source data says an Airflow pipeline can be set up by someone familiar with Python, and Airflow’s ecosystem includes many reusable operators and integrations.

Airflow also has a larger and more established community than Kubeflow according to multiple sources. It is described as being used by significantly more engineers and companies, with more GitHub forks and stars than Kubeflow at the time of writing.

Airflow is also more broadly applicable. It can orchestrate:

ETL processes
Data pipelines
Automated report generation
ML training jobs
Notifications
System checks
API actions

The trade-off is that Airflow requires additional ML tooling for artifact tracking, experiment tracking, caching, and model-specific workflow patterns.

Team-fit comparison

Team Profile	Better Fit	Why
Data engineering team adding ML retraining	Airflow	Mature scheduler, Python DAGs, broad integrations
Kubernetes-native ML platform team	Kubeflow	Native Kubernetes execution, ML artifacts, GPU scheduling
Team needing model serving as part of platform	Kubeflow	KFServing and multi-model serving capabilities
Team managing many non-ML workflows	Airflow	General-purpose orchestration and broad ecosystem
Team without Kubernetes experience	Airflow	Kubeflow requires Kubernetes
Team prioritizing ML reproducibility	Kubeflow	Built-in datasets, models, metrics, and caching

Decision Framework: Kubeflow, Airflow, or Both?

The most useful way to decide is not “Which tool is better?” but “Which operating model matches our pipeline?”

Choose Kubeflow when ML lifecycle depth matters most

Kubeflow is the stronger fit when your pipeline is primarily about machine learning lifecycle management.

Choose Kubeflow when:

Kubernetes is already standard: Your infrastructure team already operates Kubernetes clusters.
You need ML-native artifacts: Datasets, models, and metrics should be tracked as first-class pipeline outputs.
You need step isolation: Each pipeline component needs its own container and dependencies.
You use GPUs: Native GPU scheduling matters for training workloads.
You need model serving: KFServing and multi-model serving are relevant to your platform.
You need hyperparameter tuning: Katib is part of the Kubeflow ecosystem.
You want notebook integration: Jupyter notebooks, RStudio, or VSCode from the dashboard are useful to your workflow.

Kubeflow is not the lightest option. It is best for teams prepared to manage Kubernetes-based ML infrastructure.

Choose Airflow when orchestration breadth matters most

Airflow is the stronger fit when your organization needs a mature scheduler and workflow platform across many domains.

Choose Airflow when:

You already use Airflow: Adding ML DAGs to an existing orchestration environment may be simpler than adopting a new platform.
Your workflows include ETL and ML: Airflow handles data engineering, reporting, API actions, notifications, and ML tasks.
Scheduling is central: Regular retraining, dependency timing, retries, and alerts are primary needs.
Your team is Python-heavy: Airflow DAGs are approachable for Python users.
You rely on integrations: Airflow has extensive providers for AWS, GCP, Azure, Databricks, and third-party platforms.
You do not require Kubernetes: Airflow can run without Kubernetes, though it can use Kubernetes when needed.

Airflow is not ML-native. If you choose it for production ML, plan for external artifact tracking, experiment management, model registry, and caching where needed.

Use both when data orchestration and ML platform needs are separate

In some organizations, the best architecture is not Kubeflow or Airflow—it is Kubeflow and Airflow.

A practical split is:

Responsibility	Tool Fit
Upstream ETL and data availability	Airflow
Scheduled retraining trigger	Airflow
ML training pipeline with artifacts	Kubeflow
GPU-backed training steps	Kubeflow
Model metrics and artifacts	Kubeflow
Cross-system workflow dependencies	Airflow
Model serving on Kubernetes	Kubeflow

This pattern works when Airflow is already the enterprise scheduler and Kubeflow is the ML execution platform. Airflow can coordinate when a pipeline should run, while Kubeflow handles ML-native execution, artifacts, and model lifecycle concerns.

Featured-snippet decision table

If Your Priority Is...	Recommended Direction
Mature scheduling and broad integrations	Airflow
End-to-end ML workflow management	Kubeflow
Kubernetes-native ML workloads	Kubeflow
General ETL plus occasional ML	Airflow
Built-in ML artifact tracking	Kubeflow
Existing Python DAG-based workflows	Airflow
Native model serving components	Kubeflow
Avoiding Kubernetes as a requirement	Airflow
Combining enterprise scheduling with ML-native execution	Both

Bottom Line

For Kubeflow vs Airflow ML pipelines, the decision comes down to whether your main problem is ML lifecycle management or general workflow orchestration.

Kubeflow is better aligned with Kubernetes-native ML teams that need per-step container isolation, GPU scheduling, typed artifacts, built-in caching, datasets, models, metrics, TensorBoard integration, Katib, notebooks, and serving components such as KFServing. Its trade-off is higher setup and maintenance complexity because it requires Kubernetes and more specialized operational skills.

Airflow is better aligned with data engineering teams that need mature scheduling, dependency management, monitoring, logs, retries, notifications, and broad integrations across cloud and third-party platforms. Its trade-off is that ML-specific capabilities such as artifact tracking, experiment tracking, and pipeline-step caching require external tools or custom implementation.

For many production organizations, the cleanest answer is to use Airflow for broad orchestration and Kubeflow for ML-native execution when Kubernetes-based model training and artifact tracking become important.

FAQ

Is Kubeflow based on Airflow?

No. Kubeflow is not based on Airflow. The source data describes Kubeflow as built on Kubernetes for machine learning workflows, while Airflow is a general-purpose workflow orchestration tool.

What is the main difference between Apache Airflow and Kubeflow Pipelines?

Apache Airflow is a general workflow orchestration platform used to author, schedule, and monitor tasks across many domains. Kubeflow Pipelines is designed specifically for end-to-end machine learning workflows, including model training, tuning, artifacts, and ML-specific pipeline components.

Does Airflow support machine learning pipelines?

Yes. Airflow can orchestrate ML pipelines and is widely used for workflows that include validation, feature computation, training, evaluation, and registration tasks. However, it does not provide native ML artifact tracking or built-in ML pipeline-step caching in the cited source data.

Does Kubeflow require Kubernetes?

Yes. Kubeflow is designed to run primarily on Kubernetes. Its pipeline stages run as Kubernetes workloads, and its strengths—such as scalability, container isolation, GPU scheduling, and serving—depend on Kubernetes infrastructure.

Which is easier to adopt: Kubeflow or Airflow?

Airflow is generally easier for teams already familiar with Python and DAG-based workflow orchestration. Kubeflow has a steeper learning curve because it is ML-specific and requires Kubernetes resources and operational knowledge.

Can Kubeflow and Airflow be used together?

Yes. A common pattern is to use Airflow for enterprise scheduling, upstream ETL, alerts, and cross-system dependencies, while using Kubeflow for ML-native pipeline execution, artifacts, metrics, GPU-backed training, and Kubernetes-based model serving.