For AI agents: a documentation index is available at the root level at /llms.txt and /llms-full.txt. Append /llms.txt to any URL for a page-level index, or .md for the markdown version of any page.
      • AstroFully-managed data operations, powered by Apache Airflow.
      • Astro Private CloudRun Airflow-as-a-service in your environment.
      • Professional ServicesExpert Airflow services for your enterprise's success.
    • Tools
      • Cosmos
      • Orbiter
      • CLI
      • AI SDK
      • Agents
      • Blueprint
      • UpdatesThe State of Airflow 2026See the insights from over 5,800 data practitioners in the full report. Download Now ➔
  • Customers
  • Docs
    • Insights
      • Blog
      • Webinars
      • Resource Library
      • Events
    • Education
      • Academy
      • What is Airflow?
  • Pricing
Get Started Free
    • Astro Private Cloud overview
    • Astro Private Cloud features
      • Configure metrics
      • Configure liveness and readiness probes
      • Forward logs to Amazon S3
      • Platform and deployment alerts
      • Logs configuration
      • Export task logs
    • Release and lifecycle policy
    • Support policy

Product

  • Platform Overview
  • Astro
  • Astro Observe
  • Astro Private Cloud
  • Security & Trust
  • Pricing

Tools & Services

  • Cosmos
  • Docs
  • Professional Services
  • Product Updates

Use Cases

  • AI Ops
  • Data Observability
  • ETL/ELT
  • ML Ops
  • Operational Analytics
  • All Use Cases

Industries

  • Financial Services
  • Gaming
  • Retail
  • Manufacturing
  • Healthcare
  • All Industries

Resources

  • Academy
  • eBooks & Guides
  • Blog
  • Webinars
  • Events
  • The Data Flowcast Podcast
  • All Resources

Airflow

  • What is Airflow
  • Airflow on Astro
  • Airflow 3.0
  • Airflow Upgrades
  • Airflow Use Cases
  • Airflow 2.x End of Life

Company

  • Our Story
  • Customers
  • Newsroom
  • Careers
  • Contact

Support

  • Knowledge Base
  • Status
  • Contact Support
GitHubYouTubeLinkedInx
  • Legal
  • Privacy
  • Terms of Service
  • Consent Preferences

  • Do Not Sell or Share My Personal information
  • Limit the Use Of My Sensitive Personal Information

Apache Airflow®, Airflow, and the Airflow logo are trademarks of the Apache Software Foundation. Copyright © Astronomer 2026. All rights reserved.

LogoLogo
On this page
  • StatsD configuration (default)
  • Airflow configuration
  • Prometheus integration
  • Grafana dashboards
  • Alerting
  • Built-in alerts
  • Key metrics
  • Best practices
Platform Observability

Configure metrics

Edit this page
Built with

Astro Private Cloud (APC) provides multiple options for collecting and exporting Airflow metrics including StatsD, OpenTelemetry (OTEL), and Prometheus integration.

StatsD configuration (default)

StatsD resource limits are managed at the API level via componentsConfig and apply to all components — they can’t be configured independently per component.

1resources:
2 requests:
3 cpu: "100m"
4 memory: "384Mi"
5 limits:
6 cpu: "100m"
7 memory: "384Mi"

Airflow configuration

1[metrics]
2statsd_on = True
3statsd_host = localhost
4statsd_port = 8125
5statsd_prefix = airflow

Prometheus integration

1prometheus:
2 enabled: true
3 retention: 15d
4 persistence:
5 enabled: true
6 size: 100Gi

Grafana dashboards

Access Grafana at:

https://grafana.<platform-domain>

Pre-built dashboards include:

  • Airflow Dag performance
  • Task execution metrics
  • Scheduler health
  • Worker utilization

Alerting

1alertmanager:
2 enabled: true
3 config:
4 route:
5 receiver: 'platform'
6 receivers:
7 - name: 'platform'
8 webhook_configs:
9 - url: 'http://houston:8871/v1/alerts'

Built-in alerts

  • AirflowDeploymentUnhealthy
  • AirflowSchedulerUnhealthy
  • AirflowTasksPendingIncreasing

Key metrics

MetricDescription
airflow_dagrun_duration_secondsDag run duration
airflow_ti_successesSuccessful task instances
airflow_ti_failuresFailed task instances
airflow_scheduler_heartbeatScheduler health
airflow_executor_queued_tasksQueued task count

Best practices

  • Set appropriate retention based on storage capacity.
  • Use OTEL for multi-backend export.
  • Configure alerts for critical health metrics.
  • Monitor task queue depth for scaling needs.