Fly.io Status

All Systems Operational

About This Site

This page is for updates about global incidents. It does not include updates about routine hardware failures or isolated infrastructure events that have limited impact. For a personalized view of all events that might affect your apps, please check the personalized status page in your Fly Organization's dashboard. For all internal incidents and other activities, please check Infra Log.

Uptime over the past 90 days.

Customer Applications Operational

Dashboard Operational

Machines API Operational

Regional Availability Operational

AMS - Amsterdam, Netherlands Operational

ARN - Stockholm, Sweden Operational

ATL - Atlanta, Georgia (US) Operational

BOG - Bogotá, Colombia Operational

BOM - Mumbai, India Operational

CDG - Paris, France Operational

DEN - Denver, Colorado (US) Operational

DFW - Dallas, Texas (US) Operational

EWR - Secaucus, NJ (US) Operational

EZE - Ezeiza, Argentina Operational

FRA - Frankfurt, Germany Operational

GDL - Guadalajara, Mexico Operational

GIG - Rio de Janeiro, Brazil Operational

GRU - Sao Paulo, Brazil Operational

HKG - Hong Kong Operational

IAD - Ashburn, Virginia (US) Operational

JNB - Johannesburg, South Africa Operational

LAX - Los Angeles, California (US) Operational

LHR - London, United Kingdom Operational

MAD - Madrid, Spain Operational

MEL - Melbourne, Australia Operational

MIA - Miami, Florida (US) Operational

NRT - Tokyo, Japan Operational

ORD - Chicago, Illinois (US) Operational

OTP - Bucharest, Romania Operational

PHX - Phoenix, Arizona (US) Operational

QRO - Querétaro, Mexico Operational

SCL - Santiago, Chile Operational

SEA - Seattle, Washington (US) Operational

SIN - Singapore Operational

SJC - San Jose, California (US) Operational

SYD - Sydney, Australia Operational

WAW - Warsaw, Poland Operational

YUL - Montréal, Canada Operational

YYZ - Toronto, Canada Operational

Persistent Storage (Volumes) Operational

Deployments Operational

Remote Builds Operational

Logs Operational

Metrics Operational

SSL/TLS Certificate Provisioning Operational

UDP Anycast Operational

Fly Machine Image Registry 1 Operational

Fly Machine Image Registry 2 Operational

Extensions Operational

Upstash for Redis Operational

DNS Operational

Fly Machine .internal DNS Operational

Fly Machine External DNS Operational

*.fly.dev Nameservers Operational

*.flyio.net Nameservers Operational

Billing Operational

Usage Metrics API Operational

Stripe API Connection Operational

Corrosion Operational

Managed Postgres Operational

99.99% uptime over the past 90 days

Management Plane - ORD Operational

100.0% uptime over the past 90 days

Management Plane - IAD Operational

99.95% uptime over the past 90 days

Management Plane - FRA Operational

99.99% uptime over the past 90 days

Management Plane - GRU Operational

100.0% uptime over the past 90 days

Management Plane - LAX Operational

100.0% uptime over the past 90 days

Management Plane - SYD Operational

100.0% uptime over the past 90 days

Phoenix.new Operational

Status levels: Operational, Degraded Performance, Partial Outage, Major Outage, Maintenance

Past Incidents

Jul 29, 2025

Fly.io APIs and dashboard inaccessible

Resolved - This incident has been resolved.
Jul 29, 18:21 UTC

Monitoring - A fix has been implemented and we are monitoring the results.
Jul 29, 17:50 UTC

Investigating - Attempts to log in to the Fly.io dashboard, most `flyctl` commands, and API requests are returning HTTP 503 errors. We are investigating.
Jul 29, 17:39 UTC

Jul 28, 2025

No incidents reported.

Jul 27, 2025

No incidents reported.

Jul 26, 2025

No incidents reported.

Jul 25, 2025

No incidents reported.

Jul 24, 2025

AMS network issues

Resolved - This incident has been resolved.
Jul 24, 15:58 UTC

Monitoring - A fix has been implemented on all hosts in AMS and we are seeing networking return to normal in the region. We are continuing to monitor networking in the region to ensure stability.
Jul 24, 15:35 UTC

Identified - The issue has been identified and an initial fix has been deployed. We are seeing improvements on most hosts in the region, and are continuing to work on restoring the remaining impacted hosts.
Apps with machines on impacted hosts may still experience network issues at this time.
Jul 24, 15:13 UTC

Investigating - We are investigating upstream network issues in the AMS region. Apps may experience network issues.
Jul 24, 14:45 UTC

Jul 23, 2025

GraphQL API maintenance

Completed - The scheduled maintenance has been completed.
Jul 23, 23:00 UTC

In progress - Scheduled maintenance is currently in progress. We will provide updates as necessary.
Jul 23, 22:00 UTC

Update - This activity has been postponed by 2 hours due to a conflict with an internal incident. The previous "in progress" message was posted due to an automation error on the status page.
Jul 23, 20:57 UTC

Scheduled - We will be undergoing scheduled maintenance during this time.
Jul 23, 20:56 UTC

In progress - Scheduled maintenance is currently in progress. We will provide updates as necessary.
Jul 23, 20:00 UTC

Scheduled - We will be performing a migration of the Redis instance backing Sidekiq jobs for our GraphQL API server. No user-facing downtime is expected, but mutations may return errors for a couple of seconds as the switchover happens.
Jul 21, 16:03 UTC

Network maintenance in IAD (Ashburn, Virginia, USA)

Completed - The scheduled maintenance has been completed.
Jul 23, 14:00 UTC

In progress - Scheduled maintenance is currently in progress. We will provide updates as necessary.
Jul 23, 12:00 UTC

Scheduled - An upstream provider is performing network maintenance in IAD, from 2025-07-23 at 12:00 UTC (8:00am EDT local time) to 14:00 UTC (10:00am EDT local time). You may experience a short total loss of connectivity within the scheduled maintenance window hours.
Jul 17, 20:24 UTC

Jul 22, 2025

No incidents reported.

Jul 21, 2025

No incidents reported.

Jul 20, 2025

No incidents reported.

Jul 19, 2025

No incidents reported.

Jul 18, 2025

No incidents reported.

Jul 17, 2025

Elevated "App Not Found" errors

Resolved - This incident has been resolved.
Jul 17, 22:50 UTC

Monitoring - We have completed rolling out the fix to all apps impacted by this issue. We are seeing "App Not Found" errors decrease. We are continuing to monitor to ensure stability.
Jul 17, 21:28 UTC

Update - We are continuing to roll out the fix for this issue to impacted apps, and some users are seeing operations return to normal. We will confirm once the rollout is complete.
Jul 17, 19:41 UTC

Identified - We are seeing elevated rates of "App Not Found" errors impacting some apps. Users may see these errors when interacting with impacted apps via the Machines API, `flyctl`, or their dashboard.

We have identified the cause of these errors and are working on a fix.
Jul 17, 16:00 UTC

Elevated API issues

Resolved - All systems are now operating normally. We’ll continue to monitor closely, but no further impact is expected.
Jul 17, 14:29 UTC

Monitoring - The reseeding process is complete and API responses are back to normal. We continue to monitor the system to ensure everything remains healthy.
Jul 17, 12:28 UTC

Identified - We're making progress with reseeding our state store and are now working on speeding up the reseed process overall. Users may continue to see inconsistencies with our API.
Jul 17, 01:42 UTC

Update - We're continuing to investigate this issue.
Jul 17, 00:40 UTC

Update - We've started a rollout to our global state store and are noticing degraded performance for our API. Users may see inconsistent data related to their machines.
Jul 17, 00:09 UTC

Investigating - We're seeing an elevated number of HTTP 500 errors when calling the Machines API or when trying to launch a machine.
Jul 17, 00:02 UTC

Jul 16, 2025

Network issues in SJC

Resolved - This incident has been resolved.
Jul 16, 17:41 UTC

Monitoring - A fix has been implemented and we are monitoring the results.
Jul 16, 17:23 UTC

Investigating - We are observing high packet loss in the SJC region. Apps may experience network issues.
Jul 16, 16:57 UTC

Jul 15, 2025

Networking issues in SJC

Resolved - This incident has been resolved. Networking in SJC has remained stable over the past two hours.

This was caused by dark fiber maintenance that had a larger-than-anticipated impact. As a result, some transport capacity was lost, leading to packet loss to and from our servers.
Jul 15, 12:47 UTC

Update - We are continuing to investigate this issue.
Jul 15, 08:23 UTC

Investigating - We are currently investigating networking issues in SJC. Customers may see higher latency and elevated packet loss connecting to machines in this region.
Jul 15, 08:01 UTC
