All Systems Operational

About This Site

This page is for updates about global incidents. It does not include updates about routine hardware failures or isolated infrastructure events that have limited impact. For a personalized view of all events that might affect your apps, please check the personalized status page in your Fly Organization's dashboard. For all internal incidents and other activities, please check the Infra Log.

Customer Applications Operational
Dashboard Operational
Machines API Operational
Regional Availability Operational
AMS - Amsterdam, Netherlands Operational
ARN - Stockholm, Sweden Operational
ATL - Atlanta, Georgia (US) Operational
BOG - Bogotá, Colombia Operational
BOM - Mumbai, India Operational
CDG - Paris, France Operational
DEN - Denver, Colorado (US) Operational
DFW - Dallas, Texas (US) Operational
EWR - Secaucus, NJ (US) Operational
EZE - Ezeiza, Argentina Operational
FRA - Frankfurt, Germany Operational
GDL - Guadalajara, Mexico Operational
GIG - Rio de Janeiro, Brazil Operational
GRU - Sao Paulo, Brazil Operational
HKG - Hong Kong Operational
IAD - Ashburn, Virginia (US) Operational
JNB - Johannesburg, South Africa Operational
LAX - Los Angeles, California (US) Operational
LHR - London, United Kingdom Operational
MAD - Madrid, Spain Operational
MEL - Melbourne, Australia Operational
MIA - Miami, Florida (US) Operational
NRT - Tokyo, Japan Operational
ORD - Chicago, Illinois (US) Operational
OTP - Bucharest, Romania Operational
PHX - Phoenix, Arizona (US) Operational
QRO - Querétaro, Mexico Operational
SCL - Santiago, Chile Operational
SEA - Seattle, Washington (US) Operational
SIN - Singapore Operational
SJC - San Jose, California (US) Operational
SYD - Sydney, Australia Operational
WAW - Warsaw, Poland Operational
YUL - Montréal, Canada Operational
YYZ - Toronto, Canada Operational
Persistent Storage (Volumes) Operational
Deployments Operational
Remote Builds Operational
Logs Operational
Metrics Operational
SSL/TLS Certificate Provisioning Operational
UDP Anycast Operational
Fly Machine Image Registry 1 Operational
Fly Machine Image Registry 2 Operational
Extensions Operational
Upstash for Redis Operational
DNS Operational
Fly Machine .internal DNS Operational
Fly Machine External DNS Operational
*.fly.dev Nameservers Operational
*.flyio.net Nameservers Operational
Billing Operational
Usage Metrics API Operational
Stripe API Connection Operational
Corrosion Operational
Managed Postgres Operational (99.99% uptime over the past 90 days)
Management Plane - ORD Operational (100.0% uptime over the past 90 days)
Management Plane - IAD Operational (99.95% uptime over the past 90 days)
Management Plane - FRA Operational (99.99% uptime over the past 90 days)
Management Plane - GRU Operational (100.0% uptime over the past 90 days)
Management Plane - LAX Operational (100.0% uptime over the past 90 days)
Management Plane - SYD Operational (100.0% uptime over the past 90 days)
Phoenix.new Operational
Jul 29, 2025
Resolved - This incident has been resolved.
Jul 29, 18:21 UTC
Monitoring - A fix has been implemented and we are monitoring the results.
Jul 29, 17:50 UTC
Investigating - Attempts to log into the Fly.io dashboard, most `flyctl` commands, and API requests are returning HTTP 503 errors. We are investigating.
Jul 29, 17:39 UTC
Jul 28, 2025

No incidents reported.

Jul 27, 2025

No incidents reported.

Jul 26, 2025

No incidents reported.

Jul 25, 2025

No incidents reported.

Jul 24, 2025
Resolved - This incident has been resolved.
Jul 24, 15:58 UTC
Monitoring - A fix has been implemented on all hosts in AMS and we are seeing networking return to normal in the region. We are continuing to monitor networking in the region to ensure stability.
Jul 24, 15:35 UTC
Identified - The issue has been identified and an initial fix has been deployed. We are seeing improvements on most hosts in the region, and are continuing to work on restoring the remaining impacted hosts.
Apps with machines on impacted hosts may still experience network issues at this time.

Jul 24, 15:13 UTC
Investigating - We are investigating upstream network issues in the AMS region. Apps may experience network issues.
Jul 24, 14:45 UTC
Jul 23, 2025
Completed - The scheduled maintenance has been completed.
Jul 23, 23:00 UTC
In progress - Scheduled maintenance is currently in progress. We will provide updates as necessary.
Jul 23, 22:00 UTC
Update - This activity has been postponed by 2 hours due to a conflict with an internal incident. The previous "in progress" message was posted due to an automation error on the status page.
Jul 23, 20:57 UTC
Scheduled - We will be undergoing scheduled maintenance during this time.
Jul 23, 20:56 UTC
In progress - Scheduled maintenance is currently in progress. We will provide updates as necessary.
Jul 23, 20:00 UTC
Scheduled - We will be performing a migration of the Redis instance backing Sidekiq jobs for our GraphQL API server. No user-facing downtime is expected, but mutations may return errors for a couple of seconds during the switchover.
Jul 21, 16:03 UTC
Completed - The scheduled maintenance has been completed.
Jul 23, 14:00 UTC
In progress - Scheduled maintenance is currently in progress. We will provide updates as necessary.
Jul 23, 12:00 UTC
Scheduled - An upstream provider is performing network maintenance in IAD, from 2025-07-23 at 12:00 UTC (8:00am EDT local time) to 14:00 UTC (10:00am EDT local time). You may experience a short total loss of connectivity within the scheduled maintenance window hours.
Jul 17, 20:24 UTC
Jul 22, 2025

No incidents reported.

Jul 21, 2025

No incidents reported.

Jul 20, 2025

No incidents reported.

Jul 19, 2025

No incidents reported.

Jul 18, 2025

No incidents reported.

Jul 17, 2025
Resolved - This incident has been resolved.
Jul 17, 22:50 UTC
Monitoring - We have completed rolling out the fix to all apps impacted by this issue. We are seeing "App Not Found" errors decrease. We are continuing to monitor to ensure stability.
Jul 17, 21:28 UTC
Update - We are continuing to roll out the fix for this issue to impacted apps, and some users are seeing operations return to normal. We will confirm once the rollout is fully complete.
Jul 17, 19:41 UTC
Identified - We are seeing elevated rates of "App not found" errors impacting some apps. Users may see these errors when interacting with impacted apps via the Machines API, `flyctl`, or their dashboard.

We have identified the cause of these errors and are working on a fix.

Jul 17, 16:00 UTC
Resolved - All systems are now operating normally. We’ll continue to monitor closely, but no further impact is expected.
Jul 17, 14:29 UTC
Monitoring - The reseeding process is complete and API responses are back to normal. We continue to monitor the system to ensure everything remains healthy.
Jul 17, 12:28 UTC
Identified - We're making progress with reseeding our state store and are now working on speeding up the reseed process overall. Users may continue to see inconsistencies with our API.
Jul 17, 01:42 UTC
Update - We're continuing to investigate this issue.
Jul 17, 00:40 UTC
Update - We've started a rollout to our global state store and are noticing degraded performance for our API. Users may see inconsistent data related to their machines.
Jul 17, 00:09 UTC
Investigating - We're seeing an elevated number of HTTP 500 errors when calling the Machines API or when trying to launch a machine.
Jul 17, 00:02 UTC
Jul 16, 2025
Resolved - This incident has been resolved.
Jul 16, 17:41 UTC
Monitoring - A fix has been implemented and we are monitoring the results.
Jul 16, 17:23 UTC
Investigating - We are observing high packet loss in the SJC region. Apps may experience network issues.
Jul 16, 16:57 UTC
Jul 15, 2025
Resolved - This incident has been resolved. Networking in SJC has remained stable over the past two hours.

This was caused by dark fiber maintenance that had a larger-than-anticipated impact.
As a result, some transport capacity was lost, leading to packet loss to and from our servers.

Jul 15, 12:47 UTC
Update - We are continuing to investigate this issue.
Jul 15, 08:23 UTC
Investigating - We are currently investigating networking issues in SJC. Customers may see higher latency and elevated packet loss connecting to machines in this region.
Jul 15, 08:01 UTC