Notice history

June 2025

Investigating
06 28 2025 at 11:26
Our team is aware of, and is looking into, issues that are causing bot approvals to fail and affecting other parts of our service, such as updating assets (banners and so on).

Resolved
06 21 2025 at 05:32
Production is back up. This incident was automatically resolved by Instatus monitoring.
Investigating
06 21 2025 at 05:22
Production is down at the moment. This incident was automatically created by Instatus monitoring.

Postmortem
06 21 2025 at 09:38
📝 Postmortem: June 2025 Service Outage
Incident Duration:
June 16, 2025 – June 21, 2025
Status: Resolved
Root Cause: Misconfigured infrastructure and networking components
📌 Summary
Between June 16 and June 21, 2025, our services experienced a prolonged and critical disruption. This impacted system accessibility, network stability, and overall deployment reliability. The root causes were traced back to multiple misconfigurations within our new infrastructure stack, primarily involving our Dokploy instance, networking setup, and reverse proxy (Traefik).
⚙️ Technical Cause
Upon investigation, we identified several compounding issues:
- Misconfigured Dokploy Instance: The initial deployment lacked critical network isolation and routing configurations, leading to service timeouts and container miscommunication.
- Traefik Reverse Proxy: Misconfigured routing and TLS handling caused failed ingress connections and prevented external traffic from reaching internal services.
- Networking Setup Errors: Overlapping subnets and improperly bridged networks led to intermittent connectivity between deployer and host machines, further destabilizing the system.
- Missing Health Checks: Some containers were not being monitored properly, which delayed automatic restarts and extended service downtime.
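The overlapping-subnet problem listed above is the kind of issue that can be caught before deployment. The following is a minimal sketch, not the team's actual tooling: the function name `find_overlapping_subnets` and the network names and CIDR ranges in `planned` are hypothetical and chosen only for illustration. It relies on Python's standard `ipaddress` module.

```python
import ipaddress
from itertools import combinations


def find_overlapping_subnets(networks):
    """Return pairs of network names whose CIDR ranges overlap.

    `networks` maps a network name to a CIDR string.
    """
    parsed = {name: ipaddress.ip_network(cidr) for name, cidr in networks.items()}
    overlaps = []
    # Compare every pair of networks exactly once.
    for (name_a, net_a), (name_b, net_b) in combinations(parsed.items(), 2):
        if net_a.overlaps(net_b):
            overlaps.append((name_a, name_b))
    return overlaps


# Hypothetical network plan; names and ranges are illustrative only.
planned = {
    "deployer": "10.0.0.0/16",
    "host": "10.0.1.0/24",
    "proxy": "172.20.0.0/24",
}

for name_a, name_b in find_overlapping_subnets(planned):
    print(f"Overlap detected between {name_a} and {name_b}")
```

In this illustrative plan, the `host` range sits inside the `deployer` range, so that pair is reported. A check like this could run as part of a configuration review before new networks are created.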
🚑 Immediate Actions Taken
- Isolated deployer and host networks to stabilize inter-service traffic.
- Corrected routing rules and middleware configuration in Traefik.
- Rebuilt the Dokploy configuration with clearer network separation and improved error handling.
- Re-enabled and audited health checks across services.
- Conducted live testing and verification to ensure full service restoration by June 21, 2025.
✅ Resolution and Recovery
The system was gradually stabilized beginning on June 16, with partial access restored within 30 minutes of our first major fix. However, additional network-level issues prolonged the resolution timeline. By June 21 at 3:35 AM, all services were fully restored and verified functional.
📚 Lessons Learned
- Configuration reviews must be enforced before production deployment of new infrastructure tools.
- Network planning (IP ranges, bridges, proxies) needs to be documented and peer-reviewed.
- Critical systems (like DNS routing, ingress, and orchestration layers) must have dedicated monitoring and rollback plans.
🔧 Preventative Measures
- Implement automated preflight checks in our deployment pipelines.
- Schedule recurring audits of proxy and ingress configurations.
- Build fallback container orchestration playbooks for Dokploy-based deployments.
- Expand post-deploy smoke testing to catch network-level regressions earlier.
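As one possible shape for the post-deploy smoke testing mentioned above, here is a minimal sketch using only Python's standard library. The function names `check_endpoint` and `run_smoke_test`, the `opener` parameter, and the example URL are assumptions made for illustration; they are not taken from the team's pipeline.

```python
import urllib.error
import urllib.request


def check_endpoint(url, timeout=5.0, opener=urllib.request.urlopen):
    """Return True if the URL answers with a 2xx status within the timeout."""
    try:
        with opener(url, timeout=timeout) as response:
            return 200 <= response.status < 300
    except (urllib.error.URLError, OSError, ValueError):
        # Connection failures, HTTP errors, timeouts, and malformed URLs
        # all count as a failed check.
        return False


def run_smoke_test(urls, **kwargs):
    """Check every URL and return the list of those that failed."""
    return [url for url in urls if not check_endpoint(url, **kwargs)]


if __name__ == "__main__":
    # Hypothetical endpoint; replace with the services that must be reachable.
    failed = run_smoke_test(["https://example.com/"])
    if failed:
        raise SystemExit(f"Smoke test failed for: {', '.join(failed)}")
    print("All endpoints responded successfully.")
```

The `opener` parameter exists so the check can be exercised with a stand-in during testing. Running a script like this from outside the host network would also exercise the ingress path, which is where the routing and TLS problems described in this postmortem appeared.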
🗣️ Final Note
We sincerely apologize for the extended downtime and the impact it had on your experience. While our intention was to modernize our infrastructure, we recognize that our transition planning and oversight fell short. This will be addressed internally, and improvements are already underway.
Thank you for your patience and continued support.
Resolved
06 21 2025 at 09:35
This incident has been resolved.
Update
06 17 2025 at 22:51
We apologize that these issues have been ongoing for so long. Our team is still working to resolve them and to properly configure our new Dokploy network.
Update
06 17 2025 at 01:29
Some additional issues have arisen, and we are working on a fix.
Update
06 17 2025 at 00:32
We have isolated the networks between our deployer and host machine to help stabilize long-term usage. Our team is currently finishing the final setup stages. Services should start being restored within the next 20 to 30 minutes.
Identified
06 16 2025 at 21:46
We have identified that this issue is caused by Dokploy. We are working on resolving it now.

Resolved
06 21 2025 at 04:37
Production is back up. This incident was automatically resolved by Instatus monitoring.
Investigating
06 16 2025 at 21:34
Production is down at the moment. This incident was automatically created by Instatus monitoring.

Resolved
06 12 2025 at 21:05
Production is back up. This incident was automatically resolved by Instatus monitoring.
Investigating
06 12 2025 at 20:46
Production is down at the moment. This incident was automatically created by Instatus monitoring.

May 2025

Resolved
05 31 2025 at 00:22
Production is back up. This incident was automatically resolved by Instatus monitoring.
Investigating
05 29 2025 at 08:58
Production is down at the moment. This incident was automatically created by Instatus monitoring.

Resolved
05 29 2025 at 06:29
Production is back up. This incident was automatically resolved by Instatus monitoring.
Investigating
05 29 2025 at 05:52
Production is down at the moment. This incident was automatically created by Instatus monitoring.

DNS Configuration

Completed
05 31 2025 at 20:51
Our team is still working hard to restore our services. At this time, 90% of services have been restored, and we are actively working to resolve the remaining issues.
In progress
05 29 2025 at 09:44
We are still working on this maintenance and will be extending its end date.
Completed
05 29 2025 at 08:47
Maintenance completed successfully.
Planned
05 29 2025 at 05:47
Our team will be looking into the DNS capabilities of our new deployment service. Our services may drop intermittently during this time. Please bear with us while we resolve it!
In progress
05 29 2025 at 05:47
Maintenance is now in progress.

Resolved
05 28 2025 at 22:29
Production is back up. This incident was automatically resolved by Instatus monitoring.
Investigating
05 28 2025 at 22:09
Production is down at the moment. This incident was automatically created by Instatus monitoring.

Resolved
06 01 2025 at 01:54
This incident has been resolved.
Investigating
05 28 2025 at 20:58
We are aware that our widget server is still offline and are working to restore it as soon as possible. Our team overlooked this during the initial restart of the server, and we apologize for that.

April 2025

No notices reported this month

April 2025 to June 2025

Infinity Bots - Notice history

Partially degraded performance