ControlMonkey expands cloud configuration disaster recovery for improved resilience

ControlMonkey augments its platform to protect and restore observability configurations, safeguarding monitoring environments during outages.

ControlMonkey, an Infrastructure Governance and Resilience platform, has expanded its Cloud Configuration Disaster Recovery services to include observability and monitoring platforms. This development builds on its recent expansion into network configuration recovery and extends support for observability configurations across platforms such as Datadog, New Relic, Dynatrace, Grafana Cloud, and Splunk.

The platform can automatically capture daily snapshots of key observability configurations, including dashboards, alert rules, monitors, escalation policies, and service monitoring definitions. This enables teams to restore monitoring environments and maintain operational visibility during incidents and outages.

Observability platforms are often used by engineers during outages to diagnose issues and coordinate incident response. However, dashboards, alert policies, and monitoring rules are typically created manually and are not always included in disaster recovery plans. Configuration changes, accidental deletions, or overly permissive AI agents can result in reduced visibility during critical events.

ControlMonkey enables teams to restore observability configurations from versioned configuration snapshots, reducing manual rebuilding effort and helping maintain consistency and recoverability of monitoring environments during incidents.

Unlike traditional disaster recovery approaches that focus primarily on data restoration, ControlMonkey provides automated recovery for cloud infrastructure and configuration across infrastructure, network, and observability layers within the cloud control plane.

Additional capabilities include automated configuration recovery for dashboards, alert rules, monitors, and observability policies; continuous configuration monitoring to track changes and detect configuration drift across monitoring platforms; and a resilience score providing visibility into configuration coverage and recovery capability across infrastructure, network, and observability layers.

With this expansion, ControlMonkey aims to support organisations in maintaining recoverable observability configurations and operational continuity during outages.
SATLINE’s core infrastructure achieves Tier III alignment, with upgrades intended to improve...
Rebellions secures $400 million in pre-IPO funding, with plans to expand in the U.S. and scale its...
The new Gcore Radar report highlights the surge in DDoS attacks, driven by sophisticated techniques...
CoreWeave strikes a $21 billion deal to help strengthen Meta's AI capabilities with advanced cloud...
NTT Data has opened a data centre in Kyoto, Japan, adding capacity to digital infrastructure in the...
LSEG has partnered with Dell Technologies to develop a private cloud platform and optimise its...
Toshiba and Quantum Bridge Technologies have demonstrated an international network for quantum-safe...
A new reseller partnership between GNM and LINX aims to strengthen network interconnection options...