Disaster Recovery (AWS DR)

Disaster Recovery/Fail-Over Architecture (AWS DR)

There are several ways you can take advantage of Active Geo-Replication when designing your service for business continuity and implement AWS DR. The choice depends on several factors, but the main purpose is to optimize for specific application patterns. Other factors include ease of failover management, service level agreement, and traffic latency and costs.

One option is Active-Passive compute with coupled failover

This option is best suited for applications with a single active deployment per geographic region. The application requires colocation of the compute tier and the data tier, this may be because of a chatty interface between the business logic and the database. It may also be critical to avoid additional traffic cost between the two tiers. For geo-redundancy, the database is geo-replicated to another region and the compute tier is deployed there also. Connections to the MS SQL database are pre-configured for each location.

Sample use cases
for Disaster Recovery

The following figure illustrates the service configuration before the failure. In this case all user traffic is directed to the active compute deployment in the primary location.

Designing AWS DR Solutions

The following figure illustrates the service configuration after a failure. As shown in the figure, the compute deployment and the back-end database remain co-located, but are now in the secondary location.

Designing Cloud Disaster Recovery Solutions 02

This strategy has the following advantages and trade-offs:

Advantages:

The SQL connection can be statically pre-configured in each region.

Trade-offs:

  • The entire service (all tiers) is treated as a single failure domain. This means that a failure of any part of the service results in the failover of the entire service.
  • The failover results in some data loss.

SLA calculation:

  • RPO varies depending on EC2 instance type and number of concurrently replicated databases and other factors (from 2 seconds to a few mins depending on number of tables, number of databases replicated per CLOUDBASIC server, replication mode /i.e. replication latency will increase if change tracking processes are serialized to avoid server overloading, in the cases where a smaller size CLOUDBASIC server is used to lower EC2 cost/)
    RTO = DNS record change + database state change + application verification test

RDS-R MS SQL Multi-AZ/AR

RDS Multi-AR for MS SQL Server Web+ w/ RedShift and Hadoop Support

Upgrade from Multi-AZ to Multi-AR (Cross-Region; Readable-Replicas) without increasing cost. Unlike Multi-AZ for SQL Server, RDS Multi-AR creates Cross-Region Readable-Replicas, which can be used for reporting, DR, data-locality and off-loading of your primary databases. Creating of standard RDS SQL Server .bak backups, with the option to be downloaded outside of AWS (unlike the RDS snapshots) or automatically uploaded to AWS Glacier, comes standard.


View product details

RDS-R SQL Multi-AZ/AR

RDS AlwaysOn/Geo-Replicate for MS SQL Server w/ RedShift & Hadoop Support

Create Multi-AZ (Multi Availability Zones) or Multi-AR (Multi Availability Regions) Read-Replicas (Web Edition support) for HA, DR, Data Locality and/or to off-load your primary RDS/EC2 or On-Prem SQL servers, even if you already operate RDS Multi-AZ SQL Enterprise. Feed data to RedShift & EMR/Hadoop.


View product details