High Availability

The Tetra Data Platform (TDP) relies on the fault tolerance built into AWS services and is designed to closely follow AWS best practices for each of these platform components:

  • Data Lake Files - Stored in Amazon S3, which is designed for 99.999999999% durability and 99.99% availability over a given year, and to sustain the concurrent loss of data in two facilities.
  • RDS Database - Multi-AZ deployment in which data is continuously and synchronously replicated to a standby instance in another availability zone. If an infrastructure problem occurs, the database automatically fails over to the standby.
  • Elasticsearch - Configured by default with three master nodes, plus two data nodes spread across two availability zones. An infrastructure failure in one availability zone will not impact the cluster.
  • ECS Services - All important platform services run at least two instances, each in its own availability zone. As a result, the failure of a single instance does not impact the overall platform.
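
To put the availability figures above in perspective, the following is a minimal arithmetic sketch, not part of TDP. It converts an availability percentage into expected downtime per year and estimates the combined availability of redundant instances. The function names are hypothetical, the 99% per-instance figure is an illustrative assumption rather than a TDP or AWS number, and the redundancy formula assumes instance failures are independent, which real deployments only approximate.

```python
MINUTES_PER_YEAR = 365 * 24 * 60  # 525,600 minutes, ignoring leap years


def downtime_minutes_per_year(availability: float) -> float:
    """Expected unavailable minutes per year for an availability fraction (0..1)."""
    return (1.0 - availability) * MINUTES_PER_YEAR


def redundant_availability(single: float, copies: int) -> float:
    """Combined availability of `copies` redundant instances.

    Assumes failures are independent and that one healthy instance
    is enough to keep the service running.
    """
    return 1.0 - (1.0 - single) ** copies


if __name__ == "__main__":
    # 99.99% is the S3 availability design target quoted above.
    minutes = downtime_minutes_per_year(0.9999)
    print(f"99.99% availability: about {minutes:.1f} minutes of downtime per year")

    # Hypothetical per-instance availability, chosen only for illustration.
    combined = redundant_availability(0.99, 2)
    print(f"Two instances at 99% each: {combined:.4%} combined availability")
```

Under these assumptions, 99.99% availability corresponds to roughly 52.6 minutes of downtime per year, and running a second instance in another availability zone reduces the chance that both are unavailable at once to the product of the individual failure probabilities.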