Data Backup

The Tetra Data Platform (TDP) supports data backup for these platform components:

  • Data Lake Files - The S3 Data Lake bucket has versioning enabled. Delete is implemented as a soft delete, where the actual file content is never removed. TetraScience adds delete markers to the files. Additionally, if the TDP is accidentally uninstalled from either CloudFormation or ServiceCatalog, the buckets are not removed.

  • RDS Database

    • By default, custom backups are performed once per day, with a retention period of 30 days. You can use the deployment parameters to modify the frequency and retention. There is a default AWS limit of having a maximum of 100 snapshots per DB instance, however, you can increase it by contacting AWS support.
    • AWS Automated Backups are performed daily with a retention period of seven days.
  • Elasticsearch - Backed up four times per day in an S3 bucket. You can also recreate data in Elasticsearch from S3 by using an existing ECS re-processing task.