Data Warehouse and Relational Database
TetraScience provides features and services to pipeline your data into Data Warehouse and SQL Database solutions. The most commonly used are:
- Amazon Redshift
- Amazon Relational Database Service (Amazon RDS)
- Amazon Aurora
Optional Features and ServicesThese features and services are optional and are not provided out-of-the-box. To use these features and services, please contact your Account Manager and Solution Architect.
Amazon Redshift | Amazon Aurora RDS | AWS MySQL/PostgreSQL RDS | |
---|---|---|---|
Website | |||
Use Cases | In contrast to an OLTP (Online Transactional Processing) database, an OLAP (Online Analytic Processing) database is designed to process large datasets quickly to answer questions about data. Common use cases for an OLAP database are:
Other popular OLAP databases include:
| OLTP (Online Transactional Processing) is the most commonly thought of database. OLTP is designed to serve as a persistent state store for front-end applications. They quickly look up specific information as well as transactional procedures, such as INSERT, UPDATE, or DELETE. Some common tasks performed by OLTP systems include:
| Similar to Amazon Aurora |
Cost | High | Medium | Low |
Storage Limit | PetaByte Scale | 16 TB | 16 TB |
Concurrent Connections | 50 | 1000s | 100s |
TetraScience recommends that you begin with the less complicated and most cost-effective solution, for example Amazon RDS MySQL and RDS PostgreSQL. This enables you to use most of the available common tools and frameworks to quickly get started.
After you become more familiar and confident of your business needs and the types of operations you want to perform, you can then transition to Amazon RedShift or Amazon Aurora. At that point, you are more confident with the type of queries and aggregations you want to perform, and can better optimize your use cases by designing the correct table in Amazon Redshift.
How to Move Data from TetraScience Data Lake to Data Warehouse and Relational Database
The TetraScience delivery team will provide AWS Glue ETL pipelines to move the latest data from Amazon Simple Storage Service (Amazon S3) to your data warehouse. We will handle edge cases, such as when the data is re-processed (for example, scientists may recalculate the peaks on the chromatogram) and your data warehouse must contain the latest changes.
AWS Glue ETL PipelinesUse of the AWS Glue ETL pipelines will incur an extra cost based on workload and traffic.
AWS Oracle RDS
To use Oracle databases, TetraScience recommends that you consider Amazon RDS for Oracle, and you review the information in this table:
Question | Answer |
---|---|
Does Amazon RDS for Oracle provide licenses? |
|
Can the data be encrypted using Oracle's Transparent Data Encryption? | Yes, this is compatible with Amazon RDS for Oracle. |
Does Amazon RDS for Oracle support Spring Data JPA? | Yes, this is compatible with Amazon RDS for Oracle. Reference: |
Documentation Feedback
Do you have questions about our documentation or suggestions for how we can improve it? Start a discussion in TetraConnect Hub. For access, see Access the TetraConnect Hub.
NOTEFeedback isn't part of the official TetraScience product documentation. TetraScience doesn't warrant or make any guarantees about the feedback provided, including its accuracy, relevance, or reliability. All feedback is subject to the terms set forth in the TetraConnect Hub Community Guidelines.
Updated 18 days ago