File Compression of Tetra Data Lake RAW Files

Some of the RAW files that are stored in the Tetra Data Lake have been compressed by using GZip. Due to compressions, the file sizes for RAW files are different before and after a file uploads to the Data Lake.

Compressed files result from the following sources:

  • Manual file uploads using the TDP User Interface
  • These Tetra Agent uploads using the GDC/"No Connector" agents:
    • Tetra Chromeleon Agent
    • Tetra Empower Agent
    • Tetra File-Log Agent
    • Tetra LabX Agent
    • Tetra UNICORN Agent
  • These Tetra Agent uploads using the Direct S3 Upload feature:
    • Tetra Chromeleon Agent
    • Tetra Empower Agent
    • Tetra LabX Agent
    • Tetra UNICORN Agent
  • These data sources:
    • Egnyte
    • Box
    • HRB Cellario
    • SDC
  • Data Pipelines (Windows, Python, and NodeJS task scripts)