Egnyte is a service that offers secure file sharing. Tetra Data Platform has a built-in integration that allows users to pull raw data files from their secure Egnyte storage and into the Tetra Data Lake.
TetraScience leverages Egnyte's API to constantly detect file change events in your Egnyte account and upload the files into our Data Lake and then trigger Data Pipelines.
Our Egnyte Connector currently tracks file creation events, including different versions of the same file. If you remove your file from Egnyte, TetraScience Data Lake will not mirror that and will not delete the files we collected.
Egnyte integration will track (listen) to three types of events in your Egnyte account:
- File has been uploaded (create event)
- File has been changed (update event)
- File has been copied from another Egnyte location (copy event)
The integration will detect the file creation in Egnyte every minute.
You can only have one Egnyte integration created by one Egnyte user in your organization. This is due to the rate limit of the Egnyte API and TetraScience is working on improvements to address this limitation.
For each environment, there should be one Egnyte application in Egnyte. One Egnyte application cannot be shared between different environments.
First, create a dedicated API user for this integration. For production usage, the best practice is to create an API user (standard user) dedicated for this integration. Recommend you name it: [email protected].
After the user is created, share the Egnyte folder that you would like the integration to track with the API user with viewer permission.
Organize your Egnyte folder
It is always a good idea to leverage the folder structure to organize your data, the best practice is to include your study number, project name/id, instrument name/id and etc in the folder path. For example:
If you are organizing data from your CRO, you can consider something like the following:
Updated about 2 months ago