For Tetra Data Platform (TDP) versions >= 2.22.0, the proxy setting configuration process is:
- Step 1 - Configure environment variables can be manual (because usually network administrators prepare environment variables for proxy as part of machine setup), or can be configured through an installation script.
- Step 2 - Configure Amazon SSM client proxy settings can only be configured through an installation script. Manual changes are necessary only when debugging or resolving possible proxy issues.
For TDP versions < 2.22.0, proxy configuration for Data Hub was manual.
When the Data Hub machine is behind a proxy, TetraScience must perform additional configuration to ensure that both installations, and the Data Hub, are working properly.
To set the proxy settings:
- Configure environment variables for the
rootaccount so that you can install the Data Hub successfully. You must do this before you install the Data Hub.
- Configure the proxy settings for the Amazon SSM client to work properly. You must perform this step after you install the Data Hub, but before you activate the Data Hub through the TDP UI.
The Amazon SSM client that drives the TDP Data Hub uses the
root account on a dedicated Linux machine. Therefore, TetraScience configures the shell environment variables to be persistent between sessions. This means that the environment variables are available all the time, such as when the Data Hub server is restarted, or when a new root session is created. As a result, TetraScience cannot configure environment variables temporarily whenever we create new session. Instead, we must configure it in the root's user profile. The file that needs to change depends on the Linux distribution used for the Data Hub.
- For Ubuntu, TetraScience edits the
- For RedHat based Linux distributions, TetraScience edits the
These lines are added to the previously mentioned files:
export http_proxy=http://hostname:port export https_proxy=http://hostname:port export no_proxy=169.254.169.254
After this step, TetraScience ensures that the settings are applied. We either:
- Restart the connection
- Restart the Data Hub server
- (Ubuntu) Invoke command
- (Red Hat)Invoke
source /root/.bash_profile(Red Hat)
TetraScience verifies if the proxy settings environment variables are configured by invoking the shell command
After we have verified proxy settings, we execute the Data Hub installation script.
The Data Hub installer script generates a log file. If the installation was not successful, then the installer provides a short error and path to the log file to review more detailed messages. Please review the log file and verify that the proxy settings were configured properly.
After the Data Hub installation script executes properly, and before we activate the Data Hub using the TDP UI, TetraScience must enable the proxy settings for Amazon SSM client.
To review the Amazon SSM client proxy configuration procedure, see their AWS documentation.
After the proxy is configured for Amazon SSM client, TetraScience can activate the Data Hub.
If the activation fails, then TetraScience returns to the Data Hub machine and reviews the Amazon SSM client log. The log is stored at:
/var/log/amazon/ssm/amazon-ssm-agent.log. TetraScience can execute a command such as
tail /var/log/amazon/ssm/amazon-ssm-agent.logand verify that no messages display, such as
unable to resolve address https://*.amazonservices.com, or
unable to connect to https://*.amazonservices.com. However, if these messages do display, then please check the Amazon SSM client proxy settings once more
Updated 7 months ago