Superset Monitoring App EAP Release Notes

The following are release notes for the prerelease versions of the Superset Monitoring App.

v0.5.0

Release date: October 15 2024

What's New

🚧

IMPORTANT

The Superset Monitoring App is available through an early adopter program (EAP) currently and is activated for customers through coordination with TetraScience. For more information, or to activate the app in your environment, contact your customer success manager (CSM).

TetraScience has released its first version of the Superset Monitoring Embedded Data App, version 0.5.0, as part of an EAP. The Superset Monitoring App provides a new Health Monitoring dashboard in the Tetra Data Platform (TDP) user interface to help customers gain an end-to-end understanding of data downtime. The dashboard provides a set of metrics for ingestion failures and latency for on-premises Tetra File-Log Agents and Tetra Data Pipelines.

Customers can access the new Health Monitoring dashboard in the TDP after it's activated by doing the following:

  1. Open the legacy Health Monitoring page.
  2. Select the upper right New UI | WELCOME TO OUR NEW MONITORING EXPERIENCE toggle.

Here are the details for what's new in Superset Monitoring App v0.5.0.

Prerequisites

Superset Monitoring App v0.5.0 requires the following:

  • Tetra Data Platform (TDP) v4.1.1 or higher
  • Customers must contact their customer success manager (CSM) or account executive to activate the app and the new Health Monitoring Dashboard in the TDP UI

For Customer-Hosted TDP Environments Only

  • The TDP's Transport Layer Security (TLS) certificate must validate the following endpoint: *.data-apps.tdp-hostname.com
  • The Domain Name Server (DNS) zone for tdp-hostname.com must have a CNAME record routing *.data-apps.tdp-hostname.com to tdp-hostname.com

New Functionality

New functionality includes features not previously available in the Tetra Data Platform (TDP) or the Superset Monitoring App.

New Health Monitoring Dashboard UI

A single, interactive Health Monitoring dashboard provides performance trending analytics for Tetra File-Log Agents and Tetra Data Pipelines in the following areas.

Monitoring

The dashboard's Monitoring tab shows key performance indicators (KPIs), historical trends (charts), and the top 10 TDP entities where data downtime is highest (tables).

Monitoring tab

Monitoring Performance Metrics
Performance MetricMetric TypeDescription
Path Scan Failure PercentKPIPercent of scans failed from all started path scans
File Ingestion Failure PercentKPIPercent of files that failed to appear in File Search from all started path scans
Workflow Failure PercentKPIPercent of pipeline workflows that failed from all of the workflows that ran
Average Path Scan DurationKPIAverage time it took for all succeeded and failed path scans to complete
Average File Ingestion DurationKPIAverage time it took for files to be scanned and then become available through search in the TDP
Workflow Run DurationKPIAverage time it took for pipelines to run each successful workflow
Path Scan Failure Over TimeChartPercentage of path scans that failed from all path scans that started over a specific time range
File Ingestion Failure Over TimeChartPercentage of scanned files that failed to appear in search over a specific time range
Workflow Failure Over TimeChartPercentage of started pipeline workflows that failed over a specific time range
Path Scan Duration Over TimeChartAverage time it took for all succeeded and failed path scans to complete over a specific time range
File Ingestion Duration Over TimeChartAverage time it took for scanned files to appear in search over a specific time range
Workflow Run Duration Over TimeChartAverage time it took for pipelines to run each successful workflow over a specific time range
File Ingestion JourneyChartThe current number of files in each stage of the file ingestion journey
Top 10 Paths by Path Scan ErrorsTableThe 10 scan paths that had the highest File Scan error rate
Top 10 File Upload ErrorsTableThe 10 scan paths that had the highest File Upload error rate
Top 10 Pipelines by Workflow FailuresTableThe 10 pipelines that had the highest workflow failure rate
Top 10 Paths by Longest Path Scan DurationTableThe 10 scan paths that took the longest to complete each scan
Top 10 Files by Longest File Ingestion DurationTableThe 10 files that took the longest to become available through search in the TDP after being scanned
Top 10 Pipelines by Longest Workflow DurationTableThe 10 pipelines that had the longest average workflow runtime duration
Monitoring Filters

Customers can apply the following filters to metrics on the Monitoring tab:

  • Time range: Defines a specific time range for each metric
  • AgentID: Indicates a specific Tetra File-Log Agent by its ID
  • Path: Indicates a specific scan path
  • PipelineID: Indicates a specific Tetra Data Pipeline by its ID

Troubleshooting

The dashboard's Troubleshooting tab shows metrics as tables to help with troubleshooting data downtime issues.

Troubleshooting tab

Troubleshooting Metrics
Performance MetricMetric TypeDescription
Last Agent HeartbeatTableShows when the TDP received the last heartbeat signal from each agent
Path Status (Latest Scan)TableShows the scan time and status for the latest scan on each scan path
File StatusTableShows key events in the file ingestion journey for each ingested file
Workflow StatusTableShows the workflow status for each pipeline
Troubleshooting Filters

Customers can apply the following filters to metrics on the Troubleshooting tab:

  • Time range: Defines a specific time range for each metric
  • AgentID: Indicates a specific Tetra File-Log Agent by its ID
  • Path: Indicates a specific scan path
  • PipelineID: Indicates a specific Tetra Data Pipeline by its ID
  • FileID: Indicates a specific file by its ID

Limitations

The following are known limitations of Superset Monitoring App v0.5.0:

  • Metrics data isn't backfilled when the app is activated, so the available data spans one day in duration only when the new Health Monitoring dashboard first appears in customers' TDP environments.
  • If the available data spans one day in duration only (for example, when the new dashboard is first activated) then charts in the new Health Monitoring dashboard will appear empty. If customers hover their cursors over an empty chart, they will see a single data point, which is a summary statistic for that day. Once customers have two-days of data available, then the charts and tables will populate as normal.
  • The File Ingestion Journey chart on the Monitoring tab won't show any file search indexing events or other downstream file events if those events occur outside of the Time range filter defined by customers. This behavior occurs because all File Ingestion Journey events are a subset of the original files that were scanned during the specified time range.
  • There is a maximum number of table rows available for each of the following troubleshooting metrics:
    • Last Agent Heartbeat: 100 row maximum
    • File Status: 1,000 row maximum
    • Workflow Status: 1,000 row maximum
    • Path Status (Latest Scan): 1,000 row maximum

Upgrade Considerations

To activate the Superset Monitoring App in the Tetra Data and AI Workspace, please contact your customer success manager (CSM) or account executive.

To access the new Health Monitoring dashboard, open the legacy Health Monitoring page in the TDP. Then, select the upper right New UI | WELCOME TO OUR NEW MONITORING EXPERIENCE toggle.

For more information, see Superset Monitoring App in the TetraConnect Hub. To get access, see Access the TetraConnect Hub.

Other Release Notes

To view other release notes for Tetra Data Apps, see Tetra Data Apps Release Notes.