Alibaba Cloud PolarDB-X Monitoring Integration
Site24x7 offers comprehensive monitoring for Alibaba Cloud PolarDB-X, enabling deep visibility into compute resources, query performance, and distributed transaction behavior. With insights into resource consumption, replication delays, and active session counts, you can identify bottlenecks, optimize query execution, and maintain cluster stability across compute nodes (CN), data nodes (DN), and Global Metadata Services (GMS). When you integrate your Alibaba Cloud account with Site24x7, all PolarDB-X instances are automatically discovered and monitored.
Use cases
- Query and transaction analysis: Monitor QPS, TPS, and slow query trends to evaluate workload distribution.
- Resource optimization: Track CPU, memory, and IOPS utilization across CN, DN, and GMS nodes to prevent overuse.
- Connection visibility: Monitor active connections and sessions to identify saturation points.
- Replication assurance: Detect slave lag and CDC delays to ensure real-time data synchronization.
Setup and configuration
- Log in to your Site24x7 account and navigate to Cloud > Alibaba Cloud > Add Monitor.
- In the Edit Alibaba Cloud Monitor page, select PolarDB-X from the Service Types list.
- Once added, go to Cloud > Alibaba > PolarDB-X to view dashboards and performance metrics.
Supported metrics
Compute & Query Performance
| Metric name | Description | Unit |
|---|---|---|
| QPS | The number of queries executed per second. | Count/second |
| TPS | The number of transactions processed per second. | Count/second |
| Logical QPS | The logical query rate across compute nodes. | Count/second |
| Logical TPS (CN Node) | The logical transaction rate for each compute node. | Count/second |
| Logical Request Count (CN) | The total number of logical requests processed by compute nodes. | Count |
| Logical Request Count (CN Node) | The total number of logical requests per compute node. | Count |
| Logical Slow Queries (CN) | The total number of slow queries detected across compute nodes. | Count |
| Logical Slow Queries (CN Node) | The number of slow queries per compute node. | Count |
| Logical Response Time | The average response time of logical queries across compute nodes. | Milliseconds |
| Logical Response Time (CN Node) | The average logical query response time per compute node. | Milliseconds |
| Physical QPS | The physical query rate across data nodes. | Count/second |
| Physical Response Time | The average response time of physical queries. | Milliseconds |
| Slow Queries | The total number of slow queries in the cluster. | Count |
Resource Utilization
| Metric name | Description | Unit |
|---|---|---|
| CPU Utilization | The percentage of CPU utilization across all nodes. | Percentage |
| CPU Usage (CN) | The CPU usage of compute nodes. | Percentage |
| CPU Usage (CN Node) | The CPU usage per compute node. | Percentage |
| CPU Usage (DN) | The CPU usage of data nodes. | Percentage |
| Memory Usage (CN) | The memory usage of compute nodes. | Percentage |
| Memory Usage (CN Node) | The memory usage per compute node. | Percentage |
| Memory Usage (DN) | The memory usage of data nodes. | Percentage |
| Memory Usage (GMS) | The memory usage of Global Metadata Service nodes. | Percentage |
| Disk Usage (DN) | The disk utilization of data nodes. | Percentage |
| Disk Usage (GMS) | The disk utilization of Global Metadata Service nodes. | Percentage |
| IOPS Usage (DN) | The number of I/O operations per second on data nodes. | Count/second |
| IOPS Usage (GMS) | The number of I/O operations per second on GMS nodes. | Count/second |
Connections & Sessions
| Metric name | Description | Unit |
|---|---|---|
| Connection Count | The total number of active connections in the cluster. | Count |
| Active Connections (CN) | The number of active connections in compute nodes. | Count |
| Active Connections (CN Node) | The number of active connections per compute node. | Count |
| Active Sessions (GMS) | The number of active sessions on GMS nodes. | Count |
| Active Sessions (DN) | The number of active sessions on data nodes. | Count |
| Connection Usage | The overall percentage of connection utilization. | Percentage |
| Connection Usage (GMS) | The connection usage percentage on GMS nodes. | Percentage |
| Connection Usage (DN) | The connection usage percentage on data nodes. | Percentage |
Replication & CDC
| Metric name | Description | Unit |
|---|---|---|
| Slave Lag | The replication lag between the primary and standby instances. | Milliseconds |
| GMS Slave Lag | The replication lag observed in GMS nodes. | Milliseconds |
| DN Slave Lag | The replication lag observed in data nodes. | Milliseconds |
| CDC Dumper TPS | The number of transactions processed by the CDC dumper per second. | Count/second |
| CDC Dumper Delay | The time delay in the CDC dumper process. | Milliseconds |
| CDC Dumper CPU Usage | The CPU usage of the CDC dumper process. | Percentage |
| CDC Dumper BPS | The data transfer rate (bytes per second) of the CDC dumper. | Bytes/second |
InnoDB & Transaction
| Metric name | Description | Unit |
|---|---|---|
| InnoDB Log Writes | The number of log writes to the InnoDB storage engine. | Count |
| InnoDB Log Write Requests | The total number of log write requests issued to InnoDB. | Count |
| InnoDB Row Inserts | The number of rows inserted in InnoDB tables. | Count |
| InnoDB Row Updates | The number of rows updated in InnoDB tables. | Count |
| InnoDB Row Deletes | The number of rows deleted from InnoDB tables. | Count |
| InnoDB Rows Read | The number of rows read from InnoDB tables. | Count |
Threshold configuration
- Go to Admin > Configuration Profiles > Threshold and Availability.
- Create or edit a threshold profile for PolarDB-X.
- Assign the profile to the respective monitors to trigger alerts.
IT automation
Site24x7's IT Automation tools help with automatically resolving performance degradation issues. When a breach occurs, the alarm engine continuously examines the system events for which thresholds have been defined and performs the mapped automation.
- Go to Admin > IT Automation Templates.
- Create a new automation rule.
- Map the rule to the monitor for proactive resolution.
How to configure IT Automation for a monitor
Configuration rules
With Site24x7's Configuration Rules, you can set parameters like Threshold Profile, Notification Profile, Tags, and Monitor Group for multiple monitors and automate the configuration settings of your monitoring resources. Automatically assign these settings when new PolarDB-X monitors are added.
How to add a Configuration Rule
