How to monitor GaussDB in Huawei Cloud
Site24x7 provides comprehensive monitoring support for Huawei Cloud GaussDB (GDB), focusing on query performance, health, and connection management.
This service empowers database administrators (DBAs) to maintain distributed MySQL databases within service-level agreements (SLAs) and effectively prevent performance degradation from impacting application layers.
Use cases
Catch regressions early: Monitoring queries per second (QPS), transactions per second (TPS), and slow query together reveals when a deployment schema change has introduced inefficient queries.
Replication lag control: Millisecond-precision monitoring of replication delays effectively mitigates the risk of stale reads from causing data consistency issues.
Query performance tuning: Analyzing slow query and row lock metrics guide index improvements before user face latency degrades.
Setup and configuration
GaussDB resources are auto-discovered and monitored during the Huawei Cloud integration. To enable monitoring, follow the steps below:
- Navigate to Cloud > Huawei > Add Huawei Monitor. Follow the steps to add a Huawei Cloud monitor.
- While adding or editing a Huawei Cloud monitor, select GaussDB from the Service/Resource Types drop-down and click Save.
- Navigate to Cloud > Huawei, select the created Huawei monitor, and then click GaussDB to view the performance metrics.
Supported metrics
CPU and memory
Metric name | Description | Units |
| CPU Usage | Overall CPU utilization consumed by the GaussDB instance. | Percentage |
| Memory Usage | Percentage of total memory capacity currently in use by the GaussDB instance. | Percentage |
Network
Metric name | Description | Units |
| Network Input Throughput | Volume of data received by the instance from clients per second. | Bytes/second |
| Network Output Throughput | Volume of data sent by the instance to clients per second. | Bytes/second |
| Error Rate of Received Packets | Rate of errors detected in packets received by the instance network interface. | Percentage |
| Loss Rate of Received Packets | Rate at which inbound packets are being dropped at the network interface. | Percentage |
| Error Rate of Sent Packets | Rate of errors detected in packets transmitted by the instance network interface. | Percentage |
| Loss Rate of Sent Packets | Rate at which outbound packets are being dropped at the network interface. | Percentage |
Connections
Metric name | Description | Units |
| Total Connections | Total number of client connections. | Count |
| Current Active Connections | Number of connections currently processing a request. | Count |
| Connection Usage | Percentage of maximum allowed connections in use. | Percentage |
Queries
Metric name | Description | Units |
| QPS | Number of SQL queries executed per second across all client connections. | Count/second |
| TPS | Number of transactions committed and rolled back per second. | Count/second |
| SELECT Statements per Second | Rate of SELECT query statements executed by clients per second. | Count/second |
| INSERT Statements per Second | Rate of INSERT query statements executed by clients per second. | Count/second |
| UPDATE Statements per Second | Rate of UPDATE query statements executed by clients per second. | Count/second |
| DELETE Statements per Second | Rate of DELETE query statements executed by clients per second. | Count/second |
| INSERT_SELECT Statements per Second | Rate of INSERT_SELECT query statement executed per second. | Count/second |
| REPLACE Statements per Second | Rate of REPLACE statements executed per second. | Count/second |
| REPLACE_SELECTION Statements per Second | Rate of REPLACE_SELECT query statements executed per second. | Count/second |
| INSERT and INSERT_SELECT per Second | Combined rate of INSERT and INSERT_SELECT query statements per second. | Count/second |
| COMMIT Statements per Second | Rate of explicit COMMIT query statements issued per second. | Count/second |
| ROLLBACK Statements per Second | Rate of explicit ROLLBACK query statements issued per second. | Count/second |
| Slow Query Logs | Number of queries whose execution time exceeded the configured slow query threshold. | Count/minute |
InnoDB
Metric name | Description | Units |
| Buffer Pool Usage | Percentage of the InnoDB buffer pool pages currently occupied by data. | Percentage |
| Buffer Pool Hit Ratio | Percentage of read requests served from the InnoDB buffer pool without requiring a disk read. | Percentage |
| Buffer Pool Dirty Block Ratio | Percentage of buffer pool pages that have been modified but not yet flushed to disk. | Percentage |
| InnoDB Read Throughput | Volume of data read by InnoDB from the buffer pool per second. | Bytes/second |
| InnoDB Write Throughput | Volume of data written by InnoDB to the buffer pool per second. | Count/second |
| InnoDB Log Write Request Frequency | Rate of write requests to the InnoDB redo log per second. | Count/second |
| InnoDB Log Writes | Number of physical writes performed to the InnoDB redo log per second. | Count/second |
| Row Delete Frequency | Number of rows deleted from InnoDB tables per second. | Count/second |
| Row Insert Frequency | Number of rows inserted into InnoDB tables per second. | Count/second |
| Row Read Frequency | Number of rows read from InnoDB tables per second. | Count/second |
| Row Update Frequency | Number of rows updated in InnoDB tables per second. | Count/second |
| InnoDB Storage Layer Read Requests per Second | Number of read requests issued to the underlying storage layer by InnoDB per second, indicating cache misses. | Count/second |
| InnoDB Read Requests per Second | Total number of logical read requests made to the InnoDB buffer pool per second. | Count/second |
| InnoDB Bufpool Read Ahead | Number of pages read into the buffer pool by the linear read-ahead mechanism. | Count |
| InnoDB Bufpool Read Ahead Evicted | Number of pages read ahead by InnoDB that were evicted from the buffer pool without being accessed. | Count |
| InnoDB Bufpool Read Ahead Rnd | Number of random read-ahead operations initiated by InnoDB. | Count |
| InnoDB Pages Read | Total number of pages read from the InnoDB buffer pool per second. | Count |
| InnoDB Pages Written | Total number of pages written to the InnoDB buffer pool. | Count |
| Total Bytes of Buffer Pool | Total size in bytes of data currently held in the InnoDB buffer pool. | Bytes |
| Row Lock Time | Cumulative time in milliseconds spent waiting for InnoDB row-level locks. | Millisecond |
| Row Lock Waits | Number of times InnoDB had to wait for a row lock before proceeding with a transaction. | Count/minute |
Disk and I/O
Metric name | Description | Units |
| Used Storage Space | Total amount of disk storage currently consumed by database data files. | GB |
| Disk Usage | Total allocated storage currently in use by the instance. | Percentage |
| Write IOPS | Number of disk written I/O operations per second. | Count/second |
| Read IOPS | Number of disk read I/O operations per second. | Count/second |
| Write Bandwidth | Overall data written to disk per second. | Byte/second |
| Read Bandwidth | Overall data read from disk per second. | Byte/second |
| Storage Write Delay | Average storage-layer write time in milliseconds. | Millisecond |
| Storage Read Delay | Average storage-layer read time in milliseconds. | Millisecond |
Replication
Metric name | Description | Units |
| Replication Delay | Time lag in seconds between the primary node and its standby replication. | Seconds |
| Replication Delay Millisecond-level | Fine-grained replication lag in milliseconds between the primary and standby nodes. | Millisecond |
Miscellaneous
Metric name | Description | Units |
| Temporary Tables | Current number of temporary tables existing in memory or on disk for active queries. | Count |
| Temporary Tables Created per Minute | Rate at which the engine is creating new temporary tables to satisfy complex queries. | Count/minute |
| Long-Running Transactions | Number of transactions that have been open and executing longer than the configured threshold. | Count |
| Existing Transactions in Undo Space | Number of active transactions currently holding undo log space. | Count |
| Sorts Using Ranges | Number of sort operations performed using index ranges per minute. | Count/minute |
| Sorted Rows | Number of rows sorted per second across all active sort operations per minute. | Count/minute |
| Sorts by Scanning Tables | Number of sort operations that required a full table scan per minute. | Count/minute |
| Hits for Open Tables Cache Lookups | Number of times the table open cache was successfully consulted to avoid re-opening a table. | Count/minute |
| Misses for Open Tables Cache Lookups | Number of times a table was not found in the open cache and had to be reopened from disk. | Count/minute |
| Binlog Files | Total number of binary log files currently present on the instance. | Count |
| Write Traffic Control | Number of write throttling events triggered by the storage layer due to write pressure. | Count |
| Read Traffic Control | Number of read throttling events triggered by the storage layer due to read pressure. | Count |
Threshold configuration
You can configure thresholds and alerts for all GaussDB metrics to proactively detect performance degradation or connection issues.
- Go to Admin > Configuration Profiles > Threshold and Availability.
- Create or edit your Threshold Profile for GaussDB.
- Assign the profile to the respective monitors to trigger alerts.
IT Automation
Use Site24x7's IT Automation to resolve common issues with GaussDB performance:
- Go to Admin > IT Automation Templates. Then, click Add Automation Templates.
- Create an automation rule by selecting the automation Type (e.g., Server reboot, clear queue).
- Map the created rules to the GaussDB, for automatic execution during alerts.
Configuration rules
Use Configuration Rules to simplify bulk setup across GaussDB instances. Automatically assign Threshold Profiles, Notification Profiles, Tags, and Monitor Groups when new monitors are discovered.
