Alibaba Cloud ApsaraDB ClickHouse Monitoring Integration
Site24x7 provides comprehensive out-of-the-box monitoring for ApsaraDB ClickHouse instances deployed in your Alibaba Cloud environment. Gain detailed insights into storage performance, CPU and memory usage, network throughput, and query behavior to ensure your real-time analytics workloads are always optimized and responsive. Once your Alibaba Cloud account is integrated with Site24x7, all associated ClickHouse instances are auto-discovered and continuously monitored.
Use cases
- Real-time query monitoring: Track QPS, TPS, and failed queries to ensure analytics workloads run smoothly.
- Storage health visibility: Monitor disk usage, cold storage data, and IOPS to avoid bottlenecks.
- Memory and CPU control: Prevent overuse and memory allocation failures by monitoring utilization rates.
- Connection pool analysis: Understand HTTP and TCP connection usage to optimize throughput.
- Merge operation tracking: Keep an eye on active merge operations and data part growth to manage storage fragmentation.
Setup and configuration
- Log in to your Site24x7 account and navigate to Cloud > Alibaba Cloud > Add Monitor.
- In the Edit Alibaba Cloud Monitor page, select ApsaraDB ClickHouse from the Service Types list.
- Once added, go to Cloud > Alibaba > ApsaraDB ClickHouse to view dashboards and performance metrics.
Supported metrics
Memory and CPU Metrics
| Metric name | Description | Unit |
|---|---|---|
| CPU Utilization | The percentage of CPU used by the ClickHouse instance. | Percentage |
| Memory Utilization | The percentage of memory used. | Percentage |
| Memory Used | The actual memory consumed by the instance. | MB |
| Memory Usage Size | The total memory usage of the process. | MB |
| Merge/Mutation Failures | The number of memory allocation failures. | Count |
| Total Merges/Mutations | The total number of memory allocations. | Count |
Storage and Disk Utilization Metrics
| Metric name | Description | Unit |
|---|---|---|
| Disk Space Used | The amount of disk space currently used. | GB |
| Disk Utilization | The percentage of disk utilization. | Percentage |
| Disk Read Rate | The disk read rate. | KB/second |
| Disk Write Rate | The disk write rate. | KB/second |
| Disk Read IOPS | The number of read operations per second. | Count/second |
| Disk Write IOPS | The number of write operations per second. | IOPS |
| Cold Storage Data Size | The volume of data stored in cold storage. | GB |
Core Performance and Query Metrics
| Metric name | Description | Unit |
|---|---|---|
| Queries Per Second (QPS) | The number of queries processed per second. | Count/second |
| Transactions Per Second (TPS) | The number of transactions processed per second. | Count/second |
| Total Queries | The total number of queries executed. | Count |
| Failed Queries | The number of queries that failed to execute. | Count |
| Failed Insert Queries | The number of failed insert operations. | Count |
| Failed Select Queries | The number of failed select operations. | Count |
| Rows Inserted | The number of rows inserted. | Count |
| Insert Data Size | The size of inserted data. | Bytes |
| Concurrent Queries | The number of concurrent queries. | Count |
| Initial Queries | The number of queries initiated. | Count |
Network and Connections
| Metric name | Description | Unit |
|---|---|---|
| Network Receive Rate | The rate of data received over the network. | KB/second |
| Network Transmit Rate | The rate of data sent over the network. | KB/second |
| Total Connections | The number of active client connections. | Count |
| HTTP Connections | The number of HTTP connections in use. | Count |
| TCP Connections | The number of TCP connections in use. | Count |
Data Part and Merge Metrics
| Metric name | Description | Unit |
|---|---|---|
| Data Part Count | The number of active data parts in the storage engine. | Count |
| Merge Operations | The number of merge operations currently in progress. | Count |
Threshold configuration
- Go to Admin > Configuration Profiles > Threshold and Availability.
- Create or edit a threshold profile for ApsaraDB ClickHouse.
- Assign the profile to the respective monitors to trigger alerts.
IT automation
Site24x7's IT Automation tools help with automatically resolving performance degradation issues. When a breach occurs, the alarm engine continuously examines the system events for which thresholds have been defined and performs the mapped automation.
- Go to Admin > IT Automation Templates.
- Create a new automation rule.
- Map the rule to the monitor for proactive resolution.
How to configure IT Automation for a monitor
Configuration rules
With Site24x7's Configuration Rules, you can set parameters like Threshold Profile, Notification Profile, Tags, and Monitor Group for multiple monitors and automate the configuration settings of your monitoring resources. Automatically assign these settings when new ApsaraDB ClickHouse monitors are added.
How to add a Configuration Rule
