Alibaba Cloud ApsaraDB for MongoDB Monitoring Integration
Site24x7 offers extensive out-of-the-box monitoring for ApsaraDB for MongoDB instances deployed in your Alibaba Cloud environment. With visibility into resource utilization, cache efficiency, query performance, replication health, and sharding activity, you can optimize the reliability and performance of your MongoDB clusters. Once your Alibaba Cloud account is integrated with Site24x7, all associated MongoDB instances are auto-discovered and continuously monitored.
Use cases
- Resource and cache management: Monitor memory and WiredTiger cache usage to prevent memory saturation and cache inefficiency.
- Storage and IOPS optimization: Track disk utilization, IOPS consumption, and bandwidth to avoid bottlenecks.
- Query performance analysis: Gain insights into QPS, average command latency, and query/update/delete operations.
- Transaction monitoring: Observe commit, abort, and prepare rates to detect transactional anomalies.
- Replication health checks: Monitor replication lag and replica operations for consistency across nodes.
- Cluster-wide observability: Analyze sharding throughput, connection utilization, and distributed load balancing.
Setup and configuration
- Log in to your Site24x7 account and navigate to Cloud > Alibaba Cloud > Add Monitor.
- In the Edit Alibaba Cloud Monitor page, select ApsaraDB for MongoDB from the Service Types list.
- Once added, go to Cloud > Alibaba > ApsaraDB for MongoDB to view dashboards and performance metrics.
Supported metrics
Memory & Cache Metrics
| Metric name | Description | Unit |
|---|---|---|
| Memory Utilization | The percentage of memory used by the instance. | Percentage |
| WiredTiger Cache Usage | The percentage of WiredTiger cache usage. | Percentage |
| WiredTiger Cache Dirty Usage | The percentage of WiredTiger cache occupied by dirty data. | Percentage |
| Bytes Read from Cache | The number of bytes read from the cache per node. | Bytes |
| Bytes Written to Cache | The number of bytes written into the cache. | Bytes |
| Cache Bytes | The total number of bytes used in cache. | Bytes |
Storage & IOPS Metrics
| Metric name | Description | Unit |
|---|---|---|
| Data IOPS Utilization | The percentage of IOPS consumed. | Percentage |
| Data IOPS | The number of I/O operations executed for data. | Count |
| I/O Bandwidth | The bandwidth utilized for I/O per node. | Bytes/second |
| Instance Disk Amount | The total disk capacity allocated to the instance. | Bytes |
| Disk Utilization | The percentage of disk usage. | Percentage |
Query & Command Performance Metrics
| Metric name | Description | Unit |
|---|---|---|
| Queries Per Second (QPS) | The number of queries processed per second. | Count/second |
| Command Average Response | The average response time for commands per node. | Milliseconds |
| Query Operations | The number of query operations executed. | Count |
| Update Operations | The number of update operations executed. | Count |
| Delete Operations | The number of delete operations executed. | Count |
| Insert Operations | The number of insert operations executed. | Count |
| Operation Command | The number of command operations executed. | Count |
| Documents Scanned | The number of documents scanned. | Count |
| Documents Returned Per Second | The number of documents returned per second. | Count/second |
| Documents Inserted Per Second Node | The number of documents inserted per second per node. | Count/second |
| Documents Updated Per Second | The number of documents updated per second per node. | Count/second |
Transactions Metrics
| Metric name | Description | Unit |
|---|---|---|
| Transactions Started Per Second | The number of transactions started per second per node. | Count/second |
| Transactions Total Committed Per Second Node | The number of transactions committed per second per node. | Count/second |
| Transactions Aborted Per Second | The number of transactions aborted per second per node. | Count/second |
| Transactions Prepared Per Second | The number of transactions prepared per second per node. | Count/second |
| Transactions Prepared Then Committed Per Second Node | The number of prepared transactions that were later committed. | Count |
| Transactions Prepared Then Aborted Per Second | The number of prepared transactions that were later aborted. | Count |
Replication Metrics
| Metric name | Description | Unit |
|---|---|---|
| Replica Updates | The number of replicated update operations. | Count |
| Replica Inserts | The number of replicated insert operations. | Count |
| Replica Deletes | The number of replicated delete operations. | Count |
| Replica GetMores | The number of replicated getMore operations. | Count |
| Replica Queries | The number of replicated query operations. | Count |
| Replication Lag | The replication delay between primary and secondary nodes. | Milliseconds |
Connections & Sessions Metrics
| Metric name | Description | Unit |
|---|---|---|
| Connection Amount | The total number of open connections. | Count |
| Available Connections | The number of available connections. | Count |
| Active Connections | The number of active connections. | Count |
| Session Closed Cursors Node | The number of closed cursors per node. | Count |
| Connections Created Per Second | The number of new connections created per second. | Count/second |
Sharding & Cluster Load Metrics
| Metric name | Description | Unit |
|---|---|---|
| Sharding Queries Per Second (QPS) | The number of queries executed per second in sharded clusters. | Count/second |
| Sharding Inserted Documents Per Second | The number of documents inserted per second per node in a sharded cluster. | Count/second |
| Sharding Updated Documents Per Second | The number of documents updated per second per node in a sharded cluster. | Count/second |
| Sharding Read Bandwidth Node | The read bandwidth per node in a sharded cluster. | Bytes/second |
| Sharding Connection Utilization | The percentage of sharding connection utilization. | Percentage |
| Sharding Replica Updates | The number of replicated updates per node in a sharded cluster. | Count |
| Sharding Logical IOPS | The number of log IOPS in a sharded cluster per node. | Count |
Threshold configuration
- Go to Admin > Configuration Profiles > Threshold and Availability.
- Create or edit a threshold profile for ApsaraDB for MongoDB.
- Assign the profile to the respective monitors to trigger alerts.
IT automation
Site24x7's IT Automation tools help with automatically resolving performance degradation issues. When a breach occurs, the alarm engine continuously examines the system events for which thresholds have been defined and performs the mapped automation.
- Go to Admin > IT Automation Templates.
- Create a new automation rule.
- Map the rule to the monitor for proactive resolution.
How to configure IT Automation for a monitor
Configuration rules
With Site24x7's Configuration Rules, you can set parameters like Threshold Profile, Notification Profile, Tags, and Monitor Group for multiple monitors and automate the configuration settings of your monitoring resources. Automatically assign these settings when new ApsaraDB for MongoDB monitors are added.
How to add a Configuration Rule
