Alibaba Cloud Lindorm Monitoring Integration
Site24x7 offers comprehensive out-of-the-box monitoring for Lindorm instances deployed in your Alibaba Cloud environment. You can gain real-time visibility into system resource usage, storage consumption, query rates, and latency trends to ensure smooth operations for your time-series, search, and table workloads. Once your Alibaba Cloud account is integrated with Site24x7, all associated Lindorm instances are auto-discovered and continuously monitored.
Use cases
- Resource utilization control: Track CPU, memory, and worker thread availability to prevent system overloads.
- Storage efficiency monitoring: Monitor hot, cold, and Solr storage usage to manage capacity and avoid bottlenecks.
- Performance insights: Analyze query response times (P95, P99), write/read latencies, and Solr operations to improve efficiency.
- Operational tracking: Monitor query counts, LQL operations, and scan requests for workload visibility.
- Search service health: Ensure stable search operations by monitoring memory usage and CPU idle time for search nodes.
Setup and configuration
- Log in to your Site24x7 account and navigate to Cloud > Alibaba Cloud > Add Monitor.
- In the Edit Alibaba Cloud Monitor page, select Lindorm from the Service Types list.
- Once added, go to Cloud > Alibaba > Lindorm to view dashboards and performance metrics.
Supported metrics
CPU / Memory / System Resource Metrics
| Metric name | Description | Unit |
|---|---|---|
| CPU Wait I/O Time | The percentage of CPU time spent waiting for I/O operations. | Percentage |
| Lindorm Multi CPU System Usage | The percentage of CPU utilization in system/kernel mode across multiple nodes. | Percentage |
| Memory Used Percentage | The percentage of system memory used. | Percentage |
| Search Memory Used Percentage | The percentage of memory consumed by search services. | Percentage |
| Lindorm Multi Free Memory | The amount of free memory available across nodes. | Bytes |
| Lindorm Multi Buffer/Cache Memory | The memory consumed by buffer and cache across nodes. | Bytes |
| Lindorm Multi CPU User Time | The percentage of CPU utilization in user mode across nodes. | Percentage |
| Search CPU Idle Percentage | The percentage of CPU idle time in search services. | Percentage |
| Search Free Memory | The free memory available for search services. | Bytes |
| Lindorm Multi Worker Count | The number of worker threads available for Lindorm operations. | Count |
Storage Utilization Metrics
| Metric name | Description | Unit |
|---|---|---|
| Lindorm Multi Cold Storage Used (%) | The percentage of cold storage used across nodes. | Percentage |
| Multi Storage Used (%) | The percentage of total storage used across multiple nodes. | Percentage |
| Lindorm Multi Hot Storage Used (%) | The percentage of hot storage used across nodes. | Percentage |
| Lindorm Multi Total Storage (Bytes) | The total storage capacity available across nodes. | Bytes |
| Lindorm Multi Used Storage (Bytes) | The total storage consumed across nodes. | Bytes |
| Lindorm Multi Solr Storage Used (%) | The percentage of Solr storage used. | Percentage |
| Search Hot Storage Used (Bytes) | The amount of hot storage used by search services. | Bytes |
| Lindorm Multi Table Hot Storage Used (Bytes) | The amount of hot storage consumed by tables across nodes. | Bytes |
| Lindorm Multi Cold Storage Used (Bytes) | The amount of cold storage consumed across nodes. | Bytes |
| TSDB Hot Storage Used (Bytes) | The amount of hot storage consumed by the TSDB service. | Bytes |
Performance & Latency Metrics
| Metric name | Description | Unit |
|---|---|---|
| GET Response Time P99 | The 99th percentile response time for get operations. | Milliseconds |
| Lindorm Multi Write PUT Response Time (Max) | The maximum response time for write put operations. | Milliseconds |
| Lindorm Multi Search Update Response Time P99 | The 99th percentile response time for search update requests. | Milliseconds |
| Lindorm Multi PUT Response Time (Average) | The average response time for put operations across nodes. | Milliseconds |
| Solr Select Response Time P99 | The 99th percentile response time for Solr select queries. | Milliseconds |
| Lindorm Multi Search Select Response Time P95 | The 95th percentile response time for search select queries. | Milliseconds |
| Lindorm Multi Search Update Response Time (Mean) | The mean response time for search update requests. | Milliseconds |
| Solr Update Response Time P99 | The 99th percentile response time for Solr update operations. | Milliseconds |
| Lindorm Multi Read Response Time | The average response time for read operations across nodes. | Milliseconds |
| Write Response Time | The average response time for write operations. | Milliseconds |
Query Rate & Ops Metrics
| Metric name | Description | Unit |
|---|---|---|
| Search Update Count | The number of search update operations executed. | Count |
| Lindorm Multi Solr Select Count | The number of Solr select queries executed across nodes. | Count |
| LQL Select Operations | The number of LQL select operations performed. | Count |
| LQL Delete Operations | The number of LQL delete operations performed. | Count |
| Lindorm Multi LQL Upsert Operations | The number of LQL upsert operations across nodes. | Count |
| Write Operations | The number of write operations executed. | Count |
| Read Operations | The number of read operations executed. | Count |
| Lindorm Multi GET Operations | The number of get operations across nodes. | Count |
| Lindorm Multi Write Delete Operations | The number of wide delete operations performed across nodes. | Count |
| Scan Operations | The number of scan operations executed. | Count |
Threshold configuration
- Go to Admin > Configuration Profiles > Threshold and Availability.
- Create or edit a threshold profile for Lindorm.
- Assign the profile to the respective monitors to trigger alerts.
IT automation
Site24x7's IT Automation tools help with automatically resolving performance degradation issues. When a breach occurs, the alarm engine continuously examines the system events for which thresholds have been defined and performs the mapped automation.
- Go to Admin > IT Automation Templates.
- Create a new automation rule.
- Map the rule to the monitor for proactive resolution.
How to configure IT Automation for a monitor
Configuration rules
With Site24x7's Configuration Rules, you can set parameters like Threshold Profile, Notification Profile, Tags, and Monitor Group for multiple monitors and automate the configuration settings of your monitoring resources. Automatically assign these settings when new Lindorm monitors are added.
How to add a Configuration Rule
