
Site24x7 Server Monitoring (Agent Based)

Monitor your servers with our lightweight, easy-to-install agent. Stay ahead of outages and performance degradation with over 60 performance metrics, intuitive dashboards, root cause analysis (RCA), IT Automation, and instant alerts.

Agent Architecture

Site24x7 runs two data centers, the Primary Data Center (PDC) and the Disaster Recovery Data Center (DRDC), at different locations so that monitoring services remain uninterrupted if one data center fails.

  • Primary Data Center (PDC) - plus.site24x7.com
  • Disaster Recovery Data Center (DRDC) - Primary (plus2.site24x7.com) and Secondary (plus3.site24x7.com)  

If the PDC encounters an error, the agent's heartbeat status information is automatically sent to the Primary DRDC, which avoids false alerts caused by intermittent network failures. If the Primary DRDC also encounters an error, the Secondary DRDC still receives the heartbeat check so that monitoring continues. Under normal circumstances, the agent explicitly notifies our data centers of a server shutdown. Also, ensure that the necessary domains, ports, and IP addresses are whitelisted in your firewall.
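The failover order described above can be sketched as follows. This is only an illustration, not the agent's actual implementation: the `send_heartbeat` function and the `post` transport callback are hypothetical names, and only the three hostnames come from the list above.

```python
from typing import Callable, Optional, Sequence

# Hostnames from the list above; the order encodes the failover priority
# (PDC first, then Primary DRDC, then Secondary DRDC).
ENDPOINTS = ("plus.site24x7.com", "plus2.site24x7.com", "plus3.site24x7.com")


def send_heartbeat(
    post: Callable[[str], bool],
    endpoints: Sequence[str] = ENDPOINTS,
) -> Optional[str]:
    """Try each data center in order and return the first host that accepted.

    `post` is a hypothetical transport callback: it receives a hostname and
    returns True if the heartbeat was accepted. Network errors (OSError and
    its subclasses) are treated as a failed attempt.
    """
    for host in endpoints:
        try:
            if post(host):
                return host
        except OSError:
            # Network-level error: fall through to the next data center.
            continue
    return None


# Simulate a PDC outage in which only the Primary DRDC answers.
reachable = {"plus2.site24x7.com"}
print(send_heartbeat(lambda host: host in reachable))  # plus2.site24x7.com
```

Passing the transport in as a callback keeps the sketch free of real network calls, so the ordering logic can be checked in isolation.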

Once the agent is successfully installed on the user's machine, it initiates communication with the PDC over a one-way HTTPS connection. The agent communicates in three main ways:

Heartbeat Check:

This is a status check that the agent sends to the PDC every minute. Server uptime is determined by how long it has been since Site24x7 last received a heartbeat from the agent running on your server.

If no heartbeat is received for seven continuous minutes, the Server Availability Watcher alerts the user about a possible network communication failure or system crash. To customize this interval, go to Admin > Server Monitor > Settings and set an appropriate value for Set Threshold for Heartbeat Check. Learn more.
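As a rough illustration of the threshold rule, the check below flags a server once the time since its last heartbeat reaches the configured interval. The function and constant names are hypothetical; only the seven-minute default comes from the text above.

```python
from datetime import datetime, timedelta

# Default from the text above; configurable in the product settings.
DEFAULT_THRESHOLD = timedelta(minutes=7)


def is_heartbeat_missed(
    last_heartbeat: datetime,
    now: datetime,
    threshold: timedelta = DEFAULT_THRESHOLD,
) -> bool:
    """Return True once no heartbeat has arrived for `threshold` or longer."""
    return now - last_heartbeat >= threshold


now = datetime(2024, 1, 1, 12, 0)
print(is_heartbeat_missed(now - timedelta(minutes=6), now))  # False
print(is_heartbeat_missed(now - timedelta(minutes=7), now))  # True
```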

Data Collection with a Polling Interval of One Minute:

The server monitoring agent collects data every minute, consolidates it, and sends it to the Site24x7 data center every five minutes. If a threshold is violated, the agent communicates with the data center immediately and an alert is triggered. You can also set the polling interval to 1 minute, 5 minutes, 10 minutes, 15 minutes, 30 minutes, 1 hour, 3 hours, 6 hours, 12 hours, or 1 day.
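The collect-then-upload behavior can be pictured with a small buffer that releases a batch either after five per-minute samples or as soon as a sample crosses a threshold. This is a simplified sketch under assumptions: the `CollectionBuffer` class, the metric names, and the "greater than an upper limit" rule are all hypothetical and do not describe the agent's internals.

```python
from dataclasses import dataclass, field
from typing import Dict, List

Sample = Dict[str, float]


@dataclass
class CollectionBuffer:
    thresholds: Dict[str, float]   # hypothetical: metric name -> upper limit
    upload_every: int = 5          # samples per upload (one sample per minute)
    pending: List[Sample] = field(default_factory=list)

    def add_sample(self, sample: Sample) -> List[Sample]:
        """Store one per-minute sample; return the batch to upload, or []."""
        self.pending.append(sample)
        violated = any(
            name in sample and sample[name] > limit
            for name, limit in self.thresholds.items()
        )
        if violated or len(self.pending) >= self.upload_every:
            # Hand over everything collected so far and start a new batch.
            batch, self.pending = self.pending, []
            return batch
        return []


buf = CollectionBuffer(thresholds={"cpu_percent": 90.0})
for _ in range(4):
    buf.add_sample({"cpu_percent": 20.0})       # buffered, nothing uploaded
print(len(buf.add_sample({"cpu_percent": 25.0})))  # 5 (regular upload)
print(len(buf.add_sample({"cpu_percent": 95.0})))  # 1 (immediate upload)
```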

Device Messaging System (DMS):

The DMS provides real-time communication with the monitored servers. The agent communicates with the DMS (dms.zoho.com) every 30 seconds and performs user-triggered actions such as starting or stopping a process, discovering a service or process, upgrading the agent, running root cause analysis, and adding port, URL, file, directory, and event log/syslog checks.

[Figure: Server Monitoring Architecture]

Network Sanity Checks:

Before reporting an outage, Site24x7 checks the availability of its own internal network by performing a Network Sanity Check. This confirms that the monitored resource is down, and not Site24x7's own network. Check out the possible outage scenarios and steps to troubleshoot them.

Agent Security

Because the agent has to be downloaded and installed on every server that needs monitoring, it is important to understand its security, its resource usage, and the prerequisites for a successful installation. Learn more.

OS and Features

Learn more about the supported platforms, the bulk deployment options for installing the agent on hundreds of servers in one go, and the performance metrics provided for each platform.

Plugin Integrations

Monitor your databases, applications, web servers, caches, and message brokers, and keep track of the data that matters most. Take a look at our 50+ out-of-the-box plugins. If you don't find one for what you wish to monitor, write your own using Python or Shell scripts for Linux, or VB, PowerShell, Batch, or DLL for Windows.
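To give a feel for what a custom Python plugin might look like, here is a minimal sketch. It assumes the plugin contract is a script that prints a single JSON object to standard output; the `plugin_version`, `heartbeat_required`, and `units` keys follow the convention seen in Site24x7's published sample plugins, but treat them as assumptions and check the current plugin documentation. The `disk_used_percent` metric and the `collect_metrics` function are hypothetical.

```python
import json
import shutil

PLUGIN_VERSION = 1           # assumed key: bump when the metric set changes
HEARTBEAT_REQUIRED = "true"  # assumed key: alert if the plugin stops reporting


def collect_metrics(path: str = "/") -> dict:
    """Return one hypothetical metric: disk usage of `path` as a percentage."""
    usage = shutil.disk_usage(path)
    used_percent = round(usage.used / usage.total * 100, 2) if usage.total else 0.0
    return {
        "plugin_version": PLUGIN_VERSION,
        "heartbeat_required": HEARTBEAT_REQUIRED,
        "disk_used_percent": used_percent,
        "units": {"disk_used_percent": "%"},
    }


if __name__ == "__main__":
    # Emit the metrics as one JSON object on standard output.
    print(json.dumps(collect_metrics()))
```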

Reports, Dashboards, & IT Automation

Receive a detailed root cause analysis report for every server downtime, view your entire server environment in one dashboard, and automate fault resolution with exclusive IT Automation tools. 

Troubleshooting Tips

Having trouble with any of your servers or installed applications? Check our troubleshooting tips.
