- Notifications
You must be signed in to change notification settings - Fork 513
Description
Nagios is a free and open-source software application that provides monitoring and alerting services for servers, switches, applications, and services. It watches hosts and services that the owner specifies, alerting the owner when things go wrong and again when they get better. Nagios XI is an extended interface of Nagios Core or the original Nagios, intended as the enterprise-level version of the monitoring tool. Nagios XI has REST API which allows users to export data from the backend. We will build an agent-based integration that will allow customers to ingest status and event information from Nagios to Elastic Cloud for Observability use-cases using the aforementioned REST API.
We will collect the following metrics:
| Metric Name | Description |
|---|---|
| nagios.host.pl | The host command to probe the specified host for packet loss (%) |
| nagios.host.rta | The host command to probe the specified host for round trip average travel time (ms) |
| nagios.metrics.processed.per.execution | The number of metrics processed in an execution (count) |
| nagios.script.execution.time | The time taken by the script to process metrics per execution (s) |
| nagios.service.current.load.load1 | The current system load average over 1 minute |
| nagios.service.current.load.load15 | The current system load average over 5 minutes |
| nagios.service.current.load.load5 | The current system load average over 15 minutes |
| nagios.service.current.users.users | The number of users currently logged in on the local system |
| nagios.service.http.size | The HTTP check command response message size (Bytes) |
| nagios.service.http.time | The HTTP check command response time (s) |
| nagios.service.ping.pl | The ping command to probe the specified host for packet loss (%) |
| nagios.service.ping.rta | The ping command to probe the specified host for round trip average travel time (ms) |
| nagios.service.root.partition | The amount of used disk space on a mounted file system (MB) |
| nagios.service.ssh.time | The time taken to connect to an SSH server at specified server and port |
| nagios.service.swap.usage.swap | The swap memory usage on machine (MB) |
| nagios.service.total.processes.procs | The number of currently running processes |
And we will collect and parse the following event logs:
- ServiceFlappingAlert
- AcknowledgeSvcProblem
- ServiceAlert
- HostAlert
- AcknowledgeHostProblem
- ServiceNotification
- HostDowntimeAlert
- ProcessServiceCheckResult
- ServiceDowntimeAlert
Integration release checklist
This checklist is intended for integrations maintainers to ensure consistency
when creating or updating a Package, Module or Dataset for an Integration.
All changes
- Change follows the contributing guidelines
- Supported versions of the monitoring target are documented
- Supported operating systems are documented (if applicable)
- Integration or System tests exist
- Documentation exists
- Fields follow ECS and naming conventions
- At least a manual test with ES / Kibana / Agent has been performed.
- Required Kibana version set to: