SNMP device monitoring with Netdata
Collects data from any SNMP device and uses the net-snmp module.
It supports:
- all SNMP versions: SNMPv1, SNMPv2c and SNMPv3
- any number of SNMP devices
- each SNMP device can be used to collect data for any number of charts
- each chart may have any number of dimensions
- each SNMP device may have a different update frequency
- each SNMP device will accept one or more batches to report values (you can set `max_request_size` per SNMP server, to control the size of batches).
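As a rough illustration of the batching idea, the sketch below splits a list of OIDs into requests of at most `max_request_size` entries. It is not the collector's own code: the function name `splitIntoBatches` is hypothetical, and the `.16` (out octets) OIDs are the standard IF-MIB counterparts of the `.10` OIDs used in this document, added here only for the example.

```javascript
// Split a list of OIDs into batches of at most maxRequestSize entries.
// Hypothetical helper, shown only to illustrate what max_request_size controls.
function splitIntoBatches(oids, maxRequestSize) {
  if (!Number.isInteger(maxRequestSize) || maxRequestSize < 1) {
    throw new RangeError("maxRequestSize must be a positive integer");
  }
  const batches = [];
  for (let i = 0; i < oids.length; i += maxRequestSize) {
    batches.push(oids.slice(i, i + maxRequestSize));
  }
  return batches;
}

// 48 OIDs: in/out octets for 24 ports.
const oids = [];
for (let port = 1; port <= 24; port++) {
  oids.push(`1.3.6.1.2.1.2.2.1.10.${port}`, `1.3.6.1.2.1.2.2.1.16.${port}`);
}

const batches = splitIntoBatches(oids, 20);
console.log(batches.map((b) => b.length)); // [ 20, 20, 8 ]
```

A smaller batch size means more round trips to the device, but each response is smaller.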
#
Requirements
`nodejs`, minimum required version 4
#
Configuration
You will need to create the file `/etc/netdata/node.d/snmp.conf` with data like the following.
In this example:
- the SNMP device is `10.11.12.8`.
- the SNMP community is `public`.
- we will update the values every 10 seconds (`update_every: 10` under the server `10.11.12.8`).
- we define 2 charts `snmp_switch.bandwidth_port1` and `snmp_switch.bandwidth_port2`, each having 2 dimensions: `in` and `out`. Note that the charts and dimensions must not contain any white space or special characters, other than `.` and `_`.
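The points above could be captured in a structure like the following sketch, built as a JavaScript object and printed as JSON. Several key names are assumptions that the text does not confirm (`servers`, `hostname`, `charts`, `dimensions`, `units`, `algorithm`), as are the `.16` out-octets OID and the `8`/`1024` scaling values. The name check reads "no white space or special characters" as "letters, digits, `.` and `_` only", which is also an interpretation.

```javascript
// Assumed interpretation of the naming rule: letters, digits, "." and "_".
const VALID_NAME = /^[A-Za-z0-9._]+$/;

// Build one bandwidth chart for a given port (hypothetical helper).
function bandwidthChart(port) {
  return {
    title: `Switch Bandwidth for port ${port}`,
    units: "kilobits/s",
    family: "ports",
    priority: port,
    dimensions: {
      in: { oid: `1.3.6.1.2.1.2.2.1.10.${port}`, algorithm: "incremental", multiplier: 8, divisor: 1024, offset: 0 },
      out: { oid: `1.3.6.1.2.1.2.2.1.16.${port}`, algorithm: "incremental", multiplier: -8, divisor: 1024, offset: 0 },
    },
  };
}

// Key names other than community and update_every are assumptions.
const config = {
  servers: [
    {
      hostname: "10.11.12.8",
      community: "public",
      update_every: 10,
      charts: {
        "snmp_switch.bandwidth_port1": bandwidthChart(1),
        "snmp_switch.bandwidth_port2": bandwidthChart(2),
      },
    },
  ],
};

// Collect chart and dimension names that break the naming rule.
function invalidNames(cfg) {
  const bad = [];
  for (const server of cfg.servers) {
    for (const [chartId, chart] of Object.entries(server.charts)) {
      if (!VALID_NAME.test(chartId)) bad.push(chartId);
      for (const dimId of Object.keys(chart.dimensions)) {
        if (!VALID_NAME.test(dimId)) bad.push(`${chartId}/${dimId}`);
      }
    }
  }
  return bad;
}

console.log(JSON.stringify(config, null, 4));
console.log("invalid names:", invalidNames(config));
```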
`update_every` is the update frequency for each server, in seconds.
`max_request_size` limits the maximum number of OIDs that will be requested in a single call. The default is 50. Lower this number if you get `TooBig` errors in Netdata's `error.log`.
`family` sets the name of the submenu of the dashboard each chart will appear under.
`multiplier` and `divisor` are passed by the plugin to the Netdata daemon and are applied to the metric to convert it properly to `units`. For incremental counters, with the exception of Counter64 type metrics, `offset` is added to the metric from within the SNMP plugin. This means that the value you will see in debug mode in the `DEBUG: setting current chart to... SET` line for a metric will not have been multiplied or divided, but it will have had the offset added to it.
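The division of work described above can be sketched as two separate steps. This is only an illustration under stated assumptions: the function names are hypothetical, the values `8` and `1024` are example settings for converting bytes to kilobits, and the daemon's own rate calculation for incremental counters is left out.

```javascript
// Step 1 (inside the plugin): only the offset is applied.
// This is the value that would show up in the debug SET line.
function valueSentByPlugin(raw, dim) {
  return raw + dim.offset;
}

// Step 2 (inside the daemon): multiplier and divisor convert to chart units.
function scaleInDaemon(value, dim) {
  return (value * dim.multiplier) / dim.divisor;
}

// Illustrative dimension settings: bytes -> kilobits.
const dim = { multiplier: 8, divisor: 1024, offset: 100 };

const sent = valueSentByPlugin(5000, dim);
console.log(sent); // 5100: offset added, no multiplier or divisor yet

// A rate of 128000 bytes/s expressed in kilobits/s.
console.log(scaleInDaemon(128000, dim)); // 1000
```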
Caution: Counter64 metrics do not support `offset` (issue #5028).
The SNMP plugin supports Counter64 metrics with the only limitation that the `offset` parameter should not be defined. Due to the way JavaScript handles large numbers and the fact that the offset is applied to metrics inside the plugin, the offset will be ignored silently.
If you need to define many charts using incremental OIDs, you can use something like this:
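A minimal sketch of such a chart definition, written as a JavaScript object and printed as JSON. The exact shape is an assumption: `multiply_range` is shown as a two-element `[from, to]` array, the chart id, title and OIDs are left open-ended so that a number can be appended, and the key names `units`, `dimensions` and `algorithm` as well as the `.16` OID and the scaling values are not confirmed by the text.

```javascript
// Assumed shape of a chart that is repeated for ports 1 to 24.
const charts = {
  "snmp_switch.bandwidth_port": {
    title: "Switch Bandwidth for port ",
    units: "kilobits/s",
    family: "ports",
    multiply_range: [1, 24], // assumed [from, to] form, both inclusive
    priority: 1,
    dimensions: {
      in: { oid: "1.3.6.1.2.1.2.2.1.10.", algorithm: "incremental", multiplier: 8, divisor: 1024, offset: 0 },
      out: { oid: "1.3.6.1.2.1.2.2.1.16.", algorithm: "incremental", multiplier: -8, divisor: 1024, offset: 0 },
    },
  },
};

console.log(JSON.stringify(charts, null, 4));
```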
This is like the previous example, but the `multiply_range` option multiplies the current chart from `1` to `24` inclusive, producing 24 charts in total for the 24 ports of the switch `10.11.12.8`.
Each of the 24 new charts will have its id (1-24) appended to:
- its chart unique id, i.e. `snmp_switch.bandwidth_port1` to `snmp_switch.bandwidth_port24`
- its `title`, i.e. `Switch Bandwidth for port 1` to `Switch Bandwidth for port 24`
- its `oid` (for all dimensions), i.e. dimension `in` will be `1.3.6.1.2.1.2.2.1.10.1` to `1.3.6.1.2.1.2.2.1.10.24`
- its priority (which will be incremented for each chart so that the charts will appear on the dashboard in this order)
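The expansion rules listed above can be sketched as a small function. This is an illustration of the described behaviour, not the plugin's actual implementation: `expandChart` is a hypothetical name, and the `[from, to]` form of `multiply_range` and the `dimensions` key are assumptions.

```javascript
// Expand one chart template into one chart per number in multiply_range.
function expandChart(baseId, chart) {
  const [from, to] = chart.multiply_range;
  const { multiply_range, ...rest } = chart; // drop the range from the copies
  const expanded = {};
  for (let n = from; n <= to; n++) {
    const dimensions = {};
    for (const [name, dim] of Object.entries(chart.dimensions)) {
      dimensions[name] = { ...dim, oid: dim.oid + n }; // number appended to each oid
    }
    expanded[baseId + n] = {
      ...rest,
      title: chart.title + n, // number appended to the title
      priority: chart.priority + (n - from), // keeps dashboard order
      dimensions,
    };
  }
  return expanded;
}

const template = {
  title: "Switch Bandwidth for port ",
  multiply_range: [1, 24],
  priority: 1,
  dimensions: { in: { oid: "1.3.6.1.2.1.2.2.1.10." } },
};

const expanded = expandChart("snmp_switch.bandwidth_port", template);
console.log(Object.keys(expanded).length); // 24
console.log(expanded["snmp_switch.bandwidth_port24"].dimensions.in.oid); // 1.3.6.1.2.1.2.2.1.10.24
```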
The `options` given for each server are:
- `port` - UDP port to send requests to. Defaults to `161`.
- `retries` - number of times to re-send a request. Defaults to `1`.
- `sourceAddress` - IP address from which SNMP requests should originate. There is no default for this option; the operating system will select an appropriate source address when the SNMP request is sent.
- `sourcePort` - UDP port from which SNMP requests should originate. Defaults to an ephemeral port selected by the operating system.
- `timeout` - number of milliseconds to wait for a response before re-trying or failing. Defaults to `5000`.
- `transport` - the transport to use, either `udp4` or `udp6`. Defaults to `udp4`.
- `version` - either `0` (v1), `1` (v2) or `3` (v3). Defaults to `0`.
- `idBitsSize` - either `16` or `32`. Defaults to `32`. Used to reduce the size of the generated id for compatibility with some older devices.
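The defaults listed above can be summarised as an object that per-server `options` override. This is a sketch for reading the list, not the collector's code; `resolveOptions` is a hypothetical helper. `sourceAddress` and `sourcePort` are left out of the defaults because the operating system chooses them.

```javascript
// Documented defaults for the per-server options.
const DEFAULT_OPTIONS = {
  port: 161,
  retries: 1,
  timeout: 5000, // milliseconds
  transport: "udp4",
  version: 0, // 0 = v1, 1 = v2, 3 = v3
  idBitsSize: 32,
};

// Merge user-supplied options over the defaults (hypothetical helper).
function resolveOptions(overrides = {}) {
  return { ...DEFAULT_OPTIONS, ...overrides };
}

// Example: an SNMPv2c device that needs a longer timeout.
const resolved = resolveOptions({ version: 1, timeout: 10000 });
console.log(resolved);
```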
#
SNMPv3
To use SNMPv3:
- set `version` to 3
- use `user` instead of `community`
User syntax:
Security levels:
- 1 is `noAuthNoPriv`
- 2 is `authNoPriv`
- 3 is `authPriv`
Authentication protocols:
- "1" is `none`
- "2" is `md5`
- "3" is `sha`
Privacy protocols:
- "1" is `none`
- "2" is `des`
For additional details please see net-snmp module readme.
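The numeric codes above can be read with simple lookup tables, as in the sketch below. The field names of the `user` object (`name`, `level`, `authProtocol`, `authKey`, `privProtocol`, `privKey`) and all values in it are assumptions for illustration; check the net-snmp module readme for the authoritative syntax.

```javascript
// Lookup tables taken from the lists above.
const SECURITY_LEVELS = { 1: "noAuthNoPriv", 2: "authNoPriv", 3: "authPriv" };
const AUTH_PROTOCOLS = { "1": "none", "2": "md5", "3": "sha" };
const PRIV_PROTOCOLS = { "1": "none", "2": "des" };

// Hypothetical user definition: field names and values are placeholders.
const user = {
  name: "userName",
  level: 3,
  authProtocol: "3",
  authKey: "keyAuth",
  privProtocol: "2",
  privKey: "keyPriv",
};

// Translate the numeric codes into their names for a quick sanity check.
function describeUser(u) {
  return {
    level: SECURITY_LEVELS[u.level],
    auth: AUTH_PROTOCOLS[u.authProtocol],
    priv: PRIV_PROTOCOLS[u.privProtocol],
  };
}

console.log(describeUser(user)); // { level: 'authPriv', auth: 'sha', priv: 'des' }
```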
#
Retrieving names from snmp
You can append a value retrieved from SNMP to the title, by adding `titleoid` to the chart.
You can set a dimension name to a value retrieved from SNMP, by adding `oidname` to the dimension.
Both of the above will participate in `multiply_range`.
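The effect of `titleoid` and `oidname` can be sketched with a stand-in for the device. Everything here is hypothetical: `applyNames` is not a plugin function, `fakeDevice` replaces real SNMP queries, and the two name OIDs (the standard IF-MIB `ifAlias` and `ifDescr` columns) and their returned values are picked only for illustration.

```javascript
// Stand-in for values an SNMP device might return (made-up data).
const fakeDevice = {
  "1.3.6.1.2.1.31.1.1.1.18.1": "uplink",
  "1.3.6.1.2.1.2.2.1.2.1": "port1",
};
const lookup = (oid) => fakeDevice[oid];

// Append the titleoid value to the title and name dimensions from oidname.
function applyNames(chart, lookupFn) {
  const title = chart.titleoid ? chart.title + lookupFn(chart.titleoid) : chart.title;
  const dimensions = {};
  for (const [id, dim] of Object.entries(chart.dimensions)) {
    dimensions[id] = { ...dim, name: dim.oidname ? lookupFn(dim.oidname) : id };
  }
  return { ...chart, title, dimensions };
}

const chart = {
  title: "Switch Bandwidth for port 1 ",
  titleoid: "1.3.6.1.2.1.31.1.1.1.18.1",
  dimensions: {
    in: { oid: "1.3.6.1.2.1.2.2.1.10.1", oidname: "1.3.6.1.2.1.2.2.1.2.1" },
    out: { oid: "1.3.6.1.2.1.2.2.1.16.1" },
  },
};

const named = applyNames(chart, lookup);
console.log(named.title); // Switch Bandwidth for port 1 uplink
console.log(named.dimensions.in.name, named.dimensions.out.name); // port1 out
```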
#
Testing the configuration
To test it, you can run:
The above will run it on your console and you will be able to see what Netdata sees, but also errors. You can get a very detailed output by appending `debug` to the command line.
If it works, restart Netdata to activate the snmp collector and refresh the dashboard (if your SNMP device responds with a delay, you may need to refresh the dashboard in a few seconds).
#
Data collection speed
Keep in mind that many SNMP switches and routers are very slow. They may not be able to report values per second. If you run `node.d.plugin` in `debug` mode, it will report the time it took for the SNMP device to respond. My switch, for example, needs 7-8 seconds to respond for the traffic on 24 ports (48 OIDs, in/out).
Also, if you use many SNMP clients on the same SNMP device at the same time, values may be skipped. This is a problem of the SNMP device, not this collector.
#
Finding OIDs
Use `snmpwalk`, like this:
`snmpwalk -t 20 -v 1 -O fn -c public 10.11.12.8`
- `-t 20` is the timeout in seconds
- `-v 1` is the SNMP version
- `-O fn` will display full OIDs in numeric format (you may want to run it also without this option to see human readable output of OIDs)
- `-c public` is the SNMP community
- `10.11.12.8` is the SNMP device
Keep in mind that `snmpwalk` outputs the OIDs with a dot in front of them. You should remove this dot when adding OIDs to the configuration file of this collector.
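If you copy many OIDs, removing the leading dot can be automated. A minimal sketch, with `toConfigOid` as a hypothetical helper name:

```javascript
// Remove the single leading dot that snmpwalk prints in front of an OID.
function toConfigOid(oid) {
  const trimmed = oid.trim();
  return trimmed.startsWith(".") ? trimmed.slice(1) : trimmed;
}

console.log(toConfigOid(".1.3.6.1.2.1.2.2.1.10.1")); // 1.3.6.1.2.1.2.2.1.10.1
```

OIDs that already lack the dot are returned unchanged, so the helper is safe to apply to a mixed list.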
#
Example: Linksys SRW2024P
This is what I use for my Linksys SRW2024P. It creates:
- A chart for power consumption (it is a PoE switch)
- Two charts for packets received (total packets received and packets received with errors)
- One chart for packets output
- 24 charts, one for each port of the switch. It also appends the port names, as defined at the switch, to the chart titles.
This switch also reports various other metrics, like snmp, packets per port, etc. Unfortunately it does not report CPU utilization or backplane utilization.
This switch has a very slow SNMP processor. To respond, it needs about 8 seconds, so I have set the refresh frequency (`update_every`) to 15 seconds.