SNMPSTAT monitoring system.
(Version 1.5)
User guide.
Table of content.
2.5.3. Router journal and tickets.
2.6.1. Link performance charts.
2.6.3. Link performance - data view.
2.6.4. Link - Journal and tickets.
2.9. Daily and weekly reports.
2.10. Changing password and managing
users.
SNMPSTAT is monitoring system, designed specifically
to monitor network devices. It is a core component of monitoring server,
because it includes http service and user directory, shared by
other components.
System components
How it works:
System collects performance data for most objects, and stores this data forever, using aggregation to limit disk usage. Data are recorded for every 6 minutes (configurable) for the first months, aggregated for 1-hour interval after 3 month. This data may be used for graphs, reports or seen in raw format.
See ‘Introduction into SNMPSTAT’ for additional details.
System menu (on the top) calls different components of monitoring system (on this screen – documentation; mail archive for error notifications; inventory data base; log server; IDS; CCR system; ipmonitor server; other monitoring servers in the network; ProBIND; other components; user administration screen). One button here is a part of snmpstat system – ADMIN button, which allows to create / modify / delete users of this system.
Next line – snmpstat menu and snmpstat status line. Menu has few buttons; status line shows summary status of your system, and plays alarm music (when required).
Next frame used by different components of the system; here you can see most commonly used snmpstat screen – ‘show active links’ screen, which shows all objects in unusual state and objects, which are not in idle state (have some traffic). This frame reused by few buttons on the main system menu (top line).
SNMPSTAT presents data in different table formats. Most views shows list of objects, with statistic bars (tiny colored line on the right) and additional information. You can always click on the object name to open object-specific view (it is opened on the separate screen).
Condensed views are a very important part of the snmpstat web interface. In addition to the embedded filters and rich set of color-coded states, they allow seeing all networks in a glance, in a very condensed and informative format.
You can see here a lot of information in a very compact form:
This section describes how to use SNMPSTAT system by the operator. For configuration issues, read SNMPSTAT configuration guide.
Status frame is on the left:
· Colored boxes present summary status line, numbers are numbers of objects in every state. We do not see any new failures on this screen – 6 Bgp objects in OK state, 323 Channel objects in OK state, 3 in Debugging (deep blue color) state, 6 in Unused show (black) state, 30 Router objects in OK state.
· [music] – notification mode (sound is turned on);
· Router – selection button, allows quick access to the full router views;
(If you cannot see status
frame, redraw screen, or redraw status frame by clicking on ‘sound on’ or
‘sound off’ button.).
Main menu is on the right:
Click on SHOW to open view selection menu:
This menu selects different views and search object. There are 4 types of view:
Next part of window allows searching for the object.
Low part control font size, number of lines and columns on the screen, and refresh time.
Below is an example of ‘active’ view:
(As we can see, one of links became NORMAL less than 5 minutes ago – bright green color).
You can open full router view by:
o click on the router name on the network view;
o select full view from ‘SHOW’ screen (and enter router name);
o Select router by name from Status frame.
o Click on ‘stat’ button in router menu.
In first two cases, router view will be opened in separate window.
(If nothing happen, when you
click on the router name, check you minimized windows. Window title starts with
‘Total’ word).
Here we can see:
First analyze router information:
Let’s continue with links. For every link, we have:
(Link utilization scaled to the declared link bandwidth).
Click on ‘graphs’ or ‘zoom’ (‘zoom’ means _zoomed graphs) to see a performance graphs:
Here you can see CPU performance and used memory for the current day. Blue marks system failures.
To change a day, change date in the window (entering new date or clicking + or – button) and click outside of this window to redraw chart.
To see monthly statistics, click on ‘months’ button.
Click on report to see performance report (the same rules for date selection):
Other menu items are:
· stat – raw statistics file;
· config – link to the configuration (optional);
· enter – link to ‘telnet’ into the router (optional);
· archive – fast access to the archived data (including raw data);
· card – informational card (optional) about router;
· zoomed – performance graphs in other scale;
· alerts – alert configuration (see configuration guide);
· journal – system journal and tickets, see below.
Raw data example:
04.04.23 17:12:16 12/30 17532/17532 0
04.04.23 17:15:16 5/11 17531/17529 0
04.04.23 17:18:16 9/16 17520/17514 0
04.04.23 17:21:16 14/17 17532/17532 0
04.04.23 17:24:17 10/12 17655/17533 0
04.04.23 17:27:16 8/11 17718/17541 0
04.04.23 17:30:16 43/99 17490/17475 0
04.04.23 17:32:15 DOWN
04.04.23 17:32:15 UP
04.04.23 17:39:17 7/8 17910/17910 0
Data fields:
- date;
- time;
- CPU load (average/Maximum), %;
- Free memory (average/maximum), Kb.
Click on ‘Journal’ to open journal and ticket template.
Picture 9. Journal screen for the router (example).
Screen elements:
- menu bar;
- current object state (created when you selected object, it is not real time state);
- permanent tickets (are nit shown here);
- ticket template and other existing tickets;
- object journal.
To change object state[5]:
· For permanent state change, select ‘Permanent comment’ (by default, temporary ticket will be created – ‘Comment to the event’)[6];
· Set new state in Set up: selection field;
· Fill in time of life, if you want to create time-limited ticket;
· Change ‘Remove…’ field, if you do not want this ticket to be removed, when object change its status (some objects can change status back and forth few times, for example, overload conditions can change or link errors fluctuate).
· Write out a comment;
· Checkboxes below control mail notification – if checked, system will send e-mail when you write this ticket;
· Click on ‘Write’ (or ‘Replace’ button), or you can click on ‘Journal record only’ button if you want to write a message, but not create a status change ticket.
See example of permanent ticket setting below (in the ‘Link journal and tickets’ section).
Lets return to the ‘condensed system view’ or ‘router full view’ and click on the link name to open Link view screen:[7]
To draw
performance graphs, click graphs or zoom:
Average packet size is calculated from link utilization and packets/second rate. Blue risks mark link failures. To change date, use ‘+’ or ‘-‘ buttons or enter new date into the date field. To see monthly statistics, click on ‘month’ button. If you change date, you should click on ‘graph’ button again.
Click on ‘report’:
Click on ‘raw. st.’[8]:
Data fields are:
- date;
- time;
- status change or bandwidth, kbits/second;
- inbound link utilization, average / maximum, %;
- inbound link performance, packets/second (average / maximum);
- outbound link utilization, % (average / maximum);
- outbound link performance, packets/second (average / maximum);
- inbound error level, % (average / maximum);
- outbound error level, % (average / maximum);
- inbound packet drop level, % (average / maximum);
- outbound packet drop level, % (average / maximum).
Click on ‘journal’ to open link journal and ticket (see ‘Picture 10. Example of temporary ticket.’ above). Tickets and journal are similar to the router ticket and journal (see ‘2.5.3. Router journal and tickets.’, but object type is ‘Channel’ instead of ‘Router’. To acknowledge event, write a message into the journal, and change object state (temporary or permanently):
- select ticket type (normal or permanent);
- select new object state;
- fill in expiration time, if necessary;
- write out comment;
- set up ‘Remove …’ field;
- Click on write or Journal record only[9].
Other menu buttons are:
- alerts – alert configuration (see Configuration guide);
- archive – historical archive;
- card – informational record about link. This is optional service, which depends of the installation.
BGP links are monitored in snmpstat 1.5, but does not
initiate alerts and do not collect statistics. They can be seen in Total
router view (see ‘2.5.1.
Full router view.’):
To open daily system journal,
use log button in the main menu (‘Picture 3. Main menu (in 1 line format):’).
This journal contains duplicates of all events for all objects, organized by days.
Menu items:
- alerts – configure alert setting for the all objects;
- view – view system journal;
- write - write record into the journal;
- raw format – records in raw format.
To see list of predefined reports, use reports button in main menu (‘2.3. Main menu.’):
This reports are prepared every day and every week, and stored 1 month (by default). Reports covers all network elements.
User management described in ‘SNMPSTAT configuration guide.’ . Use ADMIN button in the first line of system screen (‘2.1. Main screen.’). Regular user can only change his password, administrator can change/delete other users.
Groups are apache access groups, with a few important
exceptions:
- admin allows to administrate users list, except superadmin;
- superadmin can do anything;
- tacacs and tacacs_7 means create this user in tacacs+.
Other groups are (by default, it can be changed in apache configurations):
- monitor – access to different monitors;
- read – read access to the tickets and journal;
- write – write access to tickets and journal;
- logs – access to system logs (if implemented);
- config – access to configurations in CCR system;
- saveconfig – allows to configure CCR system;
- mrtg – access to mrtg (if implemented);
- audit – access to change control archive (if implemented).
(Groups assignment depends of the installation).
Standard assignments are:
- read, monitor, mrtg, logs, write – system operator;
- --//--, config, saveconfig, audit – network administrator;
- --//--, admin – user administrator.
Read ‘SNMPSTAT configuration’ for additional information about users management.
1. SNMPSTAT documentation index.
[1] All pictures are examples; real screen views can differ from these examples, because system can be configured by other ways.
[2] Screen views and menu items are documented as is, without correcting language errors.
[3] System uses a few synonyms for the channel – channel, link, and port – all means the same (MIB2 interface, which can be interface on router, port on switch, or network link on server or firewall).
[4] Change text size to the ‘small font’ in Internet explorer, if required to fit picture into your screen.
[5] It is recommended
to set up tickets and change object state for all failures. Status line shows
only 1 failure, so if you do not change state (masquerade known failure), you
will not see new failures on the state bar. If you use sound signals, ticket is
the only way to stop sound without turning it off globally. If you configured
alerts so that system sends alert on the first event only, system will not send new alert until you
masquerade first one. SNMPSTAT design proposes, that you masquerade (change state) all failures.
[6] Read ‘Introduction to SNMPSTAT’ for additional information about tickets.
[7] This screen
will be opened in separate browser window, which is reused. If you cannot see
new windows on the screen, check you minimized windows. Window title starts as Channel.
[8] Adjust text size, if necessary.
[9] To delete a ticket, click on Delete button for this ticket.