% Network Management & Monitoring
% Smokeping - Part II

5. Add new probes to Smokeping
------------------------------

The current entry in the Probes file is fine, but if you wish to use
additional Smokeping checks you can add them here and specify their
default behavior. You can also do this in the Targets file if you wish.

To add a probe to check for HTTP latency as well as DNS lookup latency,
edit the Probes file and add the following text TO THE END of that file:

    # vi Probes

~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
+ EchoPingHttp

+ DNS
binary = /usr/bin/dig
pings = 5
step = 180
lookup = www.kenet.or.ke
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~

The DNS probe will look up the IP address of www.kenet.or.ke using any
open DNS server (resolver) you specify in the Targets file. You will see
this a bit further on in the exercises.

Now save and exit from the file and verify that your changes are working:

    # service smokeping reload

6. Add HTTP latency checks for the classroom PCs
------------------------------------------------

Edit the Targets file again and go to the end of the file:

    # vi Targets

At the end of the file add:

~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
#
# Local Web server response
#

+HTTP
menu = Local HTTP Response
title = HTTP Response Student PCs

++host1
menu = host1
title = host1 HTTP response time
probe = EchoPingHttp
host = host1.ws.nsrc.org

++host2
menu = host2
title = host2 HTTP response time
probe = EchoPingHttp
host = host2.ws.nsrc.org

++host3
menu = host3
title = host3 HTTP response time
probe = EchoPingHttp
host = host3.ws.nsrc.org

++host4
menu = host4
title = host4 HTTP response time
probe = EchoPingHttp
host = host4.ws.nsrc.org
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~

You could actually use the "probe = EchoPingHttp" statement just once, for
host1, and this would then be the default probe until another "probe = "
statement is seen in the Targets file.

You can add more PC entries if you wish, or you could consider checking
the latency on remote machines - these are likely to be more interesting.
Your own publicly accessible servers are a good choice, as are other web
servers you use often (Google, Yahoo, Government pages, stores, etc.).
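
The host entries above all follow the same five-line pattern, so if you
have many machines you may prefer to generate the text rather than type
it. The following is only a sketch of that idea: the function name
`render_http_targets` and its arguments are made up for this example and
are not part of Smokeping. It prints text that you would review and then
paste into the Targets file yourself.

```python
def render_http_targets(section, menu, title, hosts, probe="EchoPingHttp"):
    """Return Targets-file text for one HTTP section (illustrative helper).

    hosts is a list of (name, menu, host) tuples, one per machine.
    """
    # Section header, followed by a blank line as in the exercise.
    lines = [f"+{section}", f"menu = {menu}", f"title = {title}", ""]
    for name, host_menu, host in hosts:
        # One "++" entry per machine, in the same order as the exercise.
        lines += [
            f"++{name}",
            f"menu = {host_menu}",
            f"title = {name} HTTP response time",
            f"probe = {probe}",
            f"host = {host}",
            "",
        ]
    return "\n".join(lines)


# The four classroom PCs used in this exercise.
classroom = [(f"host{i}", f"host{i}", f"host{i}.ws.nsrc.org") for i in range(1, 5)]
text = render_http_targets(
    "HTTP", "Local HTTP Response", "HTTP Response Student PCs", classroom
)
print(text)
```

The same helper can produce entries for remote web servers by passing
their names and addresses in the list instead of the classroom PCs.
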
For example, consider adding something like this at the bottom of the
Targets file:

~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
#
# Remote Web server response
#

+HTTPRemote
menu = Remote HTTP Response
title = HTTP Response Remote Machines

++usiu
menu = United States International University
title = usiu.ac.ke HTTP response time
probe = EchoPingHttp
host = usiu.ac.ke

++uonbi
menu = University of Nairobi
title = uonbi.ac.ke HTTP response time
probe = EchoPingHttp
host = uonbi.ac.ke
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~

Add the hosts that you use at your own organization to the list of Remote
Web Servers. Once you are done, save and exit from the Targets file and
verify your work:

    # service smokeping reload

7. Add DNS latency checks
-------------------------

At the end of the Targets file we are going to add some entries to measure
the latency from our location to remote recursive DNS servers when they
look up an entry for www.kenet.or.ke. You would likely substitute an
important address for your institution in the Probes file instead. In
addition, you can change the address you are looking up inside the Targets
file as well. For more information see:

and

Now edit the Targets file again. Be sure to go to the end of the file:

    # cd /etc/smokeping/config.d      (just to be sure...)
    # vi Targets

At the end of the file add:

~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
#
# Sample DNS probe
#

+DNS
probe = DNS
menu = DNS Latency
title = DNS Latency Probes

++LocalDNS1
menu = 10.10.0.241
title = DNS Delay for local DNS Server on s1.ws.nsrc.org
host = s1.ws.nsrc.org

++KenetSecondCache
menu = 41.89.1.4
title = KENET Second caching DNS Server
host = 41.89.1.4

++GoogleA
menu = 8.8.8.8
title = DNS Latency for google-public-dns-a.google.com
host = google-public-dns-a.google.com

++GoogleB
menu = 8.8.4.4
title = DNS Latency for google-public-dns-b.google.com
host = google-public-dns-b.google.com

++OpenDNSA
menu = 208.67.222.222
title = DNS Latency for resolver1.opendns.com
host = resolver1.opendns.com

++OpenDNSB
menu = 208.67.220.220
title = DNS Latency for resolver2.opendns.com
host = resolver2.opendns.com
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~

Now save the Targets file, exit, and verify your work:

    # service smokeping reload

Look at additional Smokeping probes and consider implementing some of them
if they are useful to your organization:

8. MultiHost graphing
---------------------

Once you have defined a group of hosts under a single probe type in your
/etc/smokeping/config.d/Targets file, you can create a single graph that
shows the results of all Smokeping tests for all the hosts that you
define. This has the advantage of letting you quickly compare, for
example, a group of hosts that you are monitoring with the FPing probe.

The MultiHost graph function in Smokeping is extremely picky - pay close
attention!

To create a MultiHost graph first edit the file Targets:

    # vi Targets

We will create a MultiHost graph for the DNS Latency probes we just added.
To do this go to the end of the Targets file and add:

~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
#
# Multihost Graph of all DNS latency checks
#

++MultiHostDNS
menu = MultiHost DNS
title = Consolidated DNS Responses
host = /DNS/LocalDNS1 /DNS/KenetSecondCache /DNS/GoogleA /DNS/GoogleB /DNS/OpenDNSA /DNS/OpenDNSB
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~

And, as always, save and exit from the file Targets and test your new
configuration:

    # service smokeping reload

If this fails you almost certainly have an error in the entries. If you
cannot figure out what the error is (remember to try
"tail /var/log/syslog" first!) ask your instructor for some help.

You can add MultiHost graphs for any other set of probe tests (FPing,
EchoPingHttp) that you have configured. You must add the MultiHost entry
at the end of a probe section. If you do not understand how this works you
can ask your instructors for help. In addition, on the workshop NOC there
are sample configuration files available, including one for SmokePing that
includes multiple MultiHost graph examples.

9. Send Smokeping alerts
------------------------

If you wish to receive an email when an alert condition is met on one of
the Smokeping checks, first do this:

    # cd /etc/smokeping/config.d
    # vi Alerts

Update the top of the file where it says:

    *** Alerts ***
    to = alertee@address.somewhere
    from = smokealert@company.xy

to include a proper "to" and "from" field for your server. Something like:

    *** Alerts ***
    to = sysadm@localhost
    from = smokeping-alert@localhost

Now you must update your device entries to include a line that reads:

    alerts = alertName1, alertName2, etc, etc...

For instance, the alert named "someloss" has already been defined in the
file Alerts:

To read about Smokeping alerts, what they detect, how to create your own,
etc. see:

and at the bottom of the page is a section titled `*** Alerts ***`

To place some alert detection on some of your hosts open the file Targets:

    # vi Targets

and go near the start of the file where we defined our PCs. Just under the
"host =" line add another line that looks like this:

    alerts = someloss

So, for example, the host1 entry would look like this:

~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
++host1
menu = host1
title = Virtual Machine 1
host = host1.ws.nsrc.org
alerts = someloss
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~

If you want to add an alerts option to other hosts, go ahead. Once you are
done, save and exit from the Targets file and then verify that your
configuration works:

    # service smokeping reload

If any of the hosts that have the "alerts = " option set meet the
conditions to set off the alert, then an email will arrive in the sysadm
user's mailbox on the Smokeping server machine (localhost). It's not
likely that an alert will be set off for most machines. To check, you can
read the email for the root user by using an email client like mail, which
we installed in part 1:

    # mail

To leave the mail program, type the following at its prompt:

    exit

10. Slave instances - Informational Only
----------------------------------------

This description is for informational purposes only, in case you wish to
attempt this type of configuration once the workshop is over.

The idea is that you can run multiple Smokeping instances at multiple
locations that monitor the same hosts and/or services as your master
instance. The slaves send their results to the master server and you see
these results side-by-side with your local results. This allows you to
view how users outside your network see your services and hosts.

This can be a powerful tool for resolving service and host issues that may
be difficult to troubleshoot if you only have local data.
Graphically this looks like this:

~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
   [slave 1]     [slave 2]      [slave 3]
       |             |              |
       +-------+     |     +--------+
               |     |     |
               v     v     v
            +---------------+
            |    master     |
            +---------------+
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~

You can see an example of this data here:

Look at the various graph groups and notice that many of the graphs have
multiple lines, with the color code chart listing items such as
"median RTT from mipsrv01" - these are not MultiHost graphs, but rather
graphs with data from external Smokeping servers.

To configure a Smokeping master/slave server you can see the documentation
here:

In addition, a sample set of steps for configuring this is available in
the file sample-smokeping-master-slave.txt, which should be listed as an
additional reference at the bottom of the Agenda page on your classroom
wiki.