Changes for page Monitoring your deployed Applications
Last modified by Pascal Robert on 2013/11/24 17:08
From version 18.1
edited by David Avendasora
on 2010/11/30 05:43
on 2010/11/30 05:43
Change comment:
There is no comment for this version
To version 21.1
edited by Pascal Robert
on 2013/11/24 17:08
on 2013/11/24 17:08
Change comment:
There is no comment for this version
Summary
-
Page properties (4 modified, 0 added, 0 removed)
Details
- Page properties
-
- Parent
-
... ... @@ -1,0 +1,1 @@ 1 +Deployment - Author
-
... ... @@ -1,1 +1,1 @@ 1 -XWiki. avendasora1 +XWiki.probert - Tags
-
... ... @@ -1,0 +1,1 @@ 1 +deployment - Content
-
... ... @@ -2,16 +2,24 @@ 2 2 3 3 === Chuck Hill === 4 4 5 -We usually just use a script that runs as a cron job. 5 +We usually just use a script that runs as a cron job. It hits a direct action on the app that returns a known "all is OK" response. If that does not get returned, alarms ring. 6 6 7 7 === Pascal Robert === 8 8 9 -We use [[Nagios>>http://nagios.org]] with the check //http plugin. This plugin actually make a HTTP (or HTTPS, if the plugin was compiled with openssl support) to our apps and check a DirectAction to see if it's answer "1". If not, Nagios send a trigger.//9 +We use [[Nagios>>url:http://nagios.org||shape="rect"]] with the check_http plugin. This plugin actually make a HTTP (or HTTPS, if the plugin was compiled with openssl support) to our apps and check a DirectAction to see if it's answer "1". If not, Nagios send a trigger. 10 10 11 -= ===Ray Kiddy====11 +(% style="color: rgb(0,0,0);" %)**Ray Kiddy** 12 12 13 -We use nagios in several ways. We use check //ping to make sure the machines are alive. We use check//http to check that the front-end of our cluster is responding on port 80 and we have another service that checks WebDAV directories that we use. These last need to check that authentication is working, as well as access. We also have a DirectAction that checks the whole chain, from the web server to the apps and then to the databases. We send pages only from this last service.13 +We use nagios in several ways. We use check_ping to make sure the machines are alive. We use check_http to check that the front-end of our cluster is responding on port 80 and we have another service that checks WebDAV directories that we use. These last need to check that authentication is working, as well as access. We also have a DirectAction that checks the whole chain, from the web server to the apps and then to the databases. We send pages only from this last service. 14 14 15 -We check that wotaskd is running with a script that acts as a plugin. It would be nice if we could use check //http but there is no single string which tells us what the wotaskd is doing. For example, it would be helpful if there was, somewhere, a single line in the wotaskd port 1085 output which said something like "host: am1.bigserver.com, applications expected 200, applications running 200". Then we could just use the s parameter on check//http instead of doing a bunch of extra stuff ourselves.15 +We check that wotaskd is running with a script that acts as a plugin. It would be nice if we could use check_http but there is no single string which tells us what the wotaskd is doing. For example, it would be helpful if there was, somewhere, a single line in the wotaskd port 1085 output which said something like "host: am1.bigserver.com, applications expected 200, applications running 200". Then we could just use the -s parameter on check_http instead of doing a bunch of extra stuff ourselves. 16 16 17 17 I prefer many small checks to help diagnose problems, but sending a page in the middle of the night is only from the one service that checks the whole chain of dependencies. We now have 176 services, but I add at least a few a week. There are always more things trying to break our deployments.... 18 + 19 +== Monitoring Presentation == 20 + 21 +A presentation about monitoring your WebObjects applications was made in 2010, you can [[watch it here>>url:http://wocommunity.org/podcasts/MonitoringYourWOApps.mov||shape="rect"]]. 22 + 23 +== Plugin for Nagios == 24 + 25 +A script that acts as a Nagios plugin (without performance data) [[is available on GitHub>>url:https://github.com/wocommunity/Deployment-Tools/tree/master/Nagios||shape="rect"]].