On 03/23/2016 10:27 AM, Yvan Calas wrote:
>> On 23 Mar 2016, at 10:25, Fabrice Jammes wrote:
>>
>> Would it please be possible to set up a Nagios instance in order to monitor the IN2P3 cluster state?
>> I would be available to write a Nagios plugin to monitor whether Qserv nodes are up/down.
> Nagios is available (and used) at CC-IN2P3. What kind of probes do you need on the cluster?
>
> ---
> Yvan Calas
> CC-IN2P3 -- Storage Group
> 21 Avenue Pierre de Coubertin
> CS 70202
> F-69627 Villeurbanne Cedex
> Tel: +33 4 72 69 41 73
>
> ########################################################################
> Use REPLY-ALL to reply to list
>
> To unsubscribe from the QSERV-L list, click the following link:
> https://listserv.slac.stanford.edu/cgi-bin/wa?SUBED1=QSERV-L&A=1

Hi Yvan,

I'd like to add the following Qserv-related probes:

1. Check that the correct ports are open on all nodes:
   on master (for now ccqserv100, ccqserv125): 2131 4040 5012 1094
   on all other ccqserv{nnn}: 5012 1094

2. A probe to check that the Qserv container is running on all nodes might also be useful:
   docker inspect --format='{{ .State.Running }}' qserv

Could you please give us the Nagios URL?

Thanks,
Fabrice
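
The two probes requested above could be sketched roughly as follows. This is only a minimal illustration, not the plugin actually deployed at CC-IN2P3: the function names (`closed_ports`, `container_running`, `check_ports`, `check_container`, `main`) and the `--master` / `--container` flags are hypothetical, and the timeouts are arbitrary. The port lists and the `docker inspect` command come from the message; the exit codes follow the usual Nagios plugin convention (0 OK, 1 WARNING, 2 CRITICAL, 3 UNKNOWN).

```python
"""Sketch of a Nagios-style Qserv probe (hypothetical names, not a deployed plugin)."""
import socket
import subprocess

# Conventional Nagios plugin exit codes.
OK, WARNING, CRITICAL, UNKNOWN = 0, 1, 2, 3

# Port lists as given in the message.
MASTER_PORTS = (2131, 4040, 5012, 1094)
WORKER_PORTS = (5012, 1094)


def closed_ports(host, ports, timeout=3.0):
    """Return the ports on `host` that refuse or time out on a TCP connect."""
    closed = []
    for port in ports:
        try:
            with socket.create_connection((host, port), timeout=timeout):
                pass
        except OSError:
            closed.append(port)
    return closed


def container_running(name="qserv"):
    """Return True/False from `docker inspect`, or None if the command fails."""
    try:
        result = subprocess.run(
            ["docker", "inspect", "--format={{ .State.Running }}", name],
            capture_output=True, text=True, timeout=10,
        )
    except (OSError, subprocess.TimeoutExpired):
        return None
    if result.returncode != 0:
        return None
    return result.stdout.strip() == "true"


def check_ports(host, ports):
    """Return (exit_code, message) for the port probe."""
    closed = closed_ports(host, ports)
    if closed:
        return CRITICAL, "CRITICAL - %s: closed ports %s" % (host, closed)
    return OK, "OK - %s: ports %s open" % (host, list(ports))


def check_container(name="qserv"):
    """Return (exit_code, message) for the container probe (runs on the node itself)."""
    running = container_running(name)
    if running is None:
        return UNKNOWN, "UNKNOWN - could not inspect container %s" % name
    if not running:
        return CRITICAL, "CRITICAL - container %s is not running" % name
    return OK, "OK - container %s is running" % name


def main(argv):
    """Usage (hypothetical): HOST [--master]  or  --container"""
    if not argv:
        print("UNKNOWN - usage: HOST [--master] | --container")
        return UNKNOWN
    if argv[0] == "--container":
        code, message = check_container()
    else:
        ports = MASTER_PORTS if "--master" in argv[1:] else WORKER_PORTS
        code, message = check_ports(argv[0], ports)
    print(message)  # Nagios reads the first line of output as the status text
    return code
```

To use this as a script, the entry point would be `sys.exit(main(sys.argv[1:]))`, for example with arguments `ccqserv100 --master` for a master node. The port probe can run from the Nagios server, while the container probe has to run on each node, since `docker inspect` only sees local containers; how remote execution would be arranged is left open here.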