	Hi Pete,

Peter Elmer wrote:
>   Hi Andreas,
> 
> On Fri, Sep 09, 2005 at 11:33:37AM +0200, Andreas Petzold wrote:
> 
>>during BABAR skim production at GridKa I just saw a client crash during
>>a server restart.
>>
>>client:
>>
>>2005-09-09 10:39:55 7107 Err : TXMessage::ReadRaw             - Error reading 8 bytes
>>2005-09-09 10:39:55 7107 Err : ReadPartialAnswer              - Error reading msg from connmgr (server [10.65.5.115:1094]).
>>
>>server:
>>
>>050909 09:17:02 4943 XrootdXeq: skimprod.7107:19@c01-013-120 login
>>
>>then no more messages from this client even after the server restart. I
>>also didn't find any error messages during the restart.
>>
>>We are running version 20050623-0016 on this dataserver.
>>
>>Unfortunately, I don't have a core from the client. If you need more
>>info like the full log, please let me know.
> 
> 
>   Which version of the client software is being run? (i.e. I assume it
> is a recent release where we are using TXNetFile taken from ROOT itself,
> so what is the ROOT release?)
> 
>   I guess it was just a single client you've seen do this so it is 
> something rare? (Not that it matters, the client shouldn't crash, but
> just to understand the scale of the problem.)

As Stephen has mentioned, we are using 18.2.1b. I've seen only one client
crash, but I'll have to dig through all the logs to be sure. Random 
samples didn't show the problem for any other job.

	Cheers,

		Andreas

> 
>                                  thanks,
>                                    Pete
> 
> -------------------------------------------------------------------------
> Peter Elmer     E-mail: [log in to unmask]      Phone: +41 (22) 767-4644
> Address: CERN Division PPE, Bat. 32 2C-14, CH-1211 Geneva 23, Switzerland
> -------------------------------------------------------------------------