Print

Print


Thanks so much Yvan,

The bug could also be related to corrupted XFS mount: 
http://oss.sgi.com/archives/xfs/2015-01/msg00167.html

/qserv is under xfs, and docker store its images here, maybe moving 
toward a better docker storage backend could solve it?

Cheers,

Fabrice
<http://oss.sgi.com/archives/xfs/2015-01/msg00167.html>
On 11/23/2015 08:58 PM, Yvan Calas wrote:
>> On 23 Nov 2015, at 19:14, Fabrice Jammes <[log in to unmask]> wrote:
>>
>> Hi Yvan,
>>
>> ccqserv132 is stalled and doesn't answer to ping, would it be please possible to restart it?
>> Would it be possible to restart it please?
>> I was doing an rsync on it and running Qserv worker node, it's strange cluster nodes are so fragile?
> This node has been rebooted. According to one of our sysadmins, this problem may come from the following error message:
>
> kernel BUG at arch/x86/mm/pageattr.c:216!
>
> and could be related to qemu:
>
> https://lkml.org/lkml/2014/6/19/755
>
> Cheers,
>
> Yvan
>
> ---
> Yvan Calas
> CC-IN2P3 -- Storage Group
> 21 Avenue Pierre de Coubertin
> CS70202
> F-69627 Villeurbanne Cedex
> Tel: +33 4 72 69 41 73
>


########################################################################
Use REPLY-ALL to reply to list

To unsubscribe from the QSERV-L list, click the following link:
https://listserv.slac.stanford.edu/cgi-bin/wa?SUBED1=QSERV-L&A=1