LISTSERV mailing list manager LISTSERV 16.5

Help for ATLAS-SCCS-PLANNING-L Archives


ATLAS-SCCS-PLANNING-L Archives

ATLAS-SCCS-PLANNING-L Archives


ATLAS-SCCS-PLANNING-L@LISTSERV.SLAC.STANFORD.EDU


View:

Message:

[

First

|

Previous

|

Next

|

Last

]

By Topic:

[

First

|

Previous

|

Next

|

Last

]

By Author:

[

First

|

Previous

|

Next

|

Last

]

Font:

Proportional Font

LISTSERV Archives

LISTSERV Archives

ATLAS-SCCS-PLANNING-L Home

ATLAS-SCCS-PLANNING-L Home

ATLAS-SCCS-PLANNING-L  July 2014

ATLAS-SCCS-PLANNING-L July 2014

Subject:

Re: T2 Reliability & Availability - June 2014

From:

"Yang, Wei" <[log in to unmask]>

Reply-To:

Atlas SCCS Planning Mailing List <[log in to unmask]>

Date:

Sat, 19 Jul 2014 01:06:06 -0700

Content-Type:

text/plain

Parts/Attachments:

Parts/Attachments

text/plain (102 lines)

Hi Charlie,

The LSF event log file is hosted by the LSF master. We switched the LSF master host during that time and I did’t notice that. As a result, Grid jobs can continue to come to SLAC but they won’t be able to get their status update (because the Grid can’t access LSF event log). ATLAS jobs are generic pilots and don’t care about this status (pilot update its status with Panda). WLCG submits a probe job to SLAC once a hour, and will check the status. So that is why saw no problem from ATLAS side, but the WLCG reliability and availability test failed for those 6 days. 

regards,
Wei Yang  |  [log in to unmask]  |  650-926-3338(O)



On Jul 19, 2014, at 12:51 AM, Young, Charles C. <[log in to unmask]> wrote:

> Hi Wei, 
> 
> Just to understand the point about LSF scheduler. Are you saying that issues with accessing LSF log files biased the numbers in the report and they are actually better? Or are you saying that LSF scheduler problem led to lower availability? 
> 
> The numbers for June are about 20% down. Translates to 6 days out of the month. I wasn't paying close attention but did not notice jobs not running for a week. Nor did I get complaints from others — someone must have been running batch jobs separate from Tier-2 production. Was the drop-off not a global one but somehow reduced the number of machines available? Cheers.
> 
> Charlie
> --
> Charles C. Young
> M.S. 43, Stanford Linear Accelerator Center      
> P.O. Box 20450                                        
> Stanford, CA 94309                                      
> [log in to unmask]                                
> voice  (650) 926 2669                        
> fax    (650) 926 2923                      
> CERN GSM +41 76 487 2069
> 
> From: <Yang>, Wei Yang <[log in to unmask]>
> Date: Thursday, July 17, 2014 7:33 PM
> To: atlas-sccs-planning-l <[log in to unmask]>
> Subject: Fwd: T2 Reliability & Availability - June 2014
> 
> fyi, We were pretty low in june. We changed the LSF scheduler master host from Solaris to Linux, and ran into subtle issues in accessing LSF log files via NFS. and it all happened when I took a few days of sick leave … there was also a scheduled outage in June.
> 
> Wei Yang  |  [log in to unmask]  |  1-650-926-3338
> 
> 
> 
> 
> Begin forwarded message:
> 
>> From: WLCG Office <[log in to unmask]>
>> Subject: RE: T2 Reliability & Availability - June 2014
>> Date: July 17, 2014 at 8:27:43 AM PDT
>> To: "project-wlcg-cb (Members of the WLCG CB)" <[log in to unmask]>
>> Cc: "project-lcg-gdb (LCG - Grid Deployment Board)" <[log in to unmask]>, "sam-support (SAM support)" <[log in to unmask]>, "[log in to unmask]" <[log in to unmask]>, "[log in to unmask]" <[log in to unmask]>
>> 
>> Dear all,
>> 
>> The final T2 reliability & availability for June 2014 is now available at:
>> 
>> https://espace2013.cern.ch/WLCG-document-repository/ReliabilityAvailability/2014/june-14/  under titles starting with "WLCG_All_Sites..."
>> 
>> The reports take into consideration all re-computation requests received in the last 10 calendar days as described in the re-computation policy. 
>> 
>> Kind regards,
>> Cath
>> 
>> 
>> -----------------------------------------------
>> WLCG Office
>> IT Dept - CERN
>> CH-1211 Genève, Switzerland
>> www.cern.ch/wlcg
>> From: WLCG Office
>> Sent: 02 July 2014 11:02
>> To: project-wlcg-cb (Members of the WLCG CB)
>> Cc: project-lcg-gdb (LCG - Grid Deployment Board); sam-support (SAM support); [log in to unmask]; [log in to unmask]
>> Subject: T2 Reliability & Availability - June 2014
>> 
>> Dear all,
>> 
>> The draft T2 reliability & availability reports for June 2014 are now available at:
>> 
>> http://sam-reports.web.cern.ch/sam-reports/2014/201406/wlcg/ under titles starting with "WLCG_All_Sites..."
>> 
>> Please verify your data and send any comments to WLCG Office by 12 July.
>> 
>> Any requests for recomputation must be submitted via GGUS within the next 10 calendar days; full details here.
>> 
>> Kind regards,
>> Cath
>> 
>> 
>> -----------------------------------------------
>> WLCG Office
>> IT Dept - CERN
>> CH-1211 Genève, Switzerland
>> www.cern.ch/wlcg
> 
> 
> Use REPLY-ALL to reply to list
> To unsubscribe from the ATLAS-SCCS-PLANNING-L list, click the following link:
> https://listserv.slac.stanford.edu/cgi-bin/wa?SUBED1=ATLAS-SCCS-PLANNING-L&A=1

########################################################################
Use REPLY-ALL to reply to list

To unsubscribe from the ATLAS-SCCS-PLANNING-L list, click the following link:
https://listserv.slac.stanford.edu/cgi-bin/wa?SUBED1=ATLAS-SCCS-PLANNING-L&A=1

Top of Message | Previous Page | Permalink

Advanced Options


Options

Log In

Log In

Get Password

Get Password


Search Archives

Search Archives


Subscribe or Unsubscribe

Subscribe or Unsubscribe


Archives

September 2016
July 2016
June 2016
May 2016
April 2016
March 2016
November 2015
September 2015
July 2015
June 2015
May 2015
April 2015
February 2015
November 2014
October 2014
September 2014
August 2014
July 2014
June 2014
April 2014
March 2014
February 2014
January 2014
December 2013
November 2013
September 2013
August 2013
June 2013
May 2013
April 2013
March 2013
February 2013
January 2013
December 2012
November 2012
October 2012
September 2012
August 2012
July 2012
June 2012
May 2012
April 2012
March 2012
February 2012
January 2012
November 2011
October 2011
September 2011
August 2011
July 2011
June 2011
May 2011
April 2011
March 2011
February 2011
January 2011
December 2010
November 2010
October 2010
September 2010
August 2010
July 2010
June 2010
May 2010
April 2010
February 2010
January 2010
December 2009
November 2009
October 2009
September 2009
August 2009
July 2009
June 2009
May 2009
April 2009
March 2009
February 2009
January 2009
December 2008
November 2008
October 2008
September 2008
August 2008
July 2008
June 2008
May 2008
April 2008
March 2008
February 2008
January 2008
December 2007
November 2007
October 2007
September 2007
August 2007
July 2007
June 2007
May 2007
April 2007
March 2007
February 2007
January 2007
December 2006
November 2006
October 2006
September 2006
August 2006
July 2006
June 2006
May 2006
April 2006
March 2006
February 2006

ATOM RSS1 RSS2



LISTSERV.SLAC.STANFORD.EDU

Secured by F-Secure Anti-Virus CataList Email List Search Powered by the LISTSERV Email List Manager

Privacy Notice, Security Notice and Terms of Use