Print

Print


Hi Charlie,

Most of them don't fit well to the RT system. Some of them are not trivial.

The main thing we need to do is OS/ZFS upgrade to handle failed SATA drives
since the failure during STEP09 was the second in a month on the same
(replaced) disk. We had planned last week but had to cancel because of the
current cosmic data run.

We have put in a method to control batch jobs, and are looking at other
ideas. This will be long term planning/development for xrootd (which Andy
will outline on Wed's OSG storage meeting at Fermi lab). We also did some
ZFS turning. We did that on half of our storage in order to see the
effectiveness of those changes.

Regards,
Wei Yang  |  [log in to unmask]  |  650-926-3338(O)





> From: "Young, Charles C." <[log in to unmask]>
> Date: Mon, 29 Jun 2009 06:30:37 -0700
> To: Wei Yang <[log in to unmask]>, atlas-sccs-planning-l
> <[log in to unmask]>
> Subject: RE: Step09 stress test postmortem for US sites
> 
> Hi Wei,
> 
> Thanks for the link. Some of these look like local problems. Are there trouble
> tickets for tracking or are they so trivial they are already fixed? Cheers.
> 
> Charlie
> --
> Charles C. Young
> M.S. 43, Stanford Linear Accelerator Center
> P.O. Box 20450   
> Stanford, CA 94309
> [log in to unmask]
> voice  (650) 926 2669
> fax    (650) 926 2923
> CERN GSM +41 76 487 2069
> 
>> -----Original Message-----
>> From: [log in to unmask]
>> [mailto:[log in to unmask]] On
>> Behalf Of Wei Yang
>> Sent: Monday, June 29, 2009 1:10 AM
>> To: atlas-sccs-planning-l
>> Subject: Step09 stress test postmortem for US sites
>> 
>> http://www.usatlas.bnl.gov/twiki/bin/view/Admins/AnalysisStep0
>> 9PostMortem
>> 
>> Wei Yang  |  [log in to unmask]  |  650-926-3338(O)
>> 
>> 
>> 
>> 
>>