Print

Print


FYI

Rafo


-------- Forwarded Message --------
Subject: 	CCPR 155445 UPDATE (files are failed to be staged out)
Date: 	Wed, 21 Feb 2018 14:57:41 -0500 (EST)
From: 	[log in to unmask]
To: 	[log in to unmask], [log in to unmask]
CC: 	[log in to unmask], [log in to unmask], [log in to unmask], 
[log in to unmask], [log in to unmask], [log in to unmask], [log in to unmask], 
[log in to unmask]



Here is an update to the help request you submitted.
When you reply to THIS message, Please DO NOT include the original text below.

Mod Date:   2018/02/21
Mod Time:   14:57:40
Mod User:   philpott
Current State:   COMPLETE
STATE changed from (RECEIVED) to COMPLETE.


--------------------------------------------------------------

Overnight a disk server hung; the issue was resolved this morning.  If
you are using SWIF, those jobs can be rerun automatically, or you
may need to resubmit them manually.

Regards,
Sandy

									
--------------------------------------------------------------		
Here is a copy of your Original Request:
	
Email:     [log in to unmask]
Name:      Rafayel Paremuzyan
Username:  rafopar
Staff:	   philpott
Platform:  other,Netscape,537.36 (KHTML, like Gecko) Chrome
Building:  12_2
Room:      F292-9
Hostname:  129.57.113.47
Category:  SCIENTIFIC COMPUTING
Subject:   files are failed to be staged out
Submitted: 2/21/2018 11:05 AM
			
Request:
Hi,

We (hps) started cooking of data yesterday,
and we noticed significant amount of jobs that failed because files were not properly staged out to disk or to tape.
Below are single example job IDs from different kind of failures.

JOBID: 49355427  Failed to transfer file to disk: hps_005563.13_dst_v4.0.2-pre.root, looking into log file, shows that file was produced properly and has non-zero size.
JOBID: 49355585  Failed to transfer file to tape, Status is ' (COPYING)', file has non-zero size
JOBID: 49355583  Failed to transfer file to tape, Status is ' (FAILED)', file has non-zero size
JOBID  49355581  there is no even log file neither in ~/.farmout nor in our logfile directory, the status shows as 'FAILED (No job status in batch system and we never recorded a finish.)'

The following link will bring you to a list of open CCPR's

http://mis.jlab.org/mis/ccpr/ccpr_user/ccprframe_user.html
	


########################################################################
Use REPLY-ALL to reply to list

To unsubscribe from the HPS-SOFTWARE list, click the following link:
https://listserv.slac.stanford.edu/cgi-bin/wa?SUBED1=HPS-SOFTWARE&A=1