Print

Print


Good guess. I think found the problem:

"file:/mss"...




From: McCormick, Jeremy I. <[log in to unmask]>
Sent: Wednesday, June 28, 2017 9:42:33 PM
To: Nathan Baltzell
Cc: Bradley T Yale; hps-software
Subject: Re: Conditions system frozen on the farm
 
The ‘0 events’ issue looks like what can happen when you accidentally try to read a stub MSS file instead of the real file but that’s just a guess….

> On Jun 28, 2017, at 6:13 PM, Nathan Baltzell <[log in to unmask]> wrote:
>
> Yeah, conditions "freezing" is usually normal I thought.  (Although probably worthwhile to compare to your previous successfull logs).  Do you have a batch farm script replicating the problem that is ready to run so that others could test it?
>
>
>
> On Jun 28, 2017, at 8:50 PM, "McCormick, Jeremy I." <[log in to unmask]> wrote:
>
>> That just means the detector and run number are locked for the run so it isn’t a bug.
>>
>> But I don’t know why its only processing 0 events.  Some other issue maybe.
>>
>>> On Jun 28, 2017, at 5:02 PM, Bradley T Yale <[log in to unmask]> wrote:
>>>
>>> Performing readout with the latest jar, the conditions system appears to be frozen, but only when submitting jobs to the farm:
>>>
>>> Command:
>>> /usr/bin/time /apps/scicomp/java/jdk1.7/bin/java -DdisableSvtAlignmentConstants -XX:+UseSerialGC -Xmx500m -jar /u/group/hps/hps_soft/git/hps-java/distribution/target/hps-distribution-3.11-SNAPSHOT-bin.jar /u/group/hps/production/mc/EngRun2015Scripts/Prelim2018Singles0_4pt4.lcsim -i /cache/mss/hallb/hps/production/rotationFix/slic/wab-beam-tri_WAB50MeV/4pt4/wabv3SF_E50MeV-egsv5-triv2MG5_ESum2GeV-noHad_HPS-Proposal2017-Nominal-v2-4pt4-fieldmap_1628.slcio -DoutputFile=out -Ddetector=HPS-Proposal2017-Nominal-v2-4pt4-fieldmap -Drun=1000000
>>>
>>> 2017-06-28 17:43:53 [INFO] org.hps.detector.svt.SvtDetectorSetup loadDefault :: loading default SVT conditions onto subdetector Tracker
>>> 2017-06-28 17:43:53 [INFO] org.hps.detector.svt.SvtDetectorSetup loadDefault :: setting up 44 SVT sensors
>>> 2017-06-28 17:43:53 [INFO] org.hps.detector.svt.SvtDetectorSetup loadDefault :: channel map has 25088 entries
>>> 2017-06-28 17:43:54 [INFO] org.hps.conditions.database.DatabaseConditionsManager initialize :: conditions system initialized successfully
>>> 2017-06-28 17:43:54 [CONFIG] org.hps.conditions.database.DatabaseConditionsManager freeze :: conditions system is frozen
>>> 2017-06-28 17:43:54 [INFO] org.lcsim.job.JobControlManager run :: Job processed 0 events.
>>>
>>> I have tried resubmitting the jobs several times today with the same result, so probably not a hiccup. Also, it works fine interactively on a centos7 ifarm with the same command. What could be causing this?
>>>
>>>
>>> Use REPLY-ALL to reply to list
>>> To unsubscribe from the HPS-SOFTWARE list, click the following link:
>>> https://listserv.slac.stanford.edu/cgi-bin/wa?SUBED1=HPS-SOFTWARE&A=1
>>
>>
>> Use REPLY-ALL to reply to list
>>
>> To unsubscribe from the HPS-SOFTWARE list, click the following link:
>> https://listserv.slac.stanford.edu/cgi-bin/wa?SUBED1=HPS-SOFTWARE&A=1
>>
>



Use REPLY-ALL to reply to list

To unsubscribe from the HPS-SOFTWARE list, click the following link:
https://listserv.slac.stanford.edu/cgi-bin/wa?SUBED1=HPS-SOFTWARE&A=1