ifarm can access the conditions DB at hpsdb.jlab.org. We have not had 
problems running recon jobs inside the farm firewall.
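
For anyone who wants to confirm this from a given node, here is a minimal sketch of a reachability check. It only tries to open a TCP connection and reports whether that worked. The helper name can_connect is hypothetical, and the port is an assumption (3306 is the MySQL default; the actual port is not stated in this thread). The trace quoted below times out inside an HTTP download (HttpURLConnection under ConditionsReader.downloadDetectorDescription), so the same check can be pointed at whichever host and port the job is actually trying to reach.

```python
import socket


def can_connect(host, port, timeout=5.0):
    """Return True if a TCP connection to (host, port) opens within `timeout` seconds.

    This only tests network reachability; it does not log in to anything
    or check that the service behind the port is healthy.
    """
    try:
        # create_connection resolves the host name and connects;
        # the `with` block closes the socket again immediately.
        with socket.create_connection((host, port), timeout=timeout):
            return True
    except OSError:
        # Covers refused connections, timeouts, and DNS lookup failures.
        return False
```

Calling can_connect("hpsdb.jlab.org", 3306) from ifarm would be one way to use it, with the port adjusted if the assumption above is wrong.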

Omar suggested rerunning because we have sometimes seen a job fail at
random when the connection fails, and the error message we get from that
is similar to the one you posted. That problem has nothing to do with the
data being run. If you consistently see that your old files work and your
new files fail, there is some other problem and we can keep looking for it.
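
If the random connection failures become a nuisance, the rerun can be automated. The following is a rough sketch, not something we use in production. It assumes the recon job exits with a nonzero status when it dies with an exception like the one quoted below (I have not verified this for the hps-distribution jar). The function name run_with_retries and its parameters are hypothetical.

```python
import subprocess
import time


def run_with_retries(cmd, attempts=3, delay=30.0,
                     runner=subprocess.call, sleep=time.sleep):
    """Run `cmd` until it exits with status 0 or `attempts` runs have been made.

    `cmd` is an argument list as accepted by subprocess.call.
    `attempts` must be at least 1. Returns the exit status of the last run.
    `runner` and `sleep` are injectable so the logic can be tested
    without launching a real job or actually waiting.
    """
    status = None
    for attempt in range(1, attempts + 1):
        status = runner(cmd)
        if status == 0:
            break  # job succeeded, no need to retry
        if attempt < attempts:
            sleep(delay)  # give a flaky connection time to recover
    return status
```

Here cmd would be the java -jar invocation from the original message split into a list, with ~ expanded by hand since no shell is involved. If the job still fails after every attempt, the failure is probably not a random connection problem and is worth reporting.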

On Mon, 15 Jun 2015, Omar Moreno wrote:

> Hi Sebouh,
>
> I believe Jeremy has instructions as to how to do this somewhere on
> confluence.
>
> Jeremy, can you point Sebouh to these instructions?  Thanks.
>
> --Omar Moreno
>
> On Mon, Jun 15, 2015 at 2:22 PM, Sebouh Paul <[log in to unmask]> wrote:
>
>> I am attempting to run this on ifarm1101 at JLab, which cannot access
>> things outside of JLab.  I would like to see if there is a purely offline
>> solution to this problem.
>>
>> On Mon, Jun 15, 2015 at 5:20 PM, Omar Moreno <[log in to unmask]> wrote:
>>
>>> Even though SimpleMCRecon may be outdated, the error has to do with
>>> conditions for the detector HPS-Proposal2014-v8-4pt4 not being found.
>>>
>>> Looking at the error, it looks like a connection to the DB wasn't
>>> established.  Sebouh, if you try rerunning again, does the error persist?
>>>
>>> --Omar Moreno
>>>
>>> On Mon, Jun 15, 2015 at 2:11 PM, McCormick, Jeremy I. <
>>> [log in to unmask]> wrote:
>>>
>>>> It is possible that SimpleMCRecon.lcsim has become outdated and no
>>>> longer works (in which case we should update it).
>>>>
>>>> I have run the MC ECal recon lately in a test case, and I think it is
>>>> working fine.
>>>>
>>>> Not sure about MC tracking jobs but assume it is working okay...
>>>>
>>>> -----Original Message-----
>>>> From: [log in to unmask] [mailto:
>>>> [log in to unmask]] On Behalf Of Sebouh Paul
>>>> Sent: Monday, June 15, 2015 1:50 PM
>>>> To: hps-software
>>>> Subject: recon not working
>>>>
>>>> I am attempting to reconstruct some Monte Carlo data.  If I reconstruct
>>>> some of my old data files, it works fine.  When I attempt to reconstruct
>>>> some newer files the exact same way, it doesn't work.
>>>>
>>>>
>>>> For this one it works:
>>>>
>>>>
>>>>
>>>> ifarm1101> java -Xmx1500m -jar
>>>> ~/hps-distribution-3.0.3-20141018.011336-132-bin.jar ~/SimpleMCRecon.lcsim
>>>> -i /work/hallb/hps/sebouh/trident_slic/radmuon_12.lcio.slcio
>>>> -DoutputFile=/work/hallb/hps/sebouh/test -DrunNumber=100
>>>>
>>>> --- Drivers ---
>>>>
>>>> org.lcsim.job.EventMarkerDriver
>>>>
>>>>     eventInterval = 10
>>>>
>>>> org.hps.conditions.deprecated.CalibrationDriver
>>>>
>>>> org.hps.recon.tracking.SimpleTrackerDigiDriver
>>>>
>>>>     debug = false
>>>>
>>>> org.hps.recon.tracking.HelicalTrackHitDriver
>>>>
>>>>     debug = false
>>>>
>>>>     maxSeperation = 20.0
>>>>
>>>>     tolerance = 1.0
>>>>
>>>> org.hps.recon.tracking.TrackerReconDriver
>>>>
>>>>     debug = false
>>>>
>>>>     strategyResource = /HPS-Full-3lay.xml
>>>>
>>>> org.hps.recon.ecal.EcalClusterICBasic
>>>>
>>>>     ecalName = Ecal
>>>>
>>>>     ecalCollectionName = EcalHits
>>>>
>>>>     timeCut = false
>>>>
>>>> org.hps.recon.particle.HpsReconParticleDriver
>>>>
>>>> org.lcsim.util.loop.LCIODriver
>>>>
>>>>     outputFilePath = /work/hallb/hps/sebouh/test
>>>>
>>>> org.lcsim.recon.tracking.digitization.sisim.config.ReadoutCleanupDriver
>>>>
>>>> --- End Drivers ---
>>>>
>>>> No input files in XML file.
>>>>
>>>> Adding SimTrackerHitIdentifierReadoutDriver with readouts: [TrackerHits]
>>>>
>>>> Got ConditionsEvent with run: 0
>>>>
>>>> Reading calibrations calibSVT/base for run: 0
>>>>
>>>> Use this calibration from run -1: calibSVT/default.base
>>>>
>>>> Reading calibrations calibSVT/tp for run: 0
>>>>
>>>> Use this calibration from run -1: calibSVT/default.tp
>>>>
>>>> Loading the SVT bad channels for run 0
>>>>
>>>> File daqmap/svt0.badchannels was not found! Continuing with only QA bad
>>>> channels
>>>>
>>>> Loading SVT gains ...
>>>>
>>>> Loading SVT t0 shifts ...
>>>>
>>>> Loading fieldmap for run 0
>>>>
>>>> reading ECal DAQ map
>>>>
>>>> reading ECal bad channels
>>>>
>>>> reading pedestals for ECal
>>>>
>>>> reading pedestals for ECal
>>>>
>>>>>> Event 0
>>>>
>>>>>> Event 10
>>>>
>>>>>> Event 20
>>>>
>>>>>> Event 30
>>>>
>>>>>> Event 40
>>>>
>>>>
>>>> And for this one it doesn't:
>>>>
>>>> ifarm1101> java -Xmx1500m -jar
>>>> ~/hps-distribution-3.0.3-20141018.011336-132-bin.jar ~/SimpleMCRecon.lcsim
>>>> -i /work/hallb/hps/sebouh/trident_slic6p6/radmuon_4.slcio
>>>> -DoutputFile=/work/hallb/hps/sebouh/test -DrunNumber=100
>>>>
>>>> --- Drivers ---
>>>>
>>>> org.lcsim.job.EventMarkerDriver
>>>>
>>>>     eventInterval = 10
>>>>
>>>> org.hps.conditions.deprecated.CalibrationDriver
>>>>
>>>> org.hps.recon.tracking.SimpleTrackerDigiDriver
>>>>
>>>>     debug = false
>>>>
>>>> org.hps.recon.tracking.HelicalTrackHitDriver
>>>>
>>>>     debug = false
>>>>
>>>>     maxSeperation = 20.0
>>>>
>>>>     tolerance = 1.0
>>>>
>>>> org.hps.recon.tracking.TrackerReconDriver
>>>>
>>>>     debug = false
>>>>
>>>>     strategyResource = /HPS-Full-3lay.xml
>>>>
>>>> org.hps.recon.ecal.EcalClusterICBasic
>>>>
>>>>     ecalName = Ecal
>>>>
>>>>     ecalCollectionName = EcalHits
>>>>
>>>>     timeCut = false
>>>>
>>>> org.hps.recon.particle.HpsReconParticleDriver
>>>>
>>>> org.lcsim.util.loop.LCIODriver
>>>>
>>>>     outputFilePath = /work/hallb/hps/sebouh/test
>>>>
>>>> org.lcsim.recon.tracking.digitization.sisim.config.ReadoutCleanupDriver
>>>>
>>>> --- End Drivers ---
>>>>
>>>> No input files in XML file.
>>>>
>>>> Adding SimTrackerHitIdentifierReadoutDriver with readouts: [TrackerHits]
>>>>
>>>> java.lang.RuntimeException:
>>>> org.lcsim.conditions.ConditionsManager$ConditionsNotFoundException:
>>>> Conditions not found for detector HPS-Proposal2014-v8-4pt4
>>>>
>>>> at org.lcsim.event.base.BaseLCSimEvent.<init>(BaseLCSimEvent.java:54)
>>>>
>>>> at org.lcsim.lcio.LCIOEvent.<init>(LCIOEvent.java:62)
>>>>
>>>> at org.lcsim.lcio.LCIOEvent.<init>(LCIOEvent.java:25)
>>>>
>>>> at org.lcsim.lcio.LCIOReader.read(LCIOReader.java:59)
>>>>
>>>> at org.lcsim.util.loop.LCIOEventSource.next(LCIOEventSource.java:129)
>>>>
>>>> at
>>>> org.freehep.record.loop.DefaultRecordLoop.fetchRecord(DefaultRecordLoop.java:809)
>>>>
>>>> at
>>>> org.freehep.record.loop.DefaultRecordLoop.loop(DefaultRecordLoop.java:648)
>>>>
>>>> at
>>>> org.freehep.record.loop.DefaultRecordLoop.execute(DefaultRecordLoop.java:566)
>>>>
>>>> at org.lcsim.util.loop.LCSimLoop.loop(LCSimLoop.java:151)
>>>>
>>>> at org.lcsim.job.JobControlManager.run(JobControlManager.java:418)
>>>>
>>>> at org.lcsim.job.JobControlManager.main(JobControlManager.java:180)
>>>>
>>>> Caused by:
>>>> org.lcsim.conditions.ConditionsManager$ConditionsNotFoundException:
>>>> Conditions not found for detector HPS-Proposal2014-v8-4pt4
>>>>
>>>> at
>>>> org.lcsim.conditions.ConditionsReader.create(ConditionsReader.java:203)
>>>>
>>>> at
>>>> org.lcsim.conditions.ConditionsReader.create(ConditionsReader.java:214)
>>>>
>>>> at
>>>> org.lcsim.conditions.ConditionsManagerImplementation.setDetector(ConditionsManagerImplementation.java:41)
>>>>
>>>> at org.lcsim.event.base.BaseLCSimEvent.<init>(BaseLCSimEvent.java:52)
>>>>
>>>> ... 10 more
>>>>
>>>> Caused by: java.net.SocketTimeoutException: connect timed out
>>>>
>>>> at sun.reflect.NativeConstructorAccessorImpl.newInstance0(Native Method)
>>>>
>>>> at
>>>> sun.reflect.NativeConstructorAccessorImpl.newInstance(NativeConstructorAccessorImpl.java:57)
>>>>
>>>> at
>>>> sun.reflect.DelegatingConstructorAccessorImpl.newInstance(DelegatingConstructorAccessorImpl.java:45)
>>>>
>>>> at java.lang.reflect.Constructor.newInstance(Constructor.java:525)
>>>>
>>>> at
>>>> sun.net.www.protocol.http.HttpURLConnection$6.run(HttpURLConnection.java:1664)
>>>>
>>>> at
>>>> sun.net.www.protocol.http.HttpURLConnection$6.run(HttpURLConnection.java:1662)
>>>>
>>>> at java.security.AccessController.doPrivileged(Native Method)
>>>>
>>>> at
>>>> sun.net.www.protocol.http.HttpURLConnection.getChainedException(HttpURLConnection.java:1660)
>>>>
>>>> at
>>>> sun.net.www.protocol.http.HttpURLConnection.getInputStream(HttpURLConnection.java:1243)
>>>>
>>>> at org.lcsim.util.cache.FileCache.getCachedFile(FileCache.java:95)
>>>>
>>>> at
>>>> org.lcsim.conditions.ConditionsReader.downloadDetectorDescription(ConditionsReader.java:268)
>>>>
>>>> at
>>>> org.lcsim.conditions.ConditionsReader.create(ConditionsReader.java:194)
>>>>
>>>> ... 13 more
>>>>
>>>> Caused by: java.net.SocketTimeoutException: connect timed out
>>>>
>>>> at java.net.PlainSocketImpl.socketConnect(Native Method)
>>>>
>>>> at
>>>> java.net.AbstractPlainSocketImpl.doConnect(AbstractPlainSocketImpl.java:339)
>>>>
>>>> at
>>>> java.net.AbstractPlainSocketImpl.connectToAddress(AbstractPlainSocketImpl.java:200)
>>>>
>>>> at
>>>> java.net.AbstractPlainSocketImpl.connect(AbstractPlainSocketImpl.java:182)
>>>>
>>>> at java.net.SocksSocketImpl.connect(SocksSocketImpl.java:391)
>>>>
>>>> at java.net.Socket.connect(Socket.java:579)
>>>>
>>>> at sun.net.NetworkClient.doConnect(NetworkClient.java:175)
>>>>
>>>> at sun.net.www.http.HttpClient.openServer(HttpClient.java:378)
>>>>
>>>> at sun.net.www.http.HttpClient.openServer(HttpClient.java:473)
>>>>
>>>> at sun.net.www.http.HttpClient.<init>(HttpClient.java:203)
>>>>
>>>> at sun.net.www.http.HttpClient.New(HttpClient.java:290)
>>>>
>>>> at sun.net.www.http.HttpClient.New(HttpClient.java:306)
>>>>
>>>> at
>>>> sun.net.www.protocol.http.HttpURLConnection.getNewHttpClient(HttpURLConnection.java:995)
>>>>
>>>> at
>>>> sun.net.www.protocol.http.HttpURLConnection.plainConnect(HttpURLConnection.java:931)
>>>>
>>>> at
>>>> sun.net.www.protocol.http.HttpURLConnection.connect(HttpURLConnection.java:849)
>>>>
>>>> at
>>>> sun.net.www.protocol.http.HttpURLConnection.getInputStream(HttpURLConnection.java:1299)
>>>>
>>>> at
>>>> sun.net.www.protocol.http.HttpURLConnection.getHeaderField(HttpURLConnection.java:2660)
>>>>
>>>> at java.net.URLConnection.getHeaderFieldLong(URLConnection.java:639)
>>>>
>>>> at java.net.URLConnection.getContentLengthLong(URLConnection.java:511)
>>>>
>>>> at java.net.URLConnection.getContentLength(URLConnection.java:495)
>>>>
>>>> at org.lcsim.util.cache.FileCache.getCachedFile(FileCache.java:94)
>>>>
>>>> ... 15 more
>>>>
>>>>
>>>> The only difference is which input files I am using.  What am I doing
>>>> wrong?
>>>>
>>>>
>>>> ________________________________
>>>>
>>>> Use REPLY-ALL to reply to list
>>>>
>>>> To unsubscribe from the HPS-SOFTWARE list, click the following link:
>>>> https://listserv.slac.stanford.edu/cgi-bin/wa?SUBED1=HPS-SOFTWARE&A=1
>>>>
>>>>
>>>>
>>>
>>>
>>
>
>