John
I ran another 24h test, after 23h one xrootd failed. This is
probably related to the cancellation code you are working on:
(gdb) where
#0 0x00007f41023ad5e9 in raise () from /lib64/libc.so.6
#1 0x00007f41023aecf8 in abort () from /lib64/libc.so.6
#2 0x00007f4102cb19b5 in __gnu_cxx::__verbose_terminate_handler() ()
from /lib64/libstdc++.so.6
#3 0x00007f4102caf926 in ?? () from /lib64/libstdc++.so.6
#4 0x00007f4102cae8e9 in ?? () from /lib64/libstdc++.so.6
#5 0x00007f4102caf554 in __gxx_personality_v0 () from /lib64/libstdc++.so.6
#6 0x00007f4102748913 in ?? () from /lib64/libgcc_s.so.1
#7 0x00007f4102748e47 in _Unwind_Resume () from /lib64/libgcc_s.so.1
#8 0x00007f4100c85245 in
lsst::qserv::wdb::ChunkResourceMgr::Impl::release (this=0x1ccfc80,
i=...) at build/wdb/ChunkResource.cc:398
#9 0x00007f4100c83696 in
lsst::qserv::wdb::ChunkResource::~ChunkResource (this=0x7f40f1986b20,
__in_chrg=<optimized out>) at build/wdb/ChunkResource.cc:131
#10 0x00007f4100c91f0b in
lsst::qserv::wdb::QueryAction::Impl::_dispatchChannel
(this=0x7f40c4a8dd80) at build/wdb/QueryAction.cc:392
#11 0x00007f4100c905a7 in lsst::qserv::wdb::QueryAction::Impl::act
(this=0x7f40c4a8dd80) at build/wdb/QueryAction.cc:187
#12 0x00007f4100c93212 in lsst::qserv::wdb::QueryAction::operator()
(this=0x7f40c487db78) at build/wdb/QueryAction.cc:451
#13 0x00007f4100c75f46 in
lsst::qserv::wcontrol::ForemanImpl::Runner::operator()
(this=0x7f40e80d98c0) at build/wcontrol/Foreman.cc:302
#14 0x00007f4100c82cf0 in
std::_Bind_simple<lsst::qserv::wcontrol::ForemanImpl::Runner
()>::_M_invoke<>(std::_Index_tuple<>) (this=0x7f40e80d98c0) at
/usr/include/c++/4.8.2/functional:1732
#15 0x00007f4100c82a8b in
std::_Bind_simple<lsst::qserv::wcontrol::ForemanImpl::Runner
()>::operator()() (this=0x7f40e80d98c0) at
/usr/include/c++/4.8.2/functional:1720
#16 0x00007f4100c8280c in
std::thread::_Impl<std::_Bind_simple<lsst::qserv::wcontrol::ForemanImpl::Runner
()> >::_M_run() (this=0x7f40e80d98a8) at /usr/include/c++/4.8.2/thread:115
tail of the xrootd log file:
0901 20:38:42.045 [0x7f40f1b89700] DEBUG ScanSched
(build/wsched/ScanScheduler.cc:210) - Adding new task: 9358 : SELECT
COUNT(*) AS QS1_COUNT FROM LSST.Object_9358 AS QST_1_ WHERE
y_instFlux>u_instFlux
0901 20:38:42.046 [0x7f40f1b89700] DEBUG GroupSched
(build/wsched/GroupScheduler.cc:139) - _getNextTasks(3)>->->
0901 20:38:42.046 [0x7f40f1b89700] DEBUG GroupSched
(build/wsched/GroupScheduler.cc:154) - _getNextTasks <<<<<
0901 20:38:42.046 [0x7f40f1b89700] DEBUG ScanSched
(build/wsched/ScanScheduler.cc:172) - _getNextTasks(31)>->->
0901 20:38:42.046 [0x7f40f1b89700] DEBUG ScanSched
(build/wsched/ChunkDisk.cc:199) - ChunkDisk busyness: yes
0901 20:38:42.046 [0x7f40f1b89700] DEBUG ScanSched
(build/wsched/ChunkDisk.cc:171) - ChunkDisk getNext: current=
(scan=8879, cached=8571,8772,) candidate=9220
0901 20:38:42.046 [0x7f40f1b89700] DEBUG ScanSched
(build/wsched/ChunkDisk.cc:184) - ChunkDisk denying task
0901 20:38:42.046 [0x7f40f1b89700] DEBUG ScanSched
(build/wsched/ScanScheduler.cc:196) - _getNextTasks <<<<<
0901 20:38:42.046 [0x7f40f1b89700] DEBUG BlendSched
(build/wsched/BlendScheduler.cc:211) - BlendScheduler: no tasks available
0901 20:38:42.046 [0x7f40f1b89700] INFO root
(build/xrdsvc/SsiSession.cc:120) - Enqueued TaskMsg for
Resource(/chk/LSST/9358) in 0.000835 seconds
########################################################################
Use REPLY-ALL to reply to list
To unsubscribe from the QSERV-L list, click the following link:
https://listserv.slac.stanford.edu/cgi-bin/wa?SUBED1=QSERV-L&A=1
|