Print

Print


On 10/21/19 4:51 AM, Andrew Hanushevsky wrote:
> Oh yes, the reason you get a core file every 30 seconds is because that 
> is the restart time when the program fails. You should see the message
> "Restarting monitor:" in the log as well.
well, being the big logfile i just grepped for Meter ..
but i found the problem and, as you said, the "metering" script of 
cms.perf had bad ownership and no execution rights for others.

i was wondering, is it possible to have an "internal" version of 
cms.perf with the reporting code being included in the actual binary?
(to be used like cms.perf pgm internal) or as fallback if pgm is invalid?

and a better error handling for the server? (it should not break just 
because an improper cms.perf declaration)

Thank you!
Adrian


> 
> On Sun, 20 Oct 2019, Adrian Sevcenco wrote:
> 
>> Hi! I have a strange situation on a server (from an ALICE colleague) 
>> that runs xrootd-server-4.9.1 from epel and have a cored dump each 30s 
>> with the below content
>> any idea what is going on?
>>
>> Thank you!
>> Adrian
>>
>> Reading symbols from /usr/bin/cmsd...Reading symbols from 
>> /usr/lib/debug/usr/bin/cmsd.debug...done.
>> done.
>> [New LWP 1557]
>> [Thread debugging using libthread_db enabled]
>> Using host libthread_db library "/lib64/libthread_db.so.1".
>> Core was generated by `/usr/bin/cmsd -b -n server -l 
>> /home/xrootd/alicexrdrun/logs/cmslog -s /home/xro'.
>> Program terminated with signal 6, Aborted.
>> #0  0x00007ff528cfa2c7 in __GI_raise (sig=sig@entry=6) at 
>> ../nptl/sysdeps/unix/sysv/linux/raise.c:55
>> 55        return INLINE_SYSCALL (tgkill, 3, pid, selftid, sig);
>> Missing separate debuginfos, use: debuginfo-install 
>> bzip2-libs-1.0.6-13.el7.x86_64 elfutils-libelf-0.172-2.el7.x86_64 
>> elfutils-libs-0.172-2.el7.x86_64 libattr-2.4.46-13.el7.x86_64 
>> libcap-2.22-9.el7.x86_64 libgcrypt-1.5.3-14.el7.x86_64 
>> libgpg-error-1.12-3.el7.x86_64 libselinux-2.5-14.1.el7.x86_64 
>> lz4-1.7.5-2.el7.x86_64 pcre-8.32-17.el7.x86_64 
>> systemd-libs-219-62.el7_6.9.x86_64 
>> xrootd-aggregatingname2name-1.0.1-1.el7.centos.x86_64 
>> xz-libs-5.2.2-1.el7.x86_64 zlib-1.2.7-18.el7.x86_64
>> (gdb) bt
>> #0  0x00007ff528cfa2c7 in __GI_raise (sig=sig@entry=6) at 
>> ../nptl/sysdeps/unix/sysv/linux/raise.c:55
>> #1  0x00007ff528cfb9b8 in __GI_abort () at abort.c:90
>> #2  0x00007ff5296097d5 in __gnu_cxx::__verbose_terminate_handler () at 
>> ../../../../libstdc++-v3/libsupc++/vterminate.cc:95
>> #3  0x00007ff529607746 in __cxxabiv1::__terminate (handler=<optimized 
>> out>) at ../../../../libstdc++-v3/libsupc++/eh_terminate.cc:38
>> #4  0x00007ff529607773 in std::terminate () at 
>> ../../../../libstdc++-v3/libsupc++/eh_terminate.cc:48
>> #5  0x00007ff529607371 in __cxxabiv1::__gxx_personality_v0 
>> (version=<optimized out>, actions=<optimized out>, 
>> exception_class=<optimized out>, ue_header=<optimized out>, 
>> context=<optimized out>)
>>    at ../../../../libstdc++-v3/libsupc++/eh_personality.cc:670
>> #6  0x00007ff5290a09b4 in _Unwind_ForcedUnwind_Phase2 
>> (exc=exc@entry=0x7ff5253a1d70, context=context@entry=0x7ff5253a0870) 
>> at ../../../libgcc/unwind.inc:175
>> #7  0x00007ff5290a0d04 in _Unwind_ForcedUnwind (exc=0x7ff5253a1d70, 
>> stop=stop@entry=0x7ff529ac6180 <unwind_stop>, stop_argument=<optimized 
>> out>) at ../../../libgcc/unwind.inc:207
>> #8  0x00007ff529ac6302 in __GI___pthread_unwind (buf=<optimized out>) 
>> at unwind.c:129
>> #9  0x00007ff529ac6435 in __pthread_enable_asynccancel () at 
>> ../nptl/sysdeps/unix/sysv/linux/x86_64/cancellation.S:80
>> #10 0x00007ff529ac6752 in close () at 
>> ../sysdeps/unix/syscall-template.S:81
>> #11 0x00007ff529d1ed6c in XrdOucStream::Close 
>> (this=this@entry=0x6519a0 <XrdCms::Meter>, hold=hold@entry=0) at 
>> /usr/src/debug/xrootd-4.9.1/src/XrdOuc/XrdOucStream.cc:295
>> #12 0x000000000042df58 in ~XrdOucStream (this=0x6519a0 
>> <XrdCms::Meter>, __in_chrg=<optimized out>) at 
>> /usr/src/debug/xrootd-4.9.1/src/XrdOuc/XrdOucStream.hh:56
>> #13 XrdCmsMeter::~XrdCmsMeter (this=0x6519a0 <XrdCms::Meter>, 
>> __in_chrg=<optimized out>) at 
>> /usr/src/debug/xrootd-4.9.1/src/XrdCms/XrdCmsMeter.cc:108
>> #14 0x00007ff528cfdc29 in __run_exit_handlers (status=255, 
>> listp=0x7ff52908a6c8 <__exit_funcs>, 
>> run_list_atexit=run_list_atexit@entry=true) at exit.c:77
>> #15 0x00007ff528cfdc77 in __GI_exit (status=<optimized out>) at exit.c:99
>> #16 0x00007ff529d1f282 in XrdOucStream::Exec (this=this@entry=0x6519a0 
>> <XrdCms::Meter>, parm=0x7ff5253a0ad0, inrd=inrd@entry=0, 
>> efd=efd@entry=0) at 
>> /usr/src/debug/xrootd-4.9.1/src/XrdOuc/XrdOucStream.cc:498
>> #17 0x00007ff529d1f6f8 in XrdOucStream::Exec (this=this@entry=0x6519a0 
>> <XrdCms::Meter>, theCmd=0x7372e0 "@9q", inrd=inrd@entry=0, 
>> efd=efd@entry=0) at 
>> /usr/src/debug/xrootd-4.9.1/src/XrdOuc/XrdOucStream.cc:396
>> #18 0x000000000042e43c in XrdCmsMeter::Run (this=0x6519a0 
>> <XrdCms::Meter>) at 
>> /usr/src/debug/xrootd-4.9.1/src/XrdCms/XrdCmsMeter.cc:330
>> #19 0x000000000042e5a9 in XrdCmsMeterRun (carg=<optimized out>) at 
>> /usr/src/debug/xrootd-4.9.1/src/XrdCms/XrdCmsMeter.cc:65
>> #20 0x00007ff529d08617 in XrdSysThread_Xeq (myargs=0x738000) at 
>> /usr/src/debug/xrootd-4.9.1/src/XrdSys/XrdSysPthread.cc:86
>> #21 0x00007ff529abfdd5 in start_thread (arg=0x7ff5253a1700) at 
>> pthread_create.c:307
>> #22 0x00007ff528dc202d in clone () at 
>> ../sysdeps/unix/sysv/linux/x86_64/clone.S:111
>>
>>
>>
>>
>> ########################################################################
>> Use REPLY-ALL to reply to list
>>
>> To unsubscribe from the XROOTD-L list, click the following link:
>> https://listserv.slac.stanford.edu/cgi-bin/wa?SUBED1=XROOTD-L&A=1
>>
> 
> ########################################################################
> Use REPLY-ALL to reply to list
> 
> To unsubscribe from the XROOTD-L list, click the following link:
> https://listserv.slac.stanford.edu/cgi-bin/wa?SUBED1=XROOTD-L&A=1


-- 
----------------------------------------------
Adrian Sevcenco, Ph.D.                       |
Institute of Space Science - ISS, Romania    |
adrian.sevcenco at {cern.ch,spacescience.ro} |
----------------------------------------------


########################################################################
Use REPLY-ALL to reply to list

To unsubscribe from the XROOTD-L list, click the following link:
https://listserv.slac.stanford.edu/cgi-bin/wa?SUBED1=XROOTD-L&A=1