Print

Print


On 9/25/19 6:30 PM, Michal Kamil Simon wrote:
> Hi Adrian,
Hi!

> I don't think the segv is due to the fact you haven't added any jobs,
> at least I cannot reproduce it.
> 
> Could you provide a small reproducer and/or a backtrace from your
> coredump?
yeah, i protected againt empty copyjob but i still have the segfault..

i have no idea how to provide the reproducer, it is a copy jobs that 
runs from around the line
https://github.com/adriansev/jalien_py/blob/master/xjalienfs/alien.py#L605

i download a long list of files (1534) and after a bunch of messages ~ 
1484 (from a total of 1534) from here
https://github.com/adriansev/jalien_py/blob/master/xjalienfs/alien.py#L431

i get this "Segmentation fault (core dumped)"

but i do not find the core file, i will search why it is disabled 
(probably in systemd)

in meantime, i get this in gdb :

Thread 1 "python3" received signal SIGSEGV, Segmentation fault.
0x00007ffff54b1bdf in XrdCl::JobManager::StopWorkers 
(this=this@entry=0x7fffffff7180, n=4294967295) at 
/usr/include/c++/9/bits/stl_vector.h:1040
1040          operator[](size_type __n) _GLIBCXX_NOEXCEPT

(gdb) py-bt
Traceback (most recent call first):
   File 
"/home/adrian/.local/lib/python3.7/site-packages/XRootD/client/copyprocess.py", 
line 145, in run
     status, results = self.__process.run(ProgressHandlerWrapper(handler))
   File "/home/adrian/bin/alien.py", line 614, in XrdCopy
     process.run(handler)
   File "/home/adrian/bin/alien.py", line 466, in ProcessXrootdCp
     token_list_upload_ok = XrdCopy(url_list_src, url_list_dst, 
isDownload, my_cp_args)
   File "/home/adrian/bin/alien.py", line 1071, in ProcessInput
     await ProcessXrootdCp(websocket, args)
   File "/home/adrian/bin/alien.py", line 1138, in JAlien
     for token in cmds_tokens: await ProcessInput(websocket, token, None)
   <built-in method run of Context object at remote 0x7ffff606e780>
   File "/usr/lib64/python3.7/asyncio/events.py", line 88, in _run
     self._context.run(self._callback, *self._args)
   File "/usr/lib64/python3.7/asyncio/base_events.py", line 1771, in 
_run_once
     handle._run()
   File "/usr/lib64/python3.7/asyncio/base_events.py", line 534, in 
run_forever
     self._run_once()
   File "/usr/lib64/python3.7/asyncio/base_events.py", line 566, in 
run_until_complete
     self.run_forever()
   File "/home/adrian/bin/alien.py", line 1221, in main
     asyncio.get_event_loop().run_until_complete(JAlien(cmd_string))
   File "/home/adrian/bin/alien.py", line 1225, in <module>
     main()


Thanks!!
Adrian

> 
> Cheers,
> Michal
> ________________________________________
> From: Adrian Sevcenco
> Sent: 25 September 2019 09:39
> To: Michal Kamil Simon; xrootd-l
> Subject: python :: segfault in copyprocess.py
> 
> Hi Michal! I have a strange occurrence of
> Fatal Python error: Segmentation fault
> 
> Thread 0x00007fbab90a4700 (most recent call first):
>     File "/usr/lib64/python3.7/concurrent/futures/thread.py", line 78 in
> _worker
>     File "/usr/lib64/python3.7/threading.py", line 870 in run
>     File "/usr/lib64/python3.7/threading.py", line 926 in _bootstrap_inner
>     File "/usr/lib64/python3.7/threading.py", line 890 in _bootstrap
> 
> Current thread 0x00007fbabaa25740 (most recent call first):
>     File
> "/home/adrian/.local/lib/python3.7/site-packages/XRootD/client/copyprocess.py",
> line 145 in run
>     File "/home/adrian/bin/alien.py", line 594 in XrdCopy
>     File "/home/adrian/bin/alien.py", line 451 in ProcessXrootdCp
>     File "/home/adrian/bin/alien.py", line 1051 in ProcessInput
>     File "/home/adrian/bin/alien.py", line 1118 in JAlien
>     File "/usr/lib64/python3.7/asyncio/events.py", line 88 in _run
>     File "/usr/lib64/python3.7/asyncio/base_events.py", line 1771 in
> _run_once
>     File "/usr/lib64/python3.7/asyncio/base_events.py", line 534 in
> run_forever
>     File "/usr/lib64/python3.7/asyncio/base_events.py", line 566 in
> run_until_complete
>     File "/home/adrian/bin/alien.py", line 1199 in main
>     File "/home/adrian/bin/alien.py", line 1203 in <module>
> Segmentation fault (core dumped)
> 
> i believe that this happens because the i do run without any job added
> ... of course i could protect the case of no jobs added but maybe can
> upstream check if copy jobs are in queue and if not then give a message
> that "No copy jobs to run" or something?
> 
> Thanks!!
> Adrian
> 
> 
> 


-- 
----------------------------------------------
Adrian Sevcenco, Ph.D.                       |
Institute of Space Science - ISS, Romania    |
adrian.sevcenco at {cern.ch,spacescience.ro} |
----------------------------------------------


########################################################################
Use REPLY-ALL to reply to list

To unsubscribe from the XROOTD-L list, click the following link:
https://listserv.slac.stanford.edu/cgi-bin/wa?SUBED1=XROOTD-L&A=1