On 9/25/19 6:30 PM, Michal Kamil Simon wrote:
> Hi Adrian,
Hi!
> I don't think the segv is due to the fact you haven't added any jobs,
> at least I cannot reproduce it.
>
> Could you provide a small reproducer and/or a backtrace from your
> coredump?
yeah, i protected againt empty copyjob but i still have the segfault..
i have no idea how to provide the reproducer, it is a copy jobs that
runs from around the line
https://github.com/adriansev/jalien_py/blob/master/xjalienfs/alien.py#L605
i download a long list of files (1534) and after a bunch of messages ~
1484 (from a total of 1534) from here
https://github.com/adriansev/jalien_py/blob/master/xjalienfs/alien.py#L431
i get this "Segmentation fault (core dumped)"
but i do not find the core file, i will search why it is disabled
(probably in systemd)
in meantime, i get this in gdb :
Thread 1 "python3" received signal SIGSEGV, Segmentation fault.
0x00007ffff54b1bdf in XrdCl::JobManager::StopWorkers
(this=this@entry=0x7fffffff7180, n=4294967295) at
/usr/include/c++/9/bits/stl_vector.h:1040
1040 operator[](size_type __n) _GLIBCXX_NOEXCEPT
(gdb) py-bt
Traceback (most recent call first):
File
"/home/adrian/.local/lib/python3.7/site-packages/XRootD/client/copyprocess.py",
line 145, in run
status, results = self.__process.run(ProgressHandlerWrapper(handler))
File "/home/adrian/bin/alien.py", line 614, in XrdCopy
process.run(handler)
File "/home/adrian/bin/alien.py", line 466, in ProcessXrootdCp
token_list_upload_ok = XrdCopy(url_list_src, url_list_dst,
isDownload, my_cp_args)
File "/home/adrian/bin/alien.py", line 1071, in ProcessInput
await ProcessXrootdCp(websocket, args)
File "/home/adrian/bin/alien.py", line 1138, in JAlien
for token in cmds_tokens: await ProcessInput(websocket, token, None)
<built-in method run of Context object at remote 0x7ffff606e780>
File "/usr/lib64/python3.7/asyncio/events.py", line 88, in _run
self._context.run(self._callback, *self._args)
File "/usr/lib64/python3.7/asyncio/base_events.py", line 1771, in
_run_once
handle._run()
File "/usr/lib64/python3.7/asyncio/base_events.py", line 534, in
run_forever
self._run_once()
File "/usr/lib64/python3.7/asyncio/base_events.py", line 566, in
run_until_complete
self.run_forever()
File "/home/adrian/bin/alien.py", line 1221, in main
asyncio.get_event_loop().run_until_complete(JAlien(cmd_string))
File "/home/adrian/bin/alien.py", line 1225, in <module>
main()
Thanks!!
Adrian
>
> Cheers,
> Michal
> ________________________________________
> From: Adrian Sevcenco
> Sent: 25 September 2019 09:39
> To: Michal Kamil Simon; xrootd-l
> Subject: python :: segfault in copyprocess.py
>
> Hi Michal! I have a strange occurrence of
> Fatal Python error: Segmentation fault
>
> Thread 0x00007fbab90a4700 (most recent call first):
> File "/usr/lib64/python3.7/concurrent/futures/thread.py", line 78 in
> _worker
> File "/usr/lib64/python3.7/threading.py", line 870 in run
> File "/usr/lib64/python3.7/threading.py", line 926 in _bootstrap_inner
> File "/usr/lib64/python3.7/threading.py", line 890 in _bootstrap
>
> Current thread 0x00007fbabaa25740 (most recent call first):
> File
> "/home/adrian/.local/lib/python3.7/site-packages/XRootD/client/copyprocess.py",
> line 145 in run
> File "/home/adrian/bin/alien.py", line 594 in XrdCopy
> File "/home/adrian/bin/alien.py", line 451 in ProcessXrootdCp
> File "/home/adrian/bin/alien.py", line 1051 in ProcessInput
> File "/home/adrian/bin/alien.py", line 1118 in JAlien
> File "/usr/lib64/python3.7/asyncio/events.py", line 88 in _run
> File "/usr/lib64/python3.7/asyncio/base_events.py", line 1771 in
> _run_once
> File "/usr/lib64/python3.7/asyncio/base_events.py", line 534 in
> run_forever
> File "/usr/lib64/python3.7/asyncio/base_events.py", line 566 in
> run_until_complete
> File "/home/adrian/bin/alien.py", line 1199 in main
> File "/home/adrian/bin/alien.py", line 1203 in <module>
> Segmentation fault (core dumped)
>
> i believe that this happens because the i do run without any job added
> ... of course i could protect the case of no jobs added but maybe can
> upstream check if copy jobs are in queue and if not then give a message
> that "No copy jobs to run" or something?
>
> Thanks!!
> Adrian
>
>
>
--
----------------------------------------------
Adrian Sevcenco, Ph.D. |
Institute of Space Science - ISS, Romania |
adrian.sevcenco at {cern.ch,spacescience.ro} |
----------------------------------------------
########################################################################
Use REPLY-ALL to reply to list
To unsubscribe from the XROOTD-L list, click the following link:
https://listserv.slac.stanford.edu/cgi-bin/wa?SUBED1=XROOTD-L&A=1
|