Print

Print


Hi Matevz,

   What problem are you referring to? The redirect counter issue or the thread 
safety issues? The thread safety is already there in 3.2.2 so if you still see 
the problems then we need to investigate...

Cheers,
   Lukasz

On Friday, August 31, 2012 01:07:56 PM Matevz Tadel wrote:
> Luckily we just noticed this one show up again:
>    https://savannah.cern.ch/bugs/?93794
> 
> Please include the fix!
> 
> Cheers,
> Matevz
> 
> On 08/31/12 08:08, Brian Bockelman wrote:
> > Perfectly fine by me.
> > 
> > So:
> > 1) Detailed monitoring fix.
> > 2) Sendfile monitoring fix.
> > 
> > I can't think of anything else?
> > 
> > Brian
> > 
> > On Aug 31, 2012, at 9:58 AM, Lukasz Janyst <[log in to unmask]
> > 
> > <mailto:[log in to unmask]>> wrote:
> >> Yes, but I won't manage today. Is Monday fine with you?
> >> 
> >> Lukasz
> >> 
> >> On Friday, August 31, 2012 07:34:44 AM Brian Bockelman wrote:
> >> > Can we cut a 3.2.3 patch release with these two fixes?
> >> > 
> >> > Brian
> >> > 
> >> > On Aug 31, 2012, at 12:14 AM, "Yang, Wei" <[log in to unmask]
> >> 
> >> <mailto:[log in to unmask]>> wrote:
> >> > > I tested the second. I didn't get a chance to test the 1st before I
> >> > > lost
> >> > > the window of restarting the cluster. But I have sendfile() turned
> >> > > off
> >> > > and I do get correct results, so it implicitly confirms the 1st one.
> >> > > 
> >> > > regards,
> >> > > Wei Yang | [log in to unmask] <mailto:[log in to unmask]> |
> >> 
> >> 650-926-3338(O)
> >> 
> >> > > On Aug 30, 2012, at 9:29 PM, Wilko Kroeger wrote:
> >> > >> Hello Brian
> >> > >> 
> >> > >> Yes, we also noticed that the detailed monitoring is not working in
> >> > >> v3.2.2. We build a version on top of v3.2.2 adding the two commits:
> >> > >> 
> >> > >> commit e0ad3459c89a163e600070a15936b8fd5d26ff35
> >> > >> Author: Andrew Hanushevsky <[log in to unmask]
> >> > >> <mailto:[log in to unmask]>>
> >> > >> Date: Wed Aug 22 18:56:19 2012 -0700
> >> > >> 
> >> > >> Make sure read statistics are updated for sendfile() and mmap I/O.
> >> > >> 
> >> > >> commit e51db4bb0178a21bbe87ccf7c9349b079c2d7455
> >> > >> Author: Andrew Hanushevsky <[log in to unmask]
> >> > >> <mailto:[log in to unmask]>>
> >> > >> Date: Mon Jul 30 16:52:56 2012 -0700
> >> > >> 
> >> > >> Correct monitor initialization test to start monitor under all
> >> > >> configs.
> >> > >> 
> >> > >> As far as I can tell the detailed monitoring is now working. Wei
> >> > >> might
> >> > >> have done more testing.
> >> > >> 
> >> > >> Cheers,
> >> > >> 
> >> > >> Wilko
> >> > >> 
> >> > >> On Thu, 30 Aug 2012, Brian Bockelman wrote:
> >> > >>> Hi Andy,
> >> > >>> 
> >> > >>> The core wasn't interesting. However, I tracked it down to this
> >> > >>> change
> >> > >>> (line 334 in XrdXrootdConfig.cc):
> >> > >>> 
> >> > >>> if ((!isRedir || (RQList.Next() != 0 &&
> >> > >>> XrdXrootdMonitor::Redirect())))
> >> > >>> 
> >> > >>> became:
> >> > >>> 
> >> > >>> if ((!isRedir || (RQList.Next() != 0)) &&
> >> > >>> XrdXrootdMonitor::Redirect())
> >> > >>> 
> >> > >>> (in 3.2.2). In master, it is this test:
> >> > >>> 
> >> > >>> if (!isRedir || XrdXrootdMonitor::Redirect())
> >> > >>> 
> >> > >>> Note that XrdXrootdMonitor::Redirect always returns 0 (I suspect
> >> > >>> the bug
> >> > >>> is this).
> >> > >>> 
> >> > >>> So, basically, I think detailed monitoring is broken in the 3.2.2
> >> > >>> release. Matevz, take note...
> >> > >>> 
> >> > >>> What's the minimal patch? I can ask OSG to push this out ASAP.
> >> > >>> 
> >> > >>> Brian
> >> > >>> 
> >> > >>> On Aug 28, 2012, at 9:26 PM, Andrew Hanushevsky <[log in to unmask]
> >> 
> >> <mailto:[log in to unmask]>> wrote:
> >> > >>>> Hi Brian,
> >> > >>>> 
> >> > >>>> Best to get a gcore on this one. Seems like the monitoring did not
> >> > >>>> initialize correctly as it's trying to send to fd 0.
> >> > >>>> 
> >> > >>>> Andy
> >> > >>>> 
> >> > >>>> -----Original Message----- From: Brian Bockelman
> >> > >>>> Sent: Tuesday, August 28, 2012 7:15 PM
> >> > >>>> To: <[log in to unmask]
> >> > >>>> <mailto:[log in to unmask]>>
> >> > >>>> Subject: Strange detailed monitoring issue
> >> > >>>> 
> >> > >>>> After a power outage locally, Matevz noticed he is not receiving
> >> > >>>> monitoring messages.
> >> > >>>> 
> >> > >>>> Sure enough, from strace:
> >> > >>>> 
> >> > >>>> [pid 1705] sendto(0,
> >> > >>>> "t8\5\270\0\0\0\0\340\0\0\0\0\0\0\0\0\0\0\0\0\0\0\0\200\0\0\0\302^
> >> > >>>> v#".
> >> > >>>> .., 1464, 0, {sa_family=AF_UNSPEC,
> >> > >>>> sa_data="\0\0\0\0\0\0\0\0\0\0\0\0\0\0"}, 16) = -1 ENOTSOCK (Socket
> >> > >>>> operation on non-socket)
> >> > >>>> 
> >> > >>>> Version info:
> >> > >>>> 
> >> > >>>> [root@red-gridftp3 ~]# rpm -q xrootd-server
> >> > >>>> xrootd-server-3.2.2-1.osg.el5.xu
> >> > >>>> 
> >> > >>>> Log startup is below. Config file snippet is:
> >> > >>>> 
> >> > >>>> xrootd.monitor all auth flush 30s mbuff 1472 window 5s dest files
> >> > >>>> io
> >> > >>>> info user xrootd.t2.ucsd.edu <http://xrootd.t2.ucsd.edu>:9930
> >> 
> >> xrd.report xrootd.t2.ucsd.edu <http://xrootd.t2.ucsd.edu>:9931
> >> 
> >> > >>>> every 30s all sync
> >> > >>>> 
> >> > >>>> Any ideas? We are at a loss as to what might be happening.
> >> > >>>> 
> >> > >>>> Brian
> >> > >>>> 
> >> > >>>> 120828 21:07:13 1663 Scalla is starting. . .
> >> > >>>> Copr. 2010 Stanford University, xrd version v3.2.2
> >> > >>>> ++++++ xrootd [log in to unmask]
> >> 
> >> <mailto:[log in to unmask]> initialization started.
> >> 
> >> > >>>> Config using configuration file /etc/xrootd/xrootd-clustered.cfg
> >> > >>>> =====> xrd.port 1094
> >> > >>>> =====> xrd.trace conn
> >> > >>>> =====> all.adminpath /var/run/xrootd
> >> > >>>> =====> xrd.report xrootd.t2.ucsd.edu
> >> > >>>> <http://xrootd.t2.ucsd.edu>:9931
> >> 
> >> every 30s all sync
> >> 
> >> > >>>> Config maximum number of connections restricted to 65536
> >> > >>>> Copr. 2007 Stanford University, xrootd version 2.9.7 build v3.2.2
> >> > >>>> ++++++ xrootd protocol initialization started.
> >> > >>>> =====> all.export / nostage
> >> > >>>> =====> xrootd.trace emsg login stall redirect
> >> > >>>> =====> xrootd.seclib /usr/lib64/libXrdSec.so
> >> > >>>> Config warning: ignoring fslib; libXrdOfs.so is built-in.
> >> > >>>> =====> xrootd.fslib /usr/lib64/libXrdOfs.so
> >> > >>>> =====> all.pidpath /var/run/xrootd
> >> > >>>> =====> xrootd.monitor all auth flush 30s mbuff 1472 window 5s dest
> >> > >>>> files io info user xrootd.t2.ucsd.edu
> >> > >>>> <http://xrootd.t2.ucsd.edu>:9930
> >> 
> >> Config exporting /
> >> 
> >> > >>>> ++++++ Authentication system initialization started.
> >> > >>>> 120828 21:07:13 1663 secgsi_InitOpts: ***
> >> > >>>> ------------------------------------------------------------ ***
> >> > >>>> 120828 21:07:13 1663 secgsi_InitOpts: Mode: server
> >> > >>>> 120828 21:07:13 1663 secgsi_InitOpts: Debug: -1
> >> > >>>> 120828 21:07:13 1663 secgsi_InitOpts: CA dir:
> >> > >>>> /etc/grid-security/certificates 120828 21:07:13 1663
> >> > >>>> secgsi_InitOpts:
> >> > >>>> CA verification level: 1
> >> > >>>> 120828 21:07:13 1663 secgsi_InitOpts: CRL dir:
> >> > >>>> /etc/grid-security/certificates/ 120828 21:07:13 1663
> >> > >>>> secgsi_InitOpts:
> >> > >>>> CRL extension: .r0
> >> > >>>> 120828 21:07:13 1663 secgsi_InitOpts: CRL check level: 1
> >> > >>>> 120828 21:07:13 1663 secgsi_InitOpts: CRL refresh time: 86400
> >> > >>>> 120828 21:07:13 1663 secgsi_InitOpts: Certificate:
> >> > >>>> /etc/grid-security/xrd/xrdcert.pem 120828 21:07:13 1663
> >> > >>>> secgsi_InitOpts: Key: /etc/grid-security/xrd/xrdkey.pem 120828
> >> > >>>> 21:07:13 1663 secgsi_InitOpts: Proxy delegation option: 0
> >> > >>>> 120828 21:07:13 1663 secgsi_InitOpts: GRIDmap file:
> >> > >>>> /etc/grid-security/grid-mapfile 120828 21:07:13 1663
> >> > >>>> secgsi_InitOpts:
> >> > >>>> GRIDmap option: 10
> >> > >>>> 120828 21:07:13 1663 secgsi_InitOpts: GRIDmap cache entries
> >> > >>>> expiration
> >> > >>>> (secs): 0 120828 21:07:13 1663 secgsi_InitOpts: Authorization
> >> > >>>> function: libXrdLcmaps.so 120828 21:07:13 1663 secgsi_InitOpts:
> >> > >>>> Authorization function parms:
> >> > >>>> --osg,--lcmapscfg,/etc/xrootd/lcmaps.cfg,--loglevel,0|useglobals
> >> > >>>> 120828 21:07:13 1663 secgsi_InitOpts: Authorization cache entries
> >> > >>>> expiration (secs): -1 120828 21:07:13 1663 secgsi_InitOpts: Client
> >> > >>>> proxy availability in XrdSecEntity.endorsement: 0 120828 21:07:13
> >> > >>>> 1663
> >> > >>>> secgsi_InitOpts: VOMS option: 1
> >> > >>>> 120828 21:07:13 1663 secgsi_InitOpts: MonInfo option: 0
> >> > >>>> 120828 21:07:13 1663 secgsi_InitOpts: Crypto modules: ssl
> >> > >>>> 120828 21:07:13 1663 secgsi_InitOpts: Ciphers:
> >> > >>>> aes-128-cbc:bf-cbc:des-ede3-cbc 120828 21:07:13 1663
> >> > >>>> secgsi_InitOpts:
> >> > >>>> MDigests: sha1:md5
> >> > >>>> 120828 21:07:13 1663 secgsi_InitOpts: ***
> >> > >>>> ------------------------------------------------------------ ***
> >> > >>>> 120828 21:07:13 1663 secgsi_LoadAuthzFun: using
> >> > >>>> 'XrdSecgsiAuthzFun()'
> >> > >>>> from libXrdLcmaps.so =====> sec.protocol /usr/lib64 gsi
> >> > >>>> -certdir:/etc/grid-security/certificates
> >> > >>>> -cert:/etc/grid-security/xrd/xrdcert.pem
> >> > >>>> -key:/etc/grid-security/xrd/xrdkey.pem -crl:1
> >> > >>>> -authzfun:libXrdLcmaps.so -authzfunparms:--osg,--lcmapscfg,/et
> >> > >>>> Config
> >> > >>>> 1 authentication directives processed in
> >> > >>>> /etc/xrootd/xrootd-clustered.cfg ------ Authentication system
> >> > >>>> initialization completed.
> >> > >>>> ++++++ File system initialization started.
> >> > >>>> =====> all.role server
> >> > >>>> Config warning: ignoring invalid trace option 'none'.
> >> > >>>> =====> ofs.trace none
> >> > >>>> =====> ofs.authorize
> >> > >>>> =====> ofs.osslib /usr/lib64/libXrdHdfs.so
> >> > >>>> ++++++ Authorization system initialization started.
> >> > >>>> 120828 21:07:13 1663 acc_Config: Authorization system using
> >> > >>>> configuration in /etc/xrootd/xrootd-clustered.cfg =====>
> >> > >>>> acc.authdb
> >> > >>>> /etc/xrootd/Authfile
> >> > >>>> =====> acc.audit deny grant
> >> > >>>> Config 2 authorization directives processed in
> >> > >>>> /etc/xrootd/xrootd-clustered.cfg Config 1 auth entries processed
> >> > >>>> in
> >> > >>>> /etc/xrootd/Authfile
> >> > >>>> ------ Authorization system initialization completed.
> >> > >>>> Copr. 2009, Brian Bockelman, Hdfs Version
> >> > >>>> 120828 21:07:13 1663 hdfs_Config: Copr. 2009, Brian Bockelman,
> >> > >>>> Hdfs
> >> > >>>> Version
> >> > >>>> 120828 21:07:13 1663 hdfs_Config: Configuring HDFS.
> >> > >>>> =====> oss.namelib /usr/lib64/libXrdCmsTfc.so
> >> > >>>> file:/etc/xrootd/storage.xml?protocol=hadoop Copr. 2009 University
> >> > >>>> of
> >> > >>>> Nebraska-Lincoln TFC plugin v 1.0
> >> > >>>> Params: file:/etc/xrootd/storage.xml?protocol=hadoop
> >> > >>>> Xerces-c has been initialized.
> >> > >>>> Connecting to the catalog
> >> > >>>> file:/etc/xrootd/storage.xml?protocol=hadoop
> >> > >>>> Using catalog file /etc/xrootd/storage.xml
> >> > >>>> ------ HDFS storage system initialization completed.
> >> > >>>> 120828 21:07:13 1663 hdfs_HDFS storage system initialization.:
> >> > >>>> completed.
> >> > >>>> ++++++ Configuring server role. . .
> >> > >>>> =====> all.manager srm.unl.edu <http://srm.unl.edu>:1213
> >> > >>>> =====> cms.trace all
> >> > >>>> =====> all.adminpath /var/run/xrootd
> >> > >>>> 120828 21:07:13 1663 Configure Global System Identification:
> >> > >>>> anon-s
> >> > >>>> 1213srm.unl.edu <http://1213srm.unl.edu>>>>>
> >> > >>>> Config effective /etc/xrootd/xrootd-clustered.cfg ofs
> >> > >>>> configuration:
> >> > >>>> ofs.role server
> >> > >>>> ofs.authorize
> >> > >>>> ofs.maxdelay 60
> >> > >>>> ofs.osslib /usr/lib64/libXrdHdfs.so
> >> > >>>> ofs.persist manual hold 600 logdir /var/run/xrootd/.ofs/posc.log
> >> > >>>> ofs.trace 0
> >> > >>>> 
> >> > >>>> ------ File system server initialization completed.
> >> > >>>> Config warning: 'xrootd.prepare logdir' not specified; prepare
> >> > >>>> tracking
> >> > >>>> disabled. 120828 21:07:13 1675 cms_Finder: Connected to cmsd via
> >> > >>>> /var/run/xrootd/.olb/olbd.admin ------ xrootd protocol
> >> > >>>> initialization
> >> > >>>> completed.
> >> > >>>> ------ xrootd [log in to unmask]
> >> 
> >> <mailto:[log in to unmask]>:1094 initialization completed.
> >> 
> >> > >>>> ##################################################################
> >> > >>>> #####
> >> > >>>> #
> >> > >>>> Use REPLY-ALL to reply to list
> >> > >>>> 
> >> > >>>> To unsubscribe from the XROOTD-DEV list, click the following link:
> >> > >>>> https://listserv.slac.stanford.edu/cgi-bin/wa?SUBED1=XROOTD-DEV&A=
> >> > >>>> 1
> >> > >>> 
> >> > >>> ###################################################################
> >> > >>> #####
> >> > >>> Use REPLY-ALL to reply to list
> >> > >>> 
> >> > >>> To unsubscribe from the XROOTD-DEV list, click the following link:
> >> > >>> https://listserv.slac.stanford.edu/cgi-bin/wa?SUBED1=XROOTD-DEV&A=1
> >> > >> 
> >> > >> ####################################################################
> >> > >> ####
> >> > >> Use REPLY-ALL to reply to list
> >> > >> 
> >> > >> To unsubscribe from the XROOTD-DEV list, click the following link:
> >> > >> https://listserv.slac.stanford.edu/cgi-bin/wa?SUBED1=XROOTD-DEV&A=1
> >> > > 
> >> > > #####################################################################
> >> > > ###
> >> > > Use REPLY-ALL to reply to list
> >> > > 
> >> > > To unsubscribe from the XROOTD-DEV list, click the following link:
> >> > > https://listserv.slac.stanford.edu/cgi-bin/wa?SUBED1=XROOTD-DEV&A=1
> >> > > <winmail.dat>
> >> > 
> >> > #######################################################################
> >> > #
> >> > Use REPLY-ALL to reply to list
> >> > 
> >> > To unsubscribe from the XROOTD-DEV list, click the following link:
> >> > https://listserv.slac.stanford.edu/cgi-bin/wa?SUBED1=XROOTD-DEV&A=1
> >> 
> >> -------------------------------------------------------------------------
> >> -------
> >> 
> >> Use REPLY-ALL to reply to list
> >> 
> >> To unsubscribe from the XROOTD-DEV list, click the following link:
> >> https://listserv.slac.stanford.edu/cgi-bin/wa?SUBED1=XROOTD-DEV&A=1
> > 
> > --------------------------------------------------------------------------
> > ------
> > 
> > Use REPLY-ALL to reply to list
> > 
> > To unsubscribe from the XROOTD-DEV list, click the following link:
> > https://listserv.slac.stanford.edu/cgi-bin/wa?SUBED1=XROOTD-DEV&A=1

########################################################################
Use REPLY-ALL to reply to list

To unsubscribe from the XROOTD-DEV list, click the following link:
https://listserv.slac.stanford.edu/cgi-bin/wa?SUBED1=XROOTD-DEV&A=1