>
> Actually, since we are running many CPF processes in parallel, it is
> impossible to remove duplicates if the same hbook file or the same run
> is read in two different jobs.
> About the creation of the reduced collections, you mean you are using the
> same tcl files we used to produce the hbook files. If any run is present in
> two different tcl files, it will certainly produce duplicated events, correct?

yep
	ric
>
> Daniele
>
> On Wed, 6 Feb 2002, Riccardo Faccini wrote:
>
> > Hi,
> > one of the things Cecilia noticed in using CPFramework is that duplicate
> > removal seems not to be turned on by default (I did not remember this). To
> > turn it on you should toggle cpf/nomulti.
> > As for the reduced collection creation, the way it works is that for
> > each event we check whether it is in the list. The only way to get duplicates
> > is to run twice on the same event AND to have the TS twice in the files.
> >
> > Of course I cannot completely exclude an operational error such as filling
> > the same reduced collection twice.
> > It might be worthwhile making a pass at removing duplicates from the
> > reduced collections I am doing right now
> > 	ciao
> > 	ric
> >
> >
> >
> > On Wed, 6 Feb 2002, Daniele del Re wrote:
> >
> > >
> > > I looked at the TS files for run1 and I discovered that some duplicates
> > > are already present at the TS file level.
> > > The effect seems to be around 0.5% of the events for Dstar0, 0.3% for Dc,
> > > and no effect for Dstar and D0.
> > > Actually it is possible that the TS were produced by running twice on the
> > > same hbook, but in principle Riccardo's script should have removed them
> > > when the collections were produced.
> > > Anyway the effect seems to be small, and the new reduced collections we are
> > > preparing (which also include run1) should be unaffected by this
> > > duplication problem.
> > >
> > > Daniele
> > >
> > > On Wed, 6 Feb 2002, Alessio Sarti wrote:
> > >
> > > > Hi all,
> > > > some BAD news on the timestamp side.
> > > > Working hard on cutting the duplicates, I've always assumed that within the
> > > > same mode (& superblock) there are no duplicated events.
> > > > Checking the timestamps provided by CPFramework (but it was a very quick
> > > > check), I had validated this assumption.
> > > > Now the problem comes up again:
> > > > running on the Dstar0 reduced ntuples (and now I'm trying to check the
> > > > other seeds too) I found that in the SAME REDUCED ROOT FILE and NTUPLE
> > > > there are duplicated events.
> > > > The checked file is :
> > > >
> > > > /u/ec/ursl/d/output/breco-112101/data/
> > > > dstar0_110901_superblock2_2000_b1-s2_aa.root (.hbook)
> > > >
> > > > I have run CPF and anaRecoil on this file and I got some duplicated
> > > > events:
> > > > first chunk
> > > > CPHbook/BCH_3_stprova.ts_3:7F:7FFFFF:00154E2D/C03C52A3:P  run=00012318
> > > > CPHbook/BCH_3_stprova.ts_3:7F:7FFFFF:00154E38/AB59C9FB:J  run=00012318
> > > > CPHbook/BCH_3_stprova.ts_3:7F:7FFFFF:00154E43/CE1A9DE3:J  run=00012318
> > > > CPHbook/BCH_3_stprova.ts_3:7F:7FFFFF:00154E06/4A2C19EF:S  run=00012318
> > > > CPHbook/BCH_3_stprova.ts_3:7F:7FFFFF:00154E0F/47D4E057:S  run=00012318
> > > > CPHbook/BCH_3_stprova.ts_3:7F:7FFFFF:00154E31/5FE2AB1B:K  run=00012318
> > > > CPHbook/BCH_3_stprova.ts_3:7F:7FFFFF:00154E13/A575A2C3:S  run=00012318
> > > > CPHbook/BCH_3_stprova.ts_3:7F:7FFFFF:00154E30/8B7862E3:J  run=00012318
> > > > CPHbook/BCH_3_stprova.ts_3:7F:7FFFFF:00154E30/AFA6889B:M  run=00012318
> > > > second chunk
> > > > CPHbook/BCH_3_stprova.ts_3:7F:7FFFFF:00154E2D/C03C52A3:P  run=00012318
> > > > CPHbook/BCH_3_stprova.ts_3:7F:7FFFFF:00154E38/AB59C9FB:J  run=00012318
> > > > CPHbook/BCH_3_stprova.ts_3:7F:7FFFFF:00154E43/CE1A9DE3:J  run=00012318
> > > > CPHbook/BCH_3_stprova.ts_3:7F:7FFFFF:00154E06/4A2C19EF:S  run=00012318
> > > > CPHbook/BCH_3_stprova.ts_3:7F:7FFFFF:00154E0F/47D4E057:S  run=00012318
> > > > CPHbook/BCH_3_stprova.ts_3:7F:7FFFFF:00154E31/5FE2AB1B:K  run=00012318
> > > > CPHbook/BCH_3_stprova.ts_3:7F:7FFFFF:00154E13/A575A2C3:S  run=00012318
> > > > CPHbook/BCH_3_stprova.ts_3:7F:7FFFFF:00154E30/8B7862E3:J  run=00012318
> > > > CPHbook/BCH_3_stprova.ts_3:7F:7FFFFF:00154E30/AFA6889B:M  run=00012318
> > > >
> > > > It is really difficult to find those events because they are replicated in
> > > > the same order but in widely separated parts of the output files
> > > >
> > > > (from the first to the second chunk I got ~5000 events....)
> > > >
> > > > We really have to find out how this is possible.... Now we are using
> > > > ntuples that may contain duplicated SIGNAL events.
> > > >
> > > > I'm going to quantify the problem and check it across the various seeds,
> > > > but I think that will be very hard work.
> > > >
> > > > Does anybody have an idea of how this can happen?
> > > > Cheers,
> > > > Alessio
> > > >
> > > > ______________________________________________________
> > > > Alessio Sarti
> > > >  Universita' & I.N.F.N. Ferrara
> > > >  tel  +39-0532-781928  Ferrara
> > > >
> > > > "Quod non fecerunt barbari, fecerunt Berlusconi"
> > > >
> > > > "Che il bianco sia bianco, e che il nero sia nero
> > > >  che uno e uno fanno due e che la scienza dice il vero....
> > > >  DIPENDE !"
> > > >
> > > >
> > >
> > >
> >
> >
> >
>
>
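
A minimal sketch of how repeated timestamp lines like the ones Alessio
quotes could be located in a dumped listing, even when the two copies sit
thousands of events apart. It is not part of CPFramework: the function names
find_duplicates and drop_duplicates are hypothetical, and it assumes that the
full timestamp string together with the run number identifies an event.

```python
from collections import defaultdict


def _key(line):
    # Assumption: the whole line (timestamp string plus "run=...") identifies
    # the event; whitespace is normalised so spacing differences do not matter.
    return " ".join(line.split())


def find_duplicates(lines):
    """Map each repeated line to the list of positions where it occurs."""
    positions = defaultdict(list)
    for index, line in enumerate(lines):
        key = _key(line)
        if key:  # skip blank lines
            positions[key].append(index)
    return {key: where for key, where in positions.items() if len(where) > 1}


def drop_duplicates(lines):
    """Keep only the first occurrence of each line, preserving the order."""
    seen = set()
    unique = []
    for line in lines:
        key = _key(line)
        if key and key not in seen:
            seen.add(key)
            unique.append(line)
    return unique
```

As Daniele points out, a check like this only helps when it sees every event
at once: it would have to run on the merged listing, not separately inside
each parallel job.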