Next: Probability Distribution Functions (PDFs)
Up: tsigex
Previous: tsigex
Introduction
Statistical data analysis with multivariate distributions
is often performed
with one-dimensional projections when variable distributions consist
of essentially uncorrelated variables. This approach works well
when observables are indeed uncorrelated.
In some other cases, ignoring correlations for a given experiment
leads to biases or event mis-classifications.
In this note, the joint probability distribution functions (PDFs) are
approximated with three different techniques. Ensemble tests
are then performed to identify the best general method for signal
extraction with different classes of events. The limitation of each
method are investigated with large statistics toy Monte Carlo simulations.
The three methods
used for the evaluation of the joint PDFs are [1]:
- The 1-dimensional distribution functions. This standard method
used the product of the marginal distribution functions for the evaluation
of the join PDFs. Monte Carlo samples are often used for the computation of
the 1-dimensional distribution functions.
- The Projection and Correlation Approximation (PCA). The PCA method
is a modification to the standard joint PDF calculation. It accounts
for correlations in the input variables with a canonical transformation
of the correlated input variables into a set of uncorrelated variables [2].
- The multi-dimensional distribution functions. This is a general
purpose multivariate classification technique that is easy to understand
and apply to binned data. Normally, one parametrized the data as a function
of binned variables and the PDFs are extracted for each bin of the input
variables. The multi-dimensional must then relies on very large
Monte Carlo samples to insure the adequate representation of the joint
PDFs in each bins (even those with low statistical population).
This SNO analysis note also describes how to use qSigEx for the proposed task
and how to investigate biases and binned effects in multivariate data analysis
for event classification.
Next: Probability Distribution Functions (PDFs)
Up: tsigex
Previous: tsigex
Alain Bellerive
2006-05-19