par Croux, Christophe ;Haesbroeck, Gentiane
Référence Computational statistics & data analysis, 25, 2, page (125-141)
Publication Publié, 1997-07
Article révisé par les pairs
Résumé : In a robust analysis, the minimum volume ellipsoid (MVE) estimator is very often used to estimate both multivariate location and scatter. The MVE estimator for the scatter matrix is defined as the smallest ellipsoid covering half of the observations, while the MVE location estimator is the midpoint of that ellipsoid. The MVE estimators can be computed by minimizing a certain criterion over a high-dimensional space. In practice, one mostly uses algorithms based on minimization of the objective function over a sequence of trial estimates. One of these estimators uses a resampling scheme, and yields the (p + 1)-subset estimator. In this note, we show how this estimator can easily be adapted, yielding a considerable increase of statistical efficiency at finite samples. This gain in precision is also observed when sampling from contaminated distributions, and it becomes larger when the dimension increases. Therefore, we do not need more computation time nor do we lose robustness properties. Moreover, only a few lines have to be added to existing computer programs. The key idea is to average over several trials close to the optimum, instead of just picking out the trial with the lowest value for the objective function. The resulting estimator keeps the equivariance and robustness properties of the original MVE estimator. This idea can also be applied to several other robust estimators, including least-trimmed-squares regression. © 1997 Elsevier Science B.V.