Confirmation, Correction and Improvement for Outlier Validation Using Dummy Variables

   Dummy variables can be used to detect, validate and measure the impact of outliers in data. This paper uses a model to evaluate the effectiveness of dummy variables in detecting outliers. While generally confirming some findings in the literature, the model refutes the presumption that the t˗statistic or the F-incremental statistic is enough to validate an observation as an outlier. In order to rectify this fallacy, this paper recommends an easily-calculable robust standardized residual statistic that is more compatible with the definition of outliers. The robust standardized residual statistic suggested herein is still used in many robust regression methods and is more effective than the t-statistic or the F-incremental statistic in validating outliers with dummy variables. The results of this study suggest some practical recommendations for dealing with outliers and improvements in maintaining the integrity of data. We recommend all previous studies using this statistics be revised in light of the findings presented in this paper.

Keywords::Dummy Variable, t-Statistic, Outlier, Robust Dummy Statistic, Robust Standardized Residual
JEL Classifications:C2, C20, C51, C52.
AMS 2000 Subject Classifications: Primary 60K35, 60K35; secondary 60K35.