SciELO - Scientific Electronic Library Online

 
vol.31 issue1Electric motor failure detection based on indexes of powers and neural networksDevelopment of a pre-cooked nutraceutical cake from biofortified vegetables for older adults author indexsubject indexarticles search
Home Pagealphabetic serial listing  

Services on Demand

Journal

Article

Indicators

Related links

  • Have no similar articlesSimilars in SciELO

Share


Revista Tecnología en Marcha

On-line version ISSN 0379-3982Print version ISSN 0379-3982

Abstract

CALVO-VALVERDE, Luis-Alexander  and  ACUNA-ALPIZAR, Nelson José. Application of ensemble methods in outlier point detection in meteorological time series. Tecnología en Marcha [online]. 2018, vol.31, n.1, pp.98-109. ISSN 0379-3982.  http://dx.doi.org/10.18845/tm.v31i1.3500.

[20]

For this research work, the performance of ensemble methods in the task of outlier points detection in meteorological univariate time series was studied, using the F1 metric to measure the performance. For this purpose, an application was created that allows applying 3 nonensemble classifiers (support vector regression, ARIMA, bayesian networks) and 3 ensemble classifiers (stacking, bagging and AdaBoost) to 3 meteorological datasets (rainfall, maximum temperature and solar radiation).

[21]

Using this application, an experiment was executed to compare the different classifiers. In this experiment, first, the F1 average of the algorithms was obtained by executing multiple tests in each dataset. Then, using a statistical hypothesis test we compared the obtained averages to find out if the observed differences were significant. Finally, a result analysis was performed, focused on comparing the performance of the ensemble classifiers versus the performance of the best non-ensemble classifier for each dataset.

[22]

In general the results indicate that it is possible to significantly improve the performance in the outlier point detection task in some uni-variate time series by using ensemble methods. However, to obtain this improvement several conditions must be met. Although the conditions vary depending on the ensemble method, in general these conditions aim to improve the diversity in the base classifiers. When these conditions were not met, the ensemble methods didn’t have a significant difference in the performance compared to the non-ensemble classifier that got the best performance in the datasets.

Keywords : Outliers; Ensemble methods; ARIMA; Support vector regression; SVR; Bayesian network; Stacking; Bagging; AdaBoost.

        · abstract in Spanish     · text in Spanish     · Spanish ( pdf )