
Grid-based outlier detection in large data sets for combine harvesters

Ying Gu, Ramkumar Ganesan, Benjamin Bischke, Ansgar Bernardi, Alexander Maier, Heinrich Warkentin, Thilo Steckel, Andreas Dengel

15th IEEE International Conference on Industrial Informatics, July 24-26, Emden, Germany, Pages: 811-818, IEEE, 2017
Outlier detection is one of the most widely used techniques for identifying abnormal behavior in raw data. Abnormal deviation here covers not only human-made or system errors that naturally occur as part of the data but also rarely occurring events. In this paper, we propose a new algorithm called Grid Based Outlier Detection (GBOD) to find the hidden outliers in large data sets. In contrast to existing grid-based methods, which are limited to a few statistical approaches, the GBOD algorithm comes in two variants that detect different ranges of outliers depending on the interest of the user. First, the number of points in a local grid cell is used to decide whether a point is an outlier or not. In a second step, this approach is extended to a method that assigns an outlier score to each data point. The simple design makes this algorithm extremely efficient for large data sets.

BibTeX:

@inproceedings{gu2017grid,
       abstract = {Outlier detection is one of the most widely used
techniques for identifying abnormal behavior in raw data. Abnormal
deviation here covers not only human-made or system errors that
naturally occur as part of the data but also rarely occurring
events. In this paper, we propose a new algorithm called Grid
Based Outlier Detection (GBOD) to find the hidden outliers in
large data sets. In contrast to existing grid-based methods, which
are limited to a few statistical approaches, the GBOD algorithm
comes in two variants that detect different ranges of outliers
depending on the interest of the user. First, the number of points
in a local grid cell is used to decide whether a point is an
outlier or not. In a second step, this approach is extended to a
method that assigns an outlier score to each data point. The
simple design makes this algorithm extremely efficient for large
data sets.},
       month = {7}, 
       year = {2017}, 
       title = {Grid-based outlier detection in large data sets for combine harvesters}, 
       booktitle = {15th IEEE International Conference on Industrial Informatics}, 
       pages = {811--818}, 
       publisher = {IEEE}, 
       author = {Ying Gu and Ramkumar Ganesan and Benjamin Bischke and Ansgar Bernardi and Alexander Maier and Heinrich Warkentin and Thilo Steckel and Andreas Dengel}, 
       keywords = {Outlier Detection, Grid, KNN, Large Data Set, Sensor, Combine Harvester},
       url = {https://doi.org/10.1109/INDIN.2017.8104877}
}