Anomaly detection through spatio-temporal data mining, with application to near real-time outlying sensor identification

dc.contributor.advisorChairperson, Graduate Committee: John Paxton; Rafal A. Angryk (co-chair)en
dc.contributor.authorGalarus, Douglas Edwarden
dc.date.accessioned2017-12-27T21:26:56Z
dc.date.available2017-12-27T21:26:56Z
dc.date.issued2017en
dc.description.abstractThere is a need for robust solutions to the challenges of near real-time spatio-temporal outlier and anomaly detection. In our dissertation, we define and demonstrate quality measures for evaluation and comparison of overlapping, real-time, spatio-temporal data providers and for assessment and optimization of data acquisition, system operation and data redistribution. Our measures are tested on real-world data and applications, and our results show the need and potential to develop our own mechanisms for outlier and anomaly detection. We then develop a representative, near real-time solution for the identification of outlying sensors that far outperforms state of the art methods in terms of accuracy and is computationally efficient. When applied to a real-world, meteorological data set, we identify numerous problematic sites that otherwise have not been flagged as bad. We identify sites for which metadata is incorrect. We identify observations that have been mislabeled by provider quality control processes. And, we demonstrate that our method outperforms enhanced versions of state of the art methods for assessment of accuracy using comparable or less computation time. There are many quality-related problems with real data sets and, in the absence of an approach like ours, these problems may have largely gone unidentified. Our approach is novel for the simple but effective way that it accounts for spatial and temporal variation, and that it addresses more than just accuracy. Collectively these contributions form an overarching data-mining framework and example that can be used and extended for data-mining method development, model building and evaluation of spatio-temporal outlier and anomaly detection processes.en
dc.identifier.urihttps://scholarworks.montana.edu/handle/1/13096en
dc.language.isoenen
dc.publisherMontana State University - Bozeman, College of Engineeringen
dc.rights.holderCopyright 2017 by Douglas Edward Galarusen
dc.subject.lcshData miningen
dc.subject.lcshQuality controlen
dc.subject.lcshEvaluationen
dc.subject.lcshDatabasesen
dc.titleAnomaly detection through spatio-temporal data mining, with application to near real-time outlying sensor identificationen
dc.typeDissertationen
mus.data.thumbpage41en
thesis.degree.committeemembersMembers, Graduate Committee: Brendan Mummey.en
thesis.degree.departmentComputer Science.en
thesis.degree.genreDissertationen
thesis.degree.namePhDen
thesis.format.extentfirstpage1en
thesis.format.extentlastpage270en

Files

Original bundle

Now showing 1 - 1 of 1
Thumbnail Image
Name:
GalarusD0817.pdf
Size:
9.78 MB
Format:
Adobe Portable Document Format

License bundle

Now showing 1 - 1 of 1
No Thumbnail Available
Name:
license.txt
Size:
826 B
Format:
Plain Text
Description:
Copyright (c) 2002-2022, LYRASIS. All rights reserved.