Transactions on Machine Learning and Data Mining (ISSN: 1865-6781)
Volume 7 - Number 1 - July 2014 - Pages 3-25
Experiencing the Shotgun Distance for Time Series Analysis
Patrick Schäfer1
1Zuse Institute Berlin, Berlin, Germany
Abstract
Similarity search is a core functionality in many data mining algorithms. Over the past decade algorithms were designed to mostly work with human assistance to extract characteristic, aligned patterns of equal length and scaling. We propose the shotgun distance similarity measure that extracts, scales, and aligns segments from a query to a sample time series. This greatly simplifies the time series analysis task of those time series produced by sensors. We show the applicability of our shotgun distance in the context of hierarchical clustering of heraldic shields, and human motion detection. A time series is segmented using varying lengths as part of our shotgun ensemble classifier. This classifier improves the best published accuracies on case studies in the context of bioacoustics, human motion detection, spectrographs or personalized medicine. Finally, it performs better than state of the art on the official UCR classification benchmarks.
Keywords: Time Series, Distance Measure, Similarity, Classification, Hierarchical Clustering, Shotgun Analysis, Segments
Download Paper (348 KB)