Adaptive Machine Learning of Interaction Sequences
Data mining and machine learning offer a wide variety of algorithms for Cognitive Interaction Technologies. With data mining techniques systems can identify relevant data, recognize important data changes, or focus their resources and capabilities toward a given target. In a Cognitive Interaction environment such as a smart room as developed in the Ambient Intelligence Group with a dense net of sensors most of the collected data can be naturally described as multivariate time series, and it is obvious that reliable, robust, and flexible data mining and machine learning algorithms for time series hold a key role in application design.
Data mining and machine learning of multivariate time series is a difficult task. Because of varying lengths of time series, varying development over time, or possible long term correlations traditional machine learning algorithms do not qualify for this type of data.
Data mining and machine learning of multivariate time series is a difficult task. Because of varying lengths of time series, varying development over time, or possible long term correlations traditional machine learning algorithms do not qualify for this type of data. Embedding time series in a consistent feature space is necessary, which mostly conflicts with the varying structure of the data, and solutions like truncating or time series features are often impractical or come along with loss of information. Other approaches, e.g. alignment-kernel, may be successful on small data sets but are time-consuming and therefore problematic. Ordered means models present excellent starting points for this project. Machine learning of multivariate time series in a highly dynamic environment requires high degrees of adaptation and online learning capability. This project addresses exactly this problem, namely, to explore how OMMs can be extended such that the algorithms will be enabled to adapt learned models to new and unknown data and, thus, to new and unknown situations. Further concerns in dynamic environments are the tasks of unsupervised learning. Latest developments do not only learn and analyze sequences, but also generate new instances of learned time series data.
We develop models for unsupervised and reinforcement learning of hierarchical structure from sequential data both for the analysis and generation of multivariate sequence patterns, in particular, unsuspervised hierachical approaches.
As a first outcome, we developed two python packages: pyOMM -- a python package for supervised and unsupervised machine learning of time series with ordered means models --, an pyKTree: an implementation of the K-tree algorithm in the Python programming language.