Monotonic Detection and Plotting
One important preprocessing step for time series data processing is the detection of monotonic data and transform it to non-monotonic type by using the finite difference operator.
Artificial Data Example
Let's create an artificial monotonic data and apply our monotonic transformer to normalize it. We can use the Plotter
filter to visualize the generated data.
using TSML
Random.seed!(123)
pltr = Plotter(Dict(:pdfoutput => false))
mdates = DateTime(2017,12,1,1):Dates.Hour(1):DateTime(2017,12,31,10) |> collect
mvals = rand(length(mdates)) |> cumsum
df = DataFrame(Date=mdates ,Value = mvals)
fit_transform!(pltr,df);
Now that we have a monotonic data, let's use the Monotonicer
to normalize and plot the result:
using TSML
mono = Monotonicer(Dict())
pipeline = mono |> pltr
res=fit_transform!(pipeline,df);
Real Data Example
We will now apply the entire pipeline starting from reading csv data, aggregate, impute, normalize if it's monotonic, and plot. We will consider three different data types: a regular time series data, a monotonic data, and a daily monotonic data. The difference between monotonic and daily monotonic is that the values in daily monotonic resets to zero or some baseline and cumulatively increases in a day until the next day where it resets to zero or some baseline value. Monotonicer
automatically detects these three different types and apply the corresponding normalization accordingly.
using TSML
regularfile = joinpath(dirname(pathof(TSML)),"../data/typedetection/regular.csv")
monofile = joinpath(dirname(pathof(TSML)),"../data/typedetection/monotonic.csv")
dailymonofile = joinpath(dirname(pathof(TSML)),"../data/typedetection/dailymonotonic.csv")
regularfilecsv = CSVDateValReader(Dict(:filename=>regularfile,:dateformat=>"dd/mm/yyyy HH:MM"))
monofilecsv = CSVDateValReader(Dict(:filename=>monofile,:dateformat=>"dd/mm/yyyy HH:MM"))
dailymonofilecsv = CSVDateValReader(Dict(:filename=>dailymonofile,:dateformat=>"dd/mm/yyyy HH:MM"))
valgator = DateValgator(Dict(:dateinterval=>Dates.Hour(1)))
valnner = DateValLinearImputer(Dict(:dateinterval=>Dates.Hour(1)))
stfier = Statifier(Dict(:processmissing=>true))
mono = Monotonicer(Dict())
pltr = Plotter(Dict(:pdfoutput => false))
Regular TS Processing
Let's test by feeding the regular time series type to the pipeline. We expect that for this type, Monotonicer
will not perform further processing:
- Pipeline with
Monotonicer
: regular time series
pipeline = regularfilecsv |> valgator |> valnner |> mono |> pltr
fit_transform!(pipeline);
- Pipeline without
Monotonicer
: regular time series
pipeline = regularfilecsv |> valgator |> valnner |> pltr
fit_transform!(pipeline);
Notice that the plots are the same with or without the Monotonicer
instance.
Monotonic TS Processing
Let's now feed the same pipeline with a monotonic csv data.
- Pipeline without
Monotonicer
: monotonic time series
pipeline = monofilecsv |> valgator |> valnner |> pltr
fit_transform!(pipeline);
- Pipeline with
Monotonicer
: monotonic time series
pipeline = monofilecsv |> valgator |> valnner |> mono |> pltr
fit_transform!(pipeline);
Notice that without the Monotonicer
instance, the data is monotonic. Applying the Monotonicer
instance in the pipeline converts the data into a regular time series but with outliers.
We can use the Outliernicer
filter to remove outliers. Let's apply this filter after the Monotonicer
and plot the result.
- Pipeline with
Monotonicer
andOutliernicer
: monotonic time series
using TSML: Outliernicer
outliernicer = Outliernicer(Dict(:dateinterval=>Dates.Hour(1)));
pipeline = monofilecsv |> valgator |> valnner |> mono |> outliernicer |> pltr
fit_transform!(pipeline);
Daily Monotonic TS Processing
Lastly, let's feed the daily monotonic data using similar pipeline and examine its plot.
- Pipeline without
Monotonicer
: daily monotonic time series
pipeline = dailymonofilecsv |> valgator |> valnner |> pltr
fit_transform!(pipeline);
This plot is characterized by monotonically increasing trend but resets to certain baseline value at the end of the day and repeat similar trend daily. The challenge for the monotonic normalizer is to differentiate between daily monotonic from the typical monotonic function to apply the correct normalization.
- Pipeline with
Monotonicer
: daily monotonic time series
pipeline = dailymonofilecsv |> valgator |> valnner |> mono |> pltr
fit_transform!(pipeline);
While the Monotonicer
filter is able to transform the data into a regular time series, there are significant outliers due to noise and the nature of this kind of data or sensor.
Let's remove the outliers by applying the Outliernicer
filter and examine the result.
- Pipeline with
Monotonicer
andOutliernicer
: daily monotonic time series
pipeline = dailymonofilecsv |> valgator |> valnner |> mono |> outliernicer |> pltr
fit_transform!(pipeline);
The Outliernicer
filter effectively removed the outliers as shown in the plot.