# Exponential smoothing

Exponential smoothing theory

I was going to write an article about technical indicators and tell you about the **exponential moving average**. However, while studying the theory behind this indicator, I came across some interesting things that relate more to statistics than to the stock market or forex.

Since statistics has already been mentioned on this site, I decided to write a separate article about it: an article about **exponential smoothing** in **time series** analysis.

This topic was raised in the article Seasonal fluctuations. Seasonal indices. Method of simple averages. The method of simple averages for calculating seasonal indices applies mainly to time series that have no upward or downward trend, or only a negligible one. In other words, the observed value fluctuates around some constant value.

What does that mean? It means that the average is constant, and because of that, it cannot capture a trend.

Let's illustrate it with a graph

Generally speaking, all averaging methods are intended to remove the "noise" of random scatter from the data. This makes the trend, or the seasonal or cyclic changes, easier to identify. In other words, averaging exposes the internal structure of seemingly random data, which can then be used to build a model and to analyze and forecast future values. But as we can see, the simple averaging method does not work if there is a pronounced trend. We cannot predict anything with its help.

We need to obtain not a single average but a series of averages. The most popular (and simplest) method for obtaining such a series is exponential smoothing.

It can be described as follows: **when forecasting, newer observed values are given greater weight than older ones, and the weights of older values decrease exponentially.**

Now let's express this definition with formulas.

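By tradition, denote the observed value as $y$ and the smoothed average as $S$.

Then,

$$S_2 = y_1$$

$$S_3 = \alpha y_2 + (1 - \alpha) S_2$$

and, generalized,

$$S_t = \alpha y_{t-1} + (1 - \alpha) S_{t-1}$$

where $\alpha$ takes a value from the range $(0; 1]$.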
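The recurrence can be sketched in a few lines of Python. This is a minimal illustration, not a reference implementation. It assumes $S_2 = y_1$ and $S_t = \alpha y_{t-1} + (1 - \alpha) S_{t-1}$, and the function name `exponential_smoothing` is hypothetical.

```python
def exponential_smoothing(y, alpha):
    """Return [S_2, ..., S_{n+1}] for observations y = [y_1, ..., y_n].

    Assumes S_2 = y_1 and S_t = alpha * y_{t-1} + (1 - alpha) * S_{t-1}.
    The last element, S_{n+1}, uses every observation and serves as the
    forecast for the next, not yet observed, value.
    """
    if not y:
        raise ValueError("need at least one observation")
    if not 0 < alpha <= 1:
        raise ValueError("alpha must be in (0, 1]")
    smoothed = [y[0]]  # S_2 = y_1
    for value in y[1:]:
        # S_{t+1} = alpha * y_t + (1 - alpha) * S_t
        smoothed.append(alpha * value + (1 - alpha) * smoothed[-1])
    return smoothed


# Made-up data: a single spike followed by zeros.
print(exponential_smoothing([500, 0, 0, 0], 0.5))
```

With this made-up input, every smoothed value is half of the previous one, because each new observation contributes nothing and the old average is multiplied by $(1 - \alpha) = 0.5$.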

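Where does the exponent come from? Expand the previous average:

$$S_t = \alpha y_{t-1} + (1 - \alpha) S_{t-1} = \alpha y_{t-1} + \alpha (1 - \alpha) y_{t-2} + (1 - \alpha)^2 S_{t-2}$$

and, generalized,

$$S_t = \alpha \sum_{i=1}^{t-2} (1 - \alpha)^{i-1} y_{t-i} + (1 - \alpha)^{t-2} S_2$$

for $t > 2$.

Thus, the weights before $y$ form an infinitely decreasing geometric progression with common ratio $(1 - \alpha)$.

And the farther along $S$ is, the less it is affected by the initial values.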

Let's assume that $y_1 = 500$ and see how its contribution changes across successive $S$.

In $S_2$ it is taken as is, but in $S_3$, with a coefficient $\alpha$ of 0.5, the contribution of $y_1$ is only 250, in $S_4$ it is 125, and so on.
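The decay of the first observation can be checked numerically. The sketch below assumes $S_2 = y_1$ and the recurrence $S_t = \alpha y_{t-1} + (1 - \alpha) S_{t-1}$, under which the share of $y_1$ in $S_t$ is $(1 - \alpha)^{t-2} y_1$. The value $y_1 = 500$ is inferred from the figure of 250 quoted for $\alpha = 0.5$, and the function name is hypothetical.

```python
def initial_value_contribution(y1, alpha, t):
    """Contribution of y_1 to S_t for t >= 2, assuming S_2 = y_1."""
    if t < 2:
        raise ValueError("S_t is defined for t >= 2")
    # Each smoothing step multiplies the old average by (1 - alpha).
    return (1 - alpha) ** (t - 2) * y1


for t in range(2, 7):
    print(f"S_{t}: {initial_value_contribution(500, 0.5, t)}")
```

Running the same loop with a smaller $\alpha$, for example 0.1, shows the contribution of $y_1$ fading much more slowly.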

The choice of the coefficient $\alpha$ is also important. If you play around with the parameter "a" in the calculator (see above), it becomes clear that the higher the value, the faster an old observation ceases to affect the smoothed average, and vice versa: the lower the value, the longer the observation retains its influence.

Accordingly, for small $\alpha$, the method of obtaining $S_2$ has a great influence on the result. The assignment $S_2 = y_1$ is just one of the methods. As an alternative, the initial value may be, for example, a simple average of the first few values of $y$.

But how do you choose $\alpha$? Which value is most suitable for modeling the dynamics of a given series? There is no mathematical formula for calculating the exact $\alpha$. It is most often chosen by trial and error.

The method is to take several values of $\alpha$ and then select the best one among them. What is the criterion of "best" in our case?

The criterion is the minimum mean squared error. The error is the deviation of the actual value from the forecast. For each value of $S$, the error is squared to get rid of the influence of its sign, and then the average of all the squared errors is calculated. The value of $\alpha$ for which this average is smallest is the best of the candidates.
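The selection procedure can be sketched as a simple search over candidate values. This is an illustration under stated assumptions: $S_2 = y_1$, $S_t = \alpha y_{t-1} + (1 - \alpha) S_{t-1}$, and each $S_t$ is treated as the forecast of $y_t$. The function names, the candidate list, and the sample data are all made up for the example.

```python
def mean_squared_error(y, alpha):
    """Average of (y_t - S_t)^2 over t = 2..n, with S_t as the forecast of y_t."""
    if len(y) < 2:
        raise ValueError("need at least two observations")
    s = y[0]  # S_2 = y_1
    squared_errors = []
    for actual in y[1:]:
        squared_errors.append((actual - s) ** 2)  # error of S_t against y_t
        s = alpha * actual + (1 - alpha) * s      # advance to S_{t+1}
    return sum(squared_errors) / len(squared_errors)


def best_alpha(y, candidates):
    """Return the candidate alpha with the smallest mean squared error."""
    return min(candidates, key=lambda a: mean_squared_error(y, a))


# Made-up series with a steady upward trend.
series = [1, 2, 3, 4, 5]
for a in (0.1, 0.5, 1.0):
    print(a, mean_squared_error(series, a))
print("best:", best_alpha(series, [0.1, 0.5, 1.0]))
```

For this trending sample the largest candidate wins, since a smoothed average that reacts slowly lags further behind the trend. A noisier series without a trend may favor a smaller value.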

Now a few words about the prediction.

The next value of the series is predicted directly from the formula

$$S_{t+1} = \alpha y_t + (1 - \alpha) S_t$$

If a forecast is needed further ahead, a technique called bootstrapping is used. The last known value of $y$ is taken as a constant and is used in the recursive formula.
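Bootstrapping as described here can be sketched as follows. The code assumes $S_2 = y_1$ and $S_{t+1} = \alpha y_t + (1 - \alpha) S_t$, and beyond the end of the data it keeps feeding the last known observation into the same recurrence. The function name `bootstrap_forecast` and the sample data are hypothetical.

```python
def bootstrap_forecast(y, alpha, steps):
    """Forecast `steps` values beyond the observations y = [y_1, ..., y_n]."""
    if not y:
        raise ValueError("need at least one observation")
    s = y[0]  # S_2 = y_1
    for value in y[1:]:
        s = alpha * value + (1 - alpha) * s  # ends as S_{n+1}, the one-step forecast
    origin = y[-1]  # last known observation, held constant from here on
    forecasts = []
    for _ in range(steps):
        forecasts.append(s)
        s = alpha * origin + (1 - alpha) * s  # next forecast reuses the origin
    return forecasts


# Made-up data: two observations, three steps ahead.
print(bootstrap_forecast([10, 20], 0.5, 3))
```

Because the same observation is reused at every step, the forecasts move toward the last known value rather than continuing any trend.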

Now let's apply this knowledge to calculate the smoothed average for the graph shown at the beginning of this article. To make it more interesting, we calculate the smoothed average for three values of $\alpha$ at once, and at the same time calculate the mean squared error.

For reference, the graph also shows the next predicted value, i.e., the smoothed average is extended one point beyond the actual data.

By the way, I should note that the best default value for the calculator above will be 0.7.

With $\alpha$ equal to 1, smoothing degenerates into repeating the previous observed value, which, when neighboring values vary significantly, does not always give the minimum mean squared error.
