
Eran Raviv Blog » Statistics and Econometrics
1000 FOLLOWERS
check my blogs on the topic of Modern statistics and econometrics with applications to financial markets. I am Eran Raviv, working for a large pension fund manager and the author of this blog. Here, you can read about topics I think I want to know more about, or topics I feel there is something to be said about, not easily found elsewhere.
Eran Raviv Blog » Statistics and Econometrics
3w ago
Tree-based methods like decision trees and their powerful random forest extensions are one of the most widely used machine learning algorithms. They are easy to use and provide good forecasting performance off the cuff more or less. Another machine learning community darling is the deep learning method, particularly neural networks. These are ultra flexible algorithms with impressive forecasting performance even (and especially) in highly complex real-life environments.
This post is shares:
Two academic references lauding the powerful performance of tree-based methods.
Because both neural net ..read more
Eran Raviv Blog » Statistics and Econometrics
1M ago
The word spurious refers to “outwardly similar or corresponding to something without having its genuine qualities.” Fake.
While the meanings of spurious correlation and spurious regression are common knowledge nowadays, much less is understood about spurious factors. This post draws your attention to recent, top-shelf, research flagging the risks around spurious factor analysis. While formal solutions are still pending there are couple of heuristics we can use to detect possible problems.
Since you know what spurious correlation is, it’s easy to board the train of thought at this station. When ..read more
Eran Raviv Blog » Statistics and Econometrics
5M ago
Hyper-parameters are parameters which are not estimated as an integral part of the model. We decide on those parameters but we don’t estimate them within, but rather beforehand. Therefore they are called hyper-parameters, as in “above” sense.
Almost all machine learning algorithms have some hyper-parameters. Data-driven choice of hyper-parameters means typically, that you re-estimate the model and check performance for different hyper-parameters’ configurations. This adds considerable computational burden. One popular approach to set hyper-parameters is based on a grid-search over possible val ..read more
Eran Raviv Blog » Statistics and Econometrics
5M ago
Every so often I read about the kernel trick. Each time I read about it I need to relearn what it is. Now I am thinking “Eran, don’t you have this fancy blog of yours where you write about statistics you don’t want to forget?” and then: “why indeed I do have a fancy blog where I write about statistics I don’t want to forget”. So in this post I explain the “trick” in kernel trick and why it is useful.
Why do we need the kernel trick anyway?
The kernel trick is helpful for expanding any data from its original dimension to higher dimension. Fine, who cares?. Well, in life we often benefit from di ..read more