ISEG

Aluno: JoÃo Diogo Marques Ferraz

Resumo

Options are financial derivatives used for risk management and speculation, for example, and have been studied extensively in order to forecast its price. Before the technological revolution, parametric models were used with strict assumptions to forecast the options price, such as the Black-Scholes Model. Since then, Machine Learning models, such as the XGBoost model have been created to make forecasts without such strict assumptions from parametric models. The purpose of this dissertation is to show how the XGBoost model can forecast option prices accurately using variables from the BSM. In addition, by using the struc- ture of the standard deviation, third and fourth moment of the distribution of the stock price, the days to the next dividend and the next dividend to be paid, this study aims to understand by how much it improves the price forecasted from the XGBoost model with variables from the BSM. Thus, options from 100 of the biggest companies in the S&P 500 between November and February of 2020 are used to train and test the two XGBoost models. The BSM is used as the benchmark. The results are very favorable towards the XGBoost models since the RMSE of the first and the second model are lower than the BSM by 29.51% and 35.47% , respec- tively. When looking at the options by its distance to the strike price, the XGBoost models always perform better than the BSM, but when the latter has a terrible per- formance for ITM, XGBoost has a bad performance too. For OTM put options, BSM underprices the options while XGBoost models don’t. For short maturities, the XGBoost models don’t improve the performance relative to the BSM by much. Although, they provide a good forecast when compared to BSM for options with long maturities. In a nutshell, the second XGBoost model is always better than the first and almost always they forecast with better accuracy and less bias than the BSM.

Trabalho final de Mestrado

TFM_JoÃo Diogo Marques Ferraz