Charles Explorer logo
🇬🇧

Sector categorization using gradient boosted trees trained on fundamental firm data

Publication at Faculty of Mathematics and Physics |
2020

Abstract

We examine to what extent the GICS sector categorization of equity securities may be systematically reconstructed from historical quarterly firm fundamental data using gradient boosted tree classification. Model complexity and performance tradeoffs are examined and relative feature importance is described.

Potential extensions are outlined including ideas to improve feature engineering, validating internal consistency and integrating additional data sources to further improve classification accuracy.