Comparative text analytics via topic modeling in banking | IEEE Conference Publication | IEEE Xplore

Comparative text analytics via topic modeling in banking


Abstract:

In this paper, we compare and evaluate multiple topic modeling approaches and their effectiveness in analyzing a large set of SEC filings by US public banks. More specifically, we apply four major topic modeling methods to a corpus of 8-K and 10-K filings, from the years 2005-2016, of 578 bank holding companies. These methods are Principal Component Analysis, Non-negative Matrix Factorization, Latent Dirichlet Allocation, and KATE, a novel k-competitive autoencoder for text documents. Separately for 8-K and 10-K filings, the usefulness and effectiveness of these methods are evaluated by comparing their performance on two classification tasks: (i) predicting which section each document corresponds to, where we consider each section within an 8-K or 10-K filing as an individual document, and (ii) detecting text from a bank's year of failure, a task for which we use bank failure data from the 2008 financial crisis. In addition, we qualitatively compare the topics discovered by the different methods. We conclude that topic modeling can be an effective tool in financial decision making and risk management.
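
The evaluation idea in the abstract (derive topic features, then score them on a section-classification task) can be illustrated with a minimal sketch. It assumes scikit-learn, which the abstract does not mention, and it is not the authors' pipeline. The toy documents, labels, and the helper names `topic_features` and `section_accuracy` are hypothetical. KATE is omitted because it has no scikit-learn implementation.

```python
import numpy as np
from sklearn.decomposition import NMF, PCA, LatentDirichletAllocation
from sklearn.feature_extraction.text import CountVectorizer, TfidfVectorizer
from sklearn.linear_model import LogisticRegression
from sklearn.model_selection import cross_val_score

# Hypothetical toy "sections" (NOT data from the paper):
# label 0 = risk-related text, label 1 = compensation-related text.
docs = [
    "credit risk loan losses and liquidity risk exposure",
    "market risk interest rate risk and loan default losses",
    "liquidity risk capital adequacy and credit losses",
    "executive compensation salary bonus and stock awards",
    "director compensation stock options and annual bonus",
    "salary bonus awards and executive stock compensation",
]
labels = [0, 0, 0, 1, 1, 1]


def topic_features(texts, method, n_topics=2, seed=0):
    """Return an (n_docs, n_topics) feature matrix for one method."""
    if method == "lda":
        # LDA is a model of word counts, so it gets raw term counts.
        counts = CountVectorizer().fit_transform(texts)
        lda = LatentDirichletAllocation(n_components=n_topics, random_state=seed)
        return lda.fit_transform(counts)
    tfidf = TfidfVectorizer().fit_transform(texts)
    if method == "pca":
        # PCA centers the data, so the sparse matrix is densified first.
        return PCA(n_components=n_topics, random_state=seed).fit_transform(tfidf.toarray())
    if method == "nmf":
        nmf = NMF(n_components=n_topics, init="nndsvda", max_iter=500, random_state=seed)
        return nmf.fit_transform(tfidf)
    raise ValueError(f"unknown method: {method}")


def section_accuracy(features, y, folds=3):
    """Mean cross-validated accuracy of a linear classifier on the features."""
    clf = LogisticRegression(max_iter=1000)
    return float(cross_val_score(clf, features, y, cv=folds).mean())


for method in ("pca", "nmf", "lda"):
    feats = topic_features(docs, method)
    print(method, feats.shape, round(section_accuracy(feats, labels), 2))
```

For brevity, the sketch fits each topic model on all documents before cross-validating the classifier. A stricter comparison would fit the topic model inside each training fold.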
Date of Conference: 27 November 2017 - 01 December 2017
Date Added to IEEE Xplore: 05 February 2018
Conference Location: Honolulu, HI, USA
