TY - GEN
T1 - FINQA
T2 - 2021 Conference on Empirical Methods in Natural Language Processing, EMNLP 2021
AU - Chen, Zhiyu
AU - Chen, Wenhu
AU - Smiley, Charese
AU - Shah, Sameena
AU - Borova, Iana
AU - Langdon, Dylan
AU - Moussa, Reema
AU - Beane, Matt
AU - Huang, Ting Hao
AU - Routledge, Bryan
AU - Wang, William Yang
N1 - Publisher Copyright:
© 2021 Association for Computational Linguistics
PY - 2021
Y1 - 2021
N2 - The sheer volume of financial statements makes it difficult for humans to access and analyze a business's financials. Robust numerical reasoning likewise faces unique challenges in this domain. In this work, we focus on answering deep questions over financial data, aiming to automate the analysis of a large corpus of financial documents. In contrast to existing tasks on general domain, the finance domain includes complex numerical reasoning and understanding of heterogeneous representations. To facilitate analytical progress, we propose a new large-scale dataset, FINQA, with Question-Answering pairs over Financial reports, written by financial experts. We also annotate the gold reasoning programs to ensure full explainability. We further introduce baselines and conduct comprehensive experiments in our dataset. The results demonstrate that popular, large, pre-trained models fall far short of expert humans in acquiring finance knowledge and in complex multi-step numerical reasoning on that knowledge. Our dataset - the first of its kind - should therefore enable significant, new community research into complex application domains. The dataset and code are publicly available.
AB - The sheer volume of financial statements makes it difficult for humans to access and analyze a business's financials. Robust numerical reasoning likewise faces unique challenges in this domain. In this work, we focus on answering deep questions over financial data, aiming to automate the analysis of a large corpus of financial documents. In contrast to existing tasks on general domain, the finance domain includes complex numerical reasoning and understanding of heterogeneous representations. To facilitate analytical progress, we propose a new large-scale dataset, FINQA, with Question-Answering pairs over Financial reports, written by financial experts. We also annotate the gold reasoning programs to ensure full explainability. We further introduce baselines and conduct comprehensive experiments in our dataset. The results demonstrate that popular, large, pre-trained models fall far short of expert humans in acquiring finance knowledge and in complex multi-step numerical reasoning on that knowledge. Our dataset - the first of its kind - should therefore enable significant, new community research into complex application domains. The dataset and code are publicly available.
UR - http://www.scopus.com/inward/record.url?scp=85121704735&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=85121704735&partnerID=8YFLogxK
M3 - Conference contribution
AN - SCOPUS:85121704735
T3 - EMNLP 2021 - 2021 Conference on Empirical Methods in Natural Language Processing, Proceedings
SP - 3697
EP - 3711
BT - EMNLP 2021 - 2021 Conference on Empirical Methods in Natural Language Processing, Proceedings
PB - Association for Computational Linguistics (ACL)
Y2 - 7 November 2021 through 11 November 2021
ER -