Finance Seminar - Textual Factors: A Scalable, Interpretable, and Data-driven Approach to Analyzing Unstructured Information
10:15am - 11:45am
LSK1003

Abstract:

We introduce a general framework for analyzing large-scale text-based data that combines the strengths of neural network language models and generative statistical modeling. Our methodology generates textual factors by (i) representing texts using vector word embeddings, (ii) clustering words using locality-sensitive hashing, and (iii) identifying spanning vector clusters through topic modeling. Our data-driven approach captures complex linguistic structures while ensuring computational scalability and economic interpretability. We also discuss applications of textual factors in (i) prediction and inference, (ii) interpreting existing models and variables, and (iii) constructing new metrics and explanatory variables, with illustrations using topics in finance and economics such as macroeconomic forecasting and factor asset pricing.
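
As a rough illustration of step (ii) only, the sketch below groups words by hashing their embedding vectors with random hyperplanes, one common family of locality-sensitive hashing. It is not the speaker's implementation: the function names, the number of hyperplanes, and the three-dimensional toy embeddings are all hypothetical, and the topic-modeling step (iii) is omitted.

```python
import random
from collections import defaultdict


def lsh_signature(vec, hyperplanes):
    """Return one bit per hyperplane: 1 if vec lies on its non-negative side."""
    return tuple(
        int(sum(v * h for v, h in zip(vec, plane)) >= 0)
        for plane in hyperplanes
    )


def cluster_words(embeddings, n_planes=4, seed=0):
    """Group words whose embeddings share the same LSH signature.

    embeddings: dict mapping word -> list of floats (all the same length).
    n_planes and seed are illustrative defaults, not values from the talk.
    """
    rng = random.Random(seed)
    dim = len(next(iter(embeddings.values())))
    # Each random Gaussian vector defines a hyperplane through the origin.
    hyperplanes = [
        [rng.gauss(0, 1) for _ in range(dim)] for _ in range(n_planes)
    ]
    buckets = defaultdict(list)
    for word, vec in embeddings.items():
        buckets[lsh_signature(vec, hyperplanes)].append(word)
    return list(buckets.values())


# Made-up toy embeddings; real ones would come from a trained language model.
toy_embeddings = {
    "inflation": [0.9, 0.1, 0.0],
    "prices": [0.8, 0.2, 0.1],
    "dividend": [-0.7, 0.6, 0.1],
    "payout": [-0.8, 0.5, 0.0],
}
clusters = cluster_words(toy_embeddings)
```

Vectors that point in similar directions tend to receive the same signature, so each bucket is a candidate word cluster. Hashing costs one pass over the vocabulary, which is the source of the scalability the abstract mentions.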

Speaker / Performer:
Prof. Lin William Cong
Cornell University
Language:
English
Audience:
Faculty and staff
Postgraduate students
Organizer:
Department of Finance