Author ORCID Identifier

0000-0002-6348-5207

Document Type

Dissertation

Date of Award

5-31-2025

Degree Name

Doctor of Philosophy in Computing Sciences - (Ph.D.)

Department

Computer Science

First Advisor

Guiling Wang

Second Advisor

Zhi Wei

Third Advisor

Shantanu Sharma

Fourth Advisor

Rong Liu

Fifth Advisor

Wenpeng Yin

Abstract

This dissertation explores the evolution and application of artificial intelligence techniques across three critical domains: financial modeling, mathematical reasoning, and structured data analysis. The dissertation presents seven research projects that chart a progression from specialized neural architectures to sophisticated large language models (LLMs), contributing novel methodologies and frameworks at each stage.

In the financial domain, the research first introduces TS-Mixer, a MLP-based architecture for time-series forecasting that captures both feature relationships and temporal dependencies through a simple yet effective design, outperforming more complex models in S&P500 index prediction. The dissertation then presents DySTAGE, a dynamic graph representation learning framework that addresses the evolving nature of financial markets by modeling changing asset relationships, demonstrating superior performance in both predictive accuracy and portfolio optimization. Finally, the dissertation proposes a hybrid framework integrating LLMs with reinforcement learning for adaptive margin trading, enabling dynamic risk management through explainable market reasoning.

For mathematical reasoning, the research develops two novel evaluation frameworks that expand beyond traditional correctness metrics: CreativeMath and FaultyMath. CreativeMath assesses LLMs' ability to generate novel, insightful solutions to mathematical problems, introducing a comprehensive benchmark of competition-level problems with multiple human solutions. FaultyMath evaluates logical robustness by testing whether models can identify logically flawed or unsolvable problems, revealing significant gaps in current systems' critical thinking capabilities.

In structured data analysis, the dissertation introduces DataFrame QA, a privacy-preserving framework that enables natural language interaction with tabular data without exposing sensitive information, achieving high accuracy while eliminating data exposure risks. The research also presents TextFlow, a modular approach to flowchart understanding that separates visual extraction from semantic reasoning, demonstrating substantial improvements over end-to-end vision-language models in accuracy and interpretability.

Collectively, these contributions advance AI capabilities across multiple dimensions—efficiency, adaptability, creativity, logical robustness, privacy, and interpretability—while establishing methodologies that leverage the strengths of different AI paradigms for complex analytical tasks. The dissertation provides both theoretical insights and practical frameworks that bridge the gap between specialized neural architectures and general-purpose language models, with applications in finance, education, data science, and beyond.

Share

COinS
 
 

To view the content in your browser, please download Adobe Reader or, alternately,
you may Download the file to your hard drive.

NOTE: The latest versions of Adobe Reader do not support viewing PDF files within Firefox on Mac OS and if you are using a modern (Intel) Mac, there is no official plugin for viewing PDF files within the browser window.