Computational cognitive science and AI

Hanbo Xie

I study how people think, decide, and learn by combining cognitive modeling, think-aloud data, and large language models.

Ph.D. student, Georgia Tech Psychology | Princeton CoCoSci Lab visit, Mar-Jul 2025

Cognitive Modeling | Think-Aloud Protocols | Human-Centered AI

About

Computational approaches to cognition, decision-making, and human-centered AI.

I am a Ph.D. student in Psychology at the Georgia Institute of Technology, specializing in computational cognitive science. My work asks how language, behavior, and computational models can reveal the hidden structure of human thought.

My thesis uses large language models to analyze think-aloud data from decision-making and learning tasks. Rather than treating behavior as only button presses or choices, I use participants' verbal reports as a richer window into strategies, beliefs, and latent cognitive processes. I am especially interested in when LLMs can help measure these processes reliably, and where human judgment remains essential.

I also study AI systems through ideas from psychology and neuroscience: how models explore, reason, explain decisions, and interact with people. More broadly, I am interested in AI-assisted scientific discovery that expands, rather than replaces, careful empirical work.

Before Georgia Tech, I earned an M.A. from the University of Arizona and spent three years as a full-time research assistant at Peking University's CBCS. From March to July 2025, I visited Tom Griffiths' CoCoSci Lab at Princeton University as a Visiting Student Research Collaborator.

Research Highlights

Selected themes connecting cognitive science, language, behavior, and AI.

Modeling Thought from Think-Aloud Data

The think-aloud protocol asks participants to verbalize their thoughts while they perform psychological tasks. Traditional work has mostly relied on behavioral outputs (often button presses) to infer latent cognitive processes. In many cases, candidate cognitive models are proposed and tested by researchers, which can limit the hypothesis space and introduce bias. By directly analyzing participants' verbal reports, we gain a richer and more direct view of cognition during task performance. However, most prior think-aloud research depends on manual coding by experts, which is labor-intensive, subjective, and difficult to scale.

Recent advances in LLMs make it possible to revisit this classic protocol with stronger computational tools. LLMs can help quantify and interpret think-aloud language, and even predict subsequent behavior from it. Our work evaluates when and how these models can be used reliably, with the goal of building a more systematic and scalable framework for studying human thought processes.

Representative publications:

  • Xie, H., Xiong, H., & Wilson, R. C. (2023). Text2Decision: Decoding Latent Variables in Risky Decision Making from Think Aloud Text. NeurIPS 2023 AI for Science Workshop.
  • Xie, H., Xiong, H., & Wilson, R. C. (2024). From Strategic Narratives to Code-Like Cognitive Models: An LLM-Based Approach in A Sorting Task. First Conference on Language Modeling (COLM).
  • Xie, H.†, Xiong, H. D., & Wilson, R. C. (2025). Rethinking Think-Aloud in the Age of Language Models. PsyArXiv. https://osf.io/preprints/psyarxiv/6ta3z_v1 (submitted).
  • Xie, H.†, Jagadish, A. K., Pan, L., & Wilson, R. C. (2026). Think-aloud reshapes automated cognitive model discovery beyond behavior. arXiv preprint arXiv:2605.05091. https://doi.org/10.48550/arXiv.2605.05091 (submitted).
  • Zhang, Z.*, Xie, H.*, Baker, T., Peters, M., & Wilson, R. C. (2025). Linking strategies to think aloud in a stochastic learning task. In Proceedings of the Annual Meeting of the Cognitive Science Society.

Reverse Engineering Human Thoughts

Human thought is central to intelligence, yet it is difficult to define, measure, and model. A core challenge in both cognitive science and AI is to characterize thought processes across tasks, identify shared principles, and generalize those principles to make useful predictions. The difficulty is that thoughts are often implicit, while language is diverse and context-dependent. As a result, verbal reports are informative but still incomplete reflections of internal cognition.

Instead of focusing only on the forward direction (how thoughts generate behavior), this project emphasizes inverse inference: given observed behavior and related measurements, can we reconstruct plausible underlying thoughts? The broader goal is to build a stronger, more general bridge between behavior and cognition. This direction also supports a human-centered understanding of machine reasoning. If the computations of complex systems (e.g., AlphaGo-like models) can be approximated by human-trained explanatory models, we may be able to describe model reasoning in natural language that is useful for teaching, interpretation, and collaboration.

This project began during my Princeton visit and remains an active research direction.

Representative publications:

  • Zhu, J.-Q.*, Xie, H.*, Arumugam, D., Wilson, R. C., & Griffiths, T. L. (2025). Using reinforcement learning to train large language models to explain human decisions. arXiv preprint arXiv:2505.11614. ICLR 2026.

Human Insights for Artificial Intelligence

This project examines AI through concepts from psychology and neuroscience. By comparing strengths and weaknesses of AI and human intelligence, we can design models that are both more capable and more interpretable. Beyond technical performance, I am interested in societal value: systems that support human decision-making, education, and collaboration. I also explore how people can learn from advanced AI models when we build the right frameworks to analyze and communicate their internal computations.

Representative publications:

  • Pan, L.*, Xie, H.*†, & Wilson, R. C. (2025). Large Language Models Think Too Fast To Explore Effectively. arXiv preprint arXiv:2501.18009. NeurIPS 2025 Poster.

Accelerating Discovery in Cognitive Science

The human mind is deeply complex. Although thoughts, emotions, and actions are part of everyday experience, formally describing and predicting cognition remains a major scientific challenge. Many cognitive theories are grounded in human intuition and then tested through experiments and computational models. These approaches are powerful, but they can remain constrained by the original hypothesis space.

In the AI era, there is an opportunity to rethink discovery pipelines in cognitive science and psychology. LLMs bring broad knowledge and strong inductive biases, and modern reasoning models can perform at levels that sometimes rival expert intuition. A central question for my work is whether we can build AI-assisted workflows that help discover new behavioral phenomena, generate computational models, and propose testable theories while reducing avoidable human bias. I view this as complementary to, not a replacement for, careful empirical research.

Representative publications:

  • Xie, H., Xiong, H., & Wilson, R. C. (2023). Text2Decision: Decoding Latent Variables in Risky Decision Making from Think Aloud Text. NeurIPS 2023 AI for Science Workshop.
  • Xie, H., Xiong, H., & Wilson, R. C. (2024). From Strategic Narratives to Code-Like Cognitive Models: An LLM-Based Approach in A Sorting Task. First Conference on Language Modeling (COLM).
  • Xie, H.†, Jagadish, A. K., Pan, L., & Wilson, R. C. (2026). Think-aloud reshapes automated cognitive model discovery beyond behavior. arXiv preprint arXiv:2605.05091. https://doi.org/10.48550/arXiv.2605.05091 (submitted).
  • Zhu, J.-Q.*, Xie, H.*, Arumugam, D., Wilson, R. C., & Griffiths, T. L. (2025). Using reinforcement learning to train large language models to explain human decisions. arXiv preprint arXiv:2505.11614. ICLR 2026.
  • Xie, H.*, & Zhu, J.* (2025, July 12). Centaur May Have Learned a Shortcut that Explains Away Psychological Tasks. https://doi.org/10.31234/osf.io/u7z4t_v1 (submitted).

Publications

* denotes equal contribution; † denotes corresponding author; underscore denotes mentee.

2026

  • Preprint | Think-Aloud | LLM
    Xie, H.†, Jagadish, A. K., Pan, L., & Wilson, R. C. (2026). Think-aloud reshapes automated cognitive model discovery beyond behavior. arXiv preprint arXiv:2605.05091. https://doi.org/10.48550/arXiv.2605.05091 (submitted).

2025

  • Journal | Decision | Social
    Qiu, S., Tang, Y., Yu, H., Xie, H., Dreher, J. C., Hu, Y., & Zhou, X. (2025). Toward a computational understanding of bribe-taking behavior. Annals of the New York Academy of Sciences.
  • Conference | LLM | Decision
    Zhu, J.-Q.*, Xie, H.*, Arumugam, D., Wilson, R. C., & Griffiths, T. L. (2025). Using reinforcement learning to train large language models to explain human decisions. arXiv preprint arXiv:2505.11614. ICLR 2026.
  • Conference | LLM | Decision
    Pan, L.*, Xie, H.*†, & Wilson, R. C. (2025). Large Language Models Think Too Fast To Explore Effectively. arXiv preprint arXiv:2501.18009. NeurIPS 2025 Poster.
  • Conference | LLM | AI
    Xie, H.†, Zhu, J. Q., Xiong, H. D., Wilson, R., & Griffiths, T. (2025). Reasoning Across Minds and Machines. In Proceedings of the Annual Meeting of the Cognitive Science Society (Vol. 47).
  • Conference | Think-Aloud | Learning
    Zhang, Z.*, Xie, H.*, Baker, T., Peters, M., & Wilson, R. C. (2025). Linking strategies to think aloud in a stochastic learning task. In Proceedings of the Annual Meeting of the Cognitive Science Society.
  • Preprint | Think-Aloud | LLM
    Xie, H.*, & Zhu, J.* (2025, July 12). Centaur May Have Learned a Shortcut that Explains Away Psychological Tasks. https://doi.org/10.31234/osf.io/u7z4t_v1 (submitted).
  • Preprint | Think-Aloud | LLM
    Xie, H.†, Xiong, H. D., & Wilson, R. C. (2025). Rethinking Think-Aloud in the Age of Language Models. PsyArXiv. https://osf.io/preprints/psyarxiv/6ta3z_v1 (submitted).

2024

  • Journal | Decision | Clinical
    Fang, Z., Zhao, M., Xu, T., Li, Y., Xie, H., Quan, P., ... & Zhang, R. Y. (2024). Individuals with anxiety and depression use atypical decision strategies in an uncertain world. eLife, 13.
  • Conference | Think-Aloud | LLM
    Xie, H., Xiong, H., & Wilson, R. C. (2024). From Strategic Narratives to Code-Like Cognitive Models: An LLM-Based Approach in A Sorting Task. First Conference on Language Modeling (COLM).
  • Conference | Think-Aloud | Decision
    Xie, H., Xiong, H., & Wilson, R. C. (2024). Evaluating Predictive Performance and Learning Efficiency of Large Language Models with Think Aloud in Risky Decision Making. Computational Cognitive Neuroscience (CCN), MIT.

2023

  • Journal | AI
    Xie, H. (2023). The promising future of cognitive science and artificial intelligence. Nature Reviews Psychology.
  • Conference | Think-Aloud | Decision
    Xie, H., Xiong, H., & Wilson, R. C. (2023). Text2Decision: Decoding Latent Variables in Risky Decision Making from Think Aloud Text. NeurIPS 2023 AI for Science Workshop.
  • Conference | Think-Aloud | LLM
    Xie, H., Xiong, H., & Wilson, R. C. (2023). Computational introspection: Can large language models reveal cognitive algorithms from human language? Poster session presented at the 5th Chinese Computational and Cognitive Neuroscience Conference, Beijing, China.

2022

  • Conference | Decision | Learning
    Guo, Y., Song, S., Xie, H., Gao, X., & Zhang, J. (2022, February). ARIMA and RNN for Selection Sequences Prediction in Iowa Gambling Task. In 2022 2nd International Conference on Artificial Intelligence and Signal Processing (AISP) (pp. 1-6). IEEE.

2020

  • Conference | Social | Learning
    Song, S.*, Xie, H.*, Speekenbrink, M., Zhang, J., Gao, X., & Zhou, X. (2020, October). The computational basis of individuals' learning under uncertainty in groups with collective goals. Oral presentation at the Society for Neuroeconomics, Vancouver, Canada.

Blog

Essays and notes at the intersection of cognitive science and AI.

Performance Scales More Easily Than Insight

Scaling and its limits in computational cognitive science

How large behavioral datasets and powerful AI models can rapidly improve predictive performance while scientific understanding lags behind — and why data and knowledge bottlenecks matter for the future of cognitive science.

Collaborators

Mentor and Committee

Social Cognition

Think Aloud

Large Language Models and Neural Networks

Mentees

  • Zhenlong Zhang, Johns Hopkins University
  • Lan Pan
  • Yangtong Feng, Washington University in St. Louis

Contact

Email: hanboxie1997@gatech.edu

Address: 750 Ferst Drive, Atlanta, GA 30332

X | GitHub | Google Scholar