Computational approaches to cognition, decision-making, and human-centered AI.
I am a Ph.D. student in Psychology at the Georgia Institute of Technology, specializing in computational cognitive science. My work asks how language, behavior, and computational models can reveal the hidden structure of human thought.
My thesis uses large language models to analyze think-aloud data from decision-making and learning tasks. Rather than treating behavior as only button presses or choices, I use participants' verbal reports as a richer window into strategies, beliefs, and latent cognitive processes. I am especially interested in when LLMs can help measure these processes reliably, and where human judgment remains essential.
I also study AI systems through ideas from psychology and neuroscience: how models explore, reason, explain decisions, and interact with people. More broadly, I am interested in AI-assisted scientific discovery that expands, rather than replaces, careful empirical work.
Before Georgia Tech, I earned an M.A. from the University of Arizona and spent three years as a full-time research assistant at Peking University's CBCS. From March to July 2025, I visited Tom Griffiths' CoCoSci Lab at Princeton University as a Visiting Student Research Collaborator.
Research Highlights
Selected themes connecting cognitive science, language, behavior, and AI.
Modeling Thought from Think-Aloud Data
The think-aloud protocol asks participants to verbalize their thoughts while they perform psychological tasks. Traditional work has mostly relied on behavioral outputs (often button presses) to infer latent cognitive processes. In many cases, candidate cognitive models are proposed and tested by researchers, which can limit the hypothesis space and introduce bias. By directly analyzing participants' verbal reports, we gain a richer and more direct view of cognition during task performance. However, most prior think-aloud research depends on manual coding by experts, which is labor-intensive, subjective, and difficult to scale.
Recent advances in LLMs make it possible to revisit this classic protocol with stronger computational tools. LLMs can help quantify, interpret, and even predict subsequent behavior from think-aloud language. Our work evaluates when and how these models can be used reliably, with the goal of building a more systematic and scalable framework for studying human thought processes.
Representative publications:
Xie, H., Xiong, H., & Wilson, R. C. (2023). Text2Decision: Decoding Latent Variables in Risky Decision Making from Think Aloud Text. NeurIPS 2023 AI for Science Workshop.
Xie, H., Xiong, H., & Wilson, R. C. (2024). From Strategic Narratives to Code-Like Cognitive Models: An LLM-Based Approach in A Sorting Task. First Conference on Language Modeling (COLM).
Xie, H.†, Jagadish, A. K., Pan, L., & Wilson, R. C. (2026). Think-aloud reshapes automated cognitive model discovery beyond behavior. arXiv preprint arXiv:2605.05091. https://doi.org/10.48550/arXiv.2605.05091 (submitted).
Zhang, Z.*, Xie, H.*, Baker, T., Peters, M., & Wilson, R. C. (2025). Linking strategies to think aloud in a stochastic learning task. In Proceedings of the Annual Meeting of the Cognitive Science Society.
Reverse Engineering Human Thoughts
Human thought is central to intelligence, yet it is difficult to define, measure, and model. A core challenge in both cognitive science and AI is to characterize thought processes across tasks, identify shared principles, and generalize those principles to make useful predictions. The difficulty is that thoughts are often implicit, while language is diverse and context-dependent. As a result, verbal reports are informative but still incomplete reflections of internal cognition.
Instead of focusing only on the forward direction (how thoughts generate behavior), this project emphasizes inverse inference: given observed behavior and related measurements, can we reconstruct plausible underlying thoughts? The broader goal is to build a stronger, more general bridge between behavior and cognition. This direction also supports a human-centered understanding of machine reasoning. If the computations of complex systems (e.g., AlphaGo-like models) can be approximated by human-trained explanatory models, we may be able to describe model reasoning in natural language that is useful for teaching, interpretation, and collaboration.
This project began during my Princeton visit and remains an active research direction.
Representative publications:
Zhu, J.-Q.*, Xie, H.*, Arumugam, D., Wilson, R. C., & Griffiths, T. L. (2025). Using reinforcement learning to train large language models to explain human decisions. arXiv preprint arXiv:2505.11614.
Human Insights for Artificial Intelligence
This project examines AI through concepts from psychology and neuroscience. By comparing strengths and weaknesses of AI and human intelligence, we can design models that are both more capable and more interpretable. Beyond technical performance, I am interested in societal value: systems that support human decision-making, education, and collaboration. I also explore how people can learn from advanced AI models when we build the right frameworks to analyze and communicate their internal computations.
Representative publications:
Pan, L.*, Xie, H.*†, & Wilson, R. C. (2025). Large Language Models Think Too Fast To Explore Effectively. arXiv preprint arXiv:2501.18009. NeurIPS 2025 Poster.
Accelerating Discovery in Cognitive Science
The human mind is deeply complex. Although thoughts, emotions, and actions are part of everyday experience, formally describing and predicting cognition remains a major scientific challenge. Many cognitive theories are grounded in human intuition and then tested through experiments and computational models. These approaches are powerful, but they can remain constrained by the original hypothesis space.
In the AI era, there is an opportunity to rethink discovery pipelines in cognitive science and psychology. LLMs bring broad knowledge and strong inductive biases, and modern reasoning models can perform at levels that sometimes rival expert intuition. A central question for my work is whether we can build AI-assisted workflows that help discover new behavioral phenomena, generate computational models, and propose testable theories while reducing avoidable human bias. I view this as complementary to, not a replacement for, careful empirical research.
Representative publications:
Xie, H., Xiong, H., & Wilson, R. C. (2023). Text2Decision: Decoding Latent Variables in Risky Decision Making from Think Aloud Text. NeurIPS 2023 AI for Science Workshop.
Xie, H., Xiong, H., & Wilson, R. C. (2024). From Strategic Narratives to Code-Like Cognitive Models: An LLM-Based Approach in A Sorting Task. First Conference on Language Modeling (COLM).
Xie, H.†, Jagadish, A. K., Pan, L., & Wilson, R. C. (2026). Think-aloud reshapes automated cognitive model discovery beyond behavior. arXiv preprint arXiv:2605.05091. https://doi.org/10.48550/arXiv.2605.05091 (submitted).
Zhu, J.-Q.*, Xie, H.*, Arumugam, D., Wilson, R. C., & Griffiths, T. L. (2025). Using reinforcement learning to train large language models to explain human decisions. arXiv preprint arXiv:2505.11614. ICLR 2026.
Xie, H.*, & Zhu, J*. (2025, July 12). Centaur May Have Learned a Shortcut that Explains Away Psychological Tasks. https://doi.org/10.31234/osf.io/u7z4t_v1 (submitted).
Publications
* Denotes equal contribution, † Denotes Correspondence, Underscore denotes mentee. Use topic filters to navigate.
2026
PreprintThink-AloudLLM
Xie, H.†, Jagadish, A. K., Pan, L., & Wilson, R. C. (2026). Think-aloud reshapes automated cognitive model discovery beyond behavior. arXiv preprint arXiv:2605.05091. https://doi.org/10.48550/arXiv.2605.05091 (submitted).
2025
JournalDecisionSocial
Qiu, S., Tang, Y., Yu, H., Xie, H., Dreher, J. C., Hu, Y., & Zhou, X. (2025). Toward a computational understanding of bribe-taking behavior. Annals of the New York Academy of Sciences.
ConferenceLLMDecision
Zhu, J.-Q.*, Xie, H.*, Arumugam, D., Wilson, R. C., & Griffiths, T. L. (2025). Using reinforcement learning to train large language models to explain human decisions. arXiv preprint arXiv:2505.11614. ICLR 2026.
ConferenceLLMDecision
Pan, L.*, Xie, H.*†, & Wilson, R. C. (2025). Large Language Models Think Too Fast To Explore Effectively. arXiv preprint arXiv:2501.18009. NeurIPS 2025 Poster.
ConferenceLLMAI
Xie, H.†, Zhu, J. Q., Xiong, H. D., Wilson, R., & Griffiths, T. (2025). Reasoning Across Minds and Machines. In Proceedings of the Annual Meeting of the Cognitive Science Society (Vol. 47).
ConferenceThink-AloudLearning
Zhang, Z.*, Xie, H.*, Baker, T., Peters, M., & Wilson, R. C. (2025). Linking strategies to think aloud in a stochastic learning task. In Proceedings of the Annual Meeting of the Cognitive Science Society.
PreprintThink-AloudLLM
Xie, H.*, & Zhu, J*. (2025, July 12). Centaur May Have Learned a Shortcut that Explains Away Psychological Tasks. https://doi.org/10.31234/osf.io/u7z4t_v1 (submitted).
Fang, Z., Zhao, M., Xu, T., Li, Y., Xie, H., Quan, P., ... & Zhang, R. Y. (2024). Individuals with anxiety and depression use atypical decision strategies in an uncertain world. eLife, 13.
ConferenceThink-AloudLLM
Xie, H., Xiong, H., & Wilson, R. C. (2024). From Strategic Narratives to Code-Like Cognitive Models: An LLM-Based Approach in A Sorting Task. First Conference on Language Modeling (COLM).
ConferenceThink-AloudDecision
Xie, H., Xiong, H., & Wilson, R. C. (2024). Evaluating Predictive Performance and Learning Efficiency of Large Language Models with Think Aloud in Risky Decision Making. Computational Cognitive Neuroscience (CCN), MIT.
2023
JournalAI
Xie, H. (2023). The promising future of cognitive science and artificial intelligence. Nat Rev Psychology.
ConferenceThink-AloudDecision
Xie, H., Xiong, H., & Wilson, R. C. (2023). Text2Decision: Decoding Latent Variables in Risky Decision Making from Think Aloud Text. NeurIPS 2023 AI for Science Workshop.
ConferenceThink-AloudLLM
Xie, H., Xiong, H., & Wilson, R. C. (2023). Computational introspection: Can large language models reveal cognitive algorithms from human language? Poster session presented at the 5th Chinese Computational and Cognitive Neuroscience Conference, Beijing, China.
2022
ConferenceDecisionLearning
Guo, Y., Song, S., Xie, H., Gao, X., & Zhang, J. (2022, February). ARIMA and RNN for Selection Sequences Prediction in Iowa Gambling Task. In 2022 2nd International Conference on Artificial Intelligence and Signal Processing (AISP) (pp. 1-6). IEEE.
2020
ConferenceSocialLearning
Song, S*., Xie, H.*., Speekenbrink, M., Zhang, J., Gao, X., & Zhou, X. (2020, October). The computational basis of individuals' learning under uncertainty in groups with collective goals. Oral presentation at the Society for Neuroeconomics, Vancouver, Canada.
Blog
Essays and notes at the intersection of cognitive science and AI.
Performance Scales More Easily Than Insight
Scaling and its limits in computational cognitive science
How large behavioral datasets and powerful AI models can rapidly improve predictive performance while scientific understanding lags behind — and why data and knowledge bottlenecks matter for the future of cognitive science.