"Neural Memory for Reinforcement Learning and Temporal Credit Assignmen" by Shanna Kathleen Murray

Neural Memory for Reinforcement Learning and Temporal Credit Assignment

Date of Award

Spring 2023

Document Type

Dissertation

Degree Name

Doctor of Philosophy (PhD)

Department

Neuroscience

First Advisor

Seo, Hyojung

Abstract

In this thesis, I describe results from three sets of experiments investigating the neural basis of memory used for decision-making and temporal credit assignment (TCA). In the first study, I ask whether mnemonic working memory representations are encoded in a similar way to mnemonic representations used for reinforcement learning. I perform novel analysis of previously collected data from dorsolateral prefrontal (DLPFC) and lateral intraparietal cortex (LIP), where the same neurons were recorded during multiple tasks. I find evidence for overlap of mnemonic encoding between working memory and decision-making. In the second study, I investigate the temporal dynamics of neural memory for TCA through delays. I develop a paradigm based on the matching pennies task in which variable delays occurred between choices and outcomes. I analyze neural activity from dorsomedial prefrontal cortex (DMPFC), DLPFC, and dorsal striatum. I find evidence that DLPFC may uniquely encode a neural eligibility trace signal through temporal delays, as choice-related activity was correlated with reduced behavioral credit assignment following longer delays in DLPFC only. In the third and final study, I present the lagged bandits paradigm, which was designed to study TCA under interference. I find that neither DMPFC nor dorsal striatum encodes the hypothesized dynamic eligibility trace which would be computationally optimal to solve the credit assignment problem despite interference from irrelevant actions and outcomes.

This document is currently not available here.

Share

COinS