ALERT: This system is being upgraded on Tuesday December 12. It will not be available
for use for several hours that day while the upgrade is in progress. Deposits to DukeSpace
will be disabled on Monday December 11, so no new items are to be added to the repository
while the upgrade is in progress. Everything should be back to normal by the end of
day, December 12.
Global optimality of softmax policy gradient with single hidden layer neural networks in the mean-field regime
Repository Usage Stats
93
views
views
22
downloads
downloads
Abstract
We study the problem of policy optimization for infinite-horizon discounted
Markov Decision Processes with softmax policy and nonlinear function
approximation trained with policy gradient algorithms. We concentrate on the
training dynamics in the mean-field regime, modeling e.g., the behavior of wide
single hidden layer neural networks, when exploration is encouraged through
entropy regularization. The dynamics of these models is established as a
Wasserstein gradient flow of distributions in parameter space. We further prove
global optimality of the fixed points of this dynamics under mild conditions on
their initialization.
Type
Journal articlePermalink
https://hdl.handle.net/10161/21649Collections
More Info
Show full item recordScholars@Duke
Andrea Agazzi
Assistant Research Professor of Mathematics
This author no longer has a Scholars@Duke profile, so the information shown here reflects
their Duke status at the time this item was deposited.
Jianfeng Lu
Professor of Mathematics
Jianfeng Lu is an applied mathematician interested in mathematical analysis and algorithm
development for problems from computational physics, theoretical chemistry, materials
science and other related fields.More specifically, his current research focuses include:Electronic
structure and many body problems; quantum molecular dynamics; multiscale modeling
and analysis; rare events and sampling techniques.
Alphabetical list of authors with Scholars@Duke profiles.

Articles written by Duke faculty are made available through the campus open access policy. For more information see: Duke Open Access Policy
Rights for Collection: Scholarly Articles
Works are deposited here by their authors, and represent their research and opinions, not that of Duke University. Some materials and descriptions may include offensive content. More info