Neuroprosthetic Decoder Training as Imitation Learning.

dc.contributor.author

Merel, Josh

dc.contributor.author

Carlson, David

dc.contributor.author

Paninski, Liam

dc.contributor.author

Cunningham, John P

dc.coverage.spatial

United States

dc.date.accessioned

2018-02-02T17:47:49Z

dc.date.available

2018-02-02T17:47:49Z

dc.date.issued

2018-02-02

dc.description.abstract

Neuroprosthetic brain-computer interfaces function via an algorithm which decodes neural activity of the user into movements of an end effector, such as a cursor or robotic arm. In practice, the decoder is often learned by updating its parameters while the user performs a task. When the user's intention is not directly observable, recent methods have demonstrated value in training the decoder against a surrogate for the user's intended movement. Here we show that training a decoder in this way is a novel variant of an imitation learning problem, where an oracle or expert is employed for supervised training in lieu of direct observations, which are not available. Specifically, we describe how a generic imitation learning meta-algorithm, dataset aggregation (DAgger), can be adapted to train a generic brain-computer interface. By deriving existing learning algorithms for brain-computer interfaces in this framework, we provide a novel analysis of regret (an important metric of learning efficacy) for brain-computer interfaces. This analysis allows us to characterize the space of algorithmic variants and bounds on their regret rates. Existing approaches for decoder learning have been performed in the cursor control setting, but the available design principles for these decoders are such that it has been impossible to scale them to naturalistic settings. Leveraging our findings, we then offer an algorithm that combines imitation learning with optimal control, which should allow for training of arbitrary effectors for which optimal control can generate goal-oriented control. We demonstrate this novel and general BCI algorithm with simulated neuroprosthetic control of a 26 degree-of-freedom model of an arm, a sophisticated and realistic end effector.

dc.identifier

https://www.ncbi.nlm.nih.gov/pubmed/27191387

dc.identifier

PCOMPBIOL-D-15-01907

dc.identifier.eissn

1553-7358

dc.identifier.uri

https://hdl.handle.net/10161/16068

dc.language

eng

dc.publisher

Public Library of Science

dc.relation.ispartof

PLoS Comput Biol

dc.relation.isversionof

10.1371/journal.pcbi.1004948

dc.subject

Algorithms

dc.subject

Arm

dc.subject

Brain-Computer Interfaces

dc.subject

Computational Biology

dc.subject

Computer Simulation

dc.subject

Humans

dc.subject

Learning

dc.subject

Robotics

dc.subject

Supervised Machine Learning

dc.subject

Task Performance and Analysis

dc.title

Neuroprosthetic Decoder Training as Imitation Learning.

dc.type

Journal article

duke.contributor.orcid

Carlson, David|0000-0003-1005-6385

pubs.author-url

https://www.ncbi.nlm.nih.gov/pubmed/27191387

pubs.begin-page

e1004948

pubs.issue

5

pubs.organisational-group

Basic Science Departments

pubs.organisational-group

Biostatistics & Bioinformatics

pubs.organisational-group

Civil and Environmental Engineering

pubs.organisational-group

Duke

pubs.organisational-group

Duke Clinical Research Institute

pubs.organisational-group

Institutes and Centers

pubs.organisational-group

Pratt School of Engineering

pubs.organisational-group

School of Medicine

pubs.publication-status

Published online

pubs.volume

12

Files

Original bundle

Now showing 1 - 2 of 2
Loading...
Thumbnail Image
Name:
1511.04156v2.pdf
Size:
926.43 KB
Format:
Adobe Portable Document Format
Loading...
Thumbnail Image
Name:
Neuroprosthetic Decoder Training as Imitation Learning.pdf
Size:
1.48 MB
Format:
Adobe Portable Document Format