Privacy-Preserving Collaborative Prediction using Random Forests

dc.contributor.author

Giacomelli, Irene

dc.contributor.author

Jha, Somesh

dc.contributor.author

Kleiman, Ross

dc.contributor.author

Page, David

dc.contributor.author

Yoon, Kyonghwan

dc.date.accessioned

2019-06-25T14:13:26Z

dc.date.available

2019-06-25T14:13:26Z

dc.date.updated

2019-06-25T14:13:25Z

dc.description.abstract

We study the problem of privacy-preserving machine learning (PPML) for ensemble methods, focusing our effort on random forests. In collaborative analysis, PPML attempts to solve the conflict between the need for data sharing and privacy. This is especially important in privacy sensitive applications such as learning predictive models for clinical decision support from EHR data from different clinics, where each clinic has a responsibility for its patients' privacy. We propose a new approach for ensemble methods: each entity learns a model, from its own data, and then when a client asks the prediction for a new private instance, the answers from all the locally trained models are used to compute the prediction in such a way that no extra information is revealed. We implement this approach for random forests and we demonstrate its high efficiency and potential accuracy benefit via experiments on real-world datasets, including actual EHR data.

dc.identifier.uri

https://hdl.handle.net/10161/19038

dc.subject

cs.LG

dc.subject

cs.LG

dc.subject

stat.ML

dc.title

Privacy-Preserving Collaborative Prediction using Random Forests

dc.type

Journal article

pubs.organisational-group

School of Medicine

pubs.organisational-group

Duke

pubs.organisational-group

Biostatistics & Bioinformatics

pubs.organisational-group

Basic Science Departments

Files

Original bundle

Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
1811.08695v1.pdf
Size:
1.08 MB
Format:
Adobe Portable Document Format
Description:
Published version