PenPC: A two-step approach to estimate the skeletons of high-dimensional directed acyclic graphs.

Loading...

Date

2016-03

Journal Title

Journal ISSN

Volume Title

Repository Usage Stats

257
views
252
downloads

Citation Stats

Abstract

Estimation of the skeleton of a directed acyclic graph (DAG) is of great importance for understanding the underlying DAG and causal effects can be assessed from the skeleton when the DAG is not identifiable. We propose a novel method named PenPC to estimate the skeleton of a high-dimensional DAG by a two-step approach. We first estimate the nonzero entries of a concentration matrix using penalized regression, and then fix the difference between the concentration matrix and the skeleton by evaluating a set of conditional independence hypotheses. For high-dimensional problems where the number of vertices p is in polynomial or exponential scale of sample size n, we study the asymptotic property of PenPC on two types of graphs: traditional random graphs where all the vertices have the same expected number of neighbors, and scale-free graphs where a few vertices may have a large number of neighbors. As illustrated by extensive simulations and applications on gene expression data of cancer patients, PenPC has higher sensitivity and specificity than the state-of-the-art method, the PC-stable algorithm.

Department

Description

Provenance

Subjects

DAG, High dimensional, Log penalty, PC-algorithm, Penalized regression, Skeleton, Biomarkers, Tumor, Breast Neoplasms, Computer Simulation, Data Interpretation, Statistical, Female, Gene Expression Profiling, Genetic Markers, Genetic Predisposition to Disease, Humans, Models, Statistical, Neoplasm Proteins, Prevalence, Reproducibility of Results, Risk Factors, Sensitivity and Specificity

Citation

Published Version (Please cite this version)

10.1111/biom.12415

Publication Info

Ha, Min Jin, Wei Sun and Jichun Xie (2016). PenPC: A two-step approach to estimate the skeletons of high-dimensional directed acyclic graphs. Biometrics, 72(1). pp. 146–155. 10.1111/biom.12415 Retrieved from https://hdl.handle.net/10161/10825.

This is constructed from limited available data and may be imprecise. To cite this article, please review & use the official citation provided by the journal.

Scholars@Duke

Xie

Jichun Xie

Associate Professor of Biostatistics & Bioinformatics

Unless otherwise indicated, scholarly articles published by Duke faculty members are made available here with a CC-BY-NC (Creative Commons Attribution Non-Commercial) license, as enabled by the Duke Open Access Policy. If you wish to use the materials in ways not already permitted under CC-BY-NC, please consult the copyright owner. Other materials are made available here through the author’s grant of a non-exclusive license to make their work openly accessible.