Probabilistic Fréchet means for time varying persistence diagrams

Loading...

Date

2015-01-01

Journal Title

Journal ISSN

Volume Title

Repository Usage Stats

288
views
296
downloads

Citation Stats

Attention Stats

Abstract

© 2015, Institute of Mathematical Statistics. All rights reserved.In order to use persistence diagrams as a true statistical tool, it would be very useful to have a good notion of mean and variance for a set of diagrams. In [23], Mileyko and his collaborators made the first study of the properties of the Fréchet mean in (Dp, Wp), the space of persistence diagrams equipped with the p-th Wasserstein metric. In particular, they showed that the Fréchet mean of a finite set of diagrams always exists, but is not necessarily unique. The means of a continuously-varying set of diagrams do not themselves (necessarily) vary continuously, which presents obvious problems when trying to extend the Fréchet mean definition to the realm of time-varying persistence diagrams, better known as vineyards. We fix this problem by altering the original definition of Fréchet mean so that it now becomes a probability measure on the set of persistence diagrams; in a nutshell, the mean of a set of diagrams will be a weighted sum of atomic measures, where each atom is itself a persistence diagram determined using a perturbation of the input diagrams. This definition gives for each N a map (Dp)N→ℙ(Dp). We show that this map is Hölder continuous on finite diagrams and thus can be used to build a useful statistic on vineyards.

Department

Description

Provenance

Subjects

Citation

Published Version (Please cite this version)

10.1214/15-EJS1030

Publication Info

Munch, Elizabeth, Paul Bendich, Katharine Turner, Sayan Mukherjee, Jonathan Mattingly and John Harer (2015). Probabilistic Fréchet means for time varying persistence diagrams. Electronic Journal of Statistics, 9. pp. 1173–1204. 10.1214/15-EJS1030 Retrieved from https://hdl.handle.net/10161/10051.

This is constructed from limited available data and may be imprecise. To cite this article, please review & use the official citation provided by the journal.

Scholars@Duke

Bendich

Paul L Bendich

Adjunct Professor of Mathematics

I am a mathematician whose main research focus lies in adapting theory from ostensibly pure areas of mathematics, such as topology, geometry, and abstract algebra, into tools that can be broadly used in many data-centered applications.

My initial training was in a recently-emerging field called topological data analysis (TDA). I have been responsible for several essential and widely-used elements of its theoretical toolkit, with a particular focus on building TDA methodology for use on stratified spaces. Some of this work involves the creation of efficient algorithms, but much of it centers around theorem-proof mathematics, using proof techniques not only from algebraic topology, but also from computational geometry, from probability, and from abstract algebra.

Recently, I have done foundational work on TDA applications in several areas, including to neuroscience, to multi-target tracking, to multi-modal data fusion, and to a probabilistic theory of database merging. I am also becoming involved in efforts to integrate TDA within deep learning theory and practice.

I typically teach courses that connect mathematical principles to machine learning, including upper-level undergraduate courses in topological data analysis and more general high-dimensional data analysis, as well as a sophomore level course (joint between pratt and math) that serves as a broad introduction to machine learning and data analysis concepts.

Harer

John Harer

Professor Emeritus of Mathematics

Professor Harer's primary research is in the use of geometric, combinatorial and computational techniques to study a variety of problems in data analysis, shape recognition, image segmentation, tracking, cyber security, ioT, biological networks and gene expression.


Unless otherwise indicated, scholarly articles published by Duke faculty members are made available here with a CC-BY-NC (Creative Commons Attribution Non-Commercial) license, as enabled by the Duke Open Access Policy. If you wish to use the materials in ways not already permitted under CC-BY-NC, please consult the copyright owner. Other materials are made available here through the author’s grant of a non-exclusive license to make their work openly accessible.