A new phylogenetic data standard for computable clade definitions: the Phyloreference Exchange Format (Phyx).

dc.contributor.author

Vaidya, Gaurav

dc.contributor.author

Cellinese, Nico

dc.contributor.author

Lapp, Hilmar

dc.date.accessioned

2023-02-07T20:22:43Z

dc.date.available

2023-02-07T20:22:43Z

dc.date.issued

2022-01

dc.date.updated

2023-02-07T20:22:42Z

dc.description.abstract

To be computationally reproducible and efficient, integration of disparate data depends on shared entities whose matching meaning (semantics) can be computationally assessed. For biodiversity data one of the most prevalent shared entities for linking data records is the associated taxon concept. Unlike Linnaean taxon names, the traditional way in which taxon concepts are provided, phylogenetic definitions are native to phylogenetic trees and offer well-defined semantics that can be transformed into formal, computationally evaluable logic expressions. These attributes make them highly suitable for phylogeny-driven comparative biology by allowing computationally verifiable and reproducible integration of taxon-linked data against Tree of Life-scale phylogenies. To achieve this, the first step is transforming phylogenetic definitions from the natural language text in which they are published to a structured interoperable data format that maintains strong ties to semantics and lends itself well to sharing, reuse, and long-term archival. To this end, we developed the Phyloreference Exchange Format (Phyx), a JSON-LD-based text format encompassing rich metadata for all elements of a phylogenetic definition, and we created a supporting software library, phyx.js, to streamline computational management of such files. Together they form a foundation layer for digitizing and computing with phylogenetic definitions of clades.

dc.identifier

12618

dc.identifier.issn

2167-8359

dc.identifier.issn

2167-8359

dc.identifier.uri

https://hdl.handle.net/10161/26575

dc.language

eng

dc.publisher

PeerJ

dc.relation.ispartof

PeerJ

dc.relation.isversionof

10.7717/peerj.12618

dc.subject

Records

dc.subject

Biology

dc.subject

Phylogeny

dc.subject

Semantics

dc.subject

Software

dc.title

A new phylogenetic data standard for computable clade definitions: the Phyloreference Exchange Format (Phyx).

dc.type

Journal article

duke.contributor.orcid

Lapp, Hilmar|0000-0001-9107-0714

pubs.begin-page

e12618

pubs.organisational-group

Duke

pubs.organisational-group

Staff

pubs.publication-status

Published

pubs.volume

10

Files

Original bundle

Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
A new phylogenetic data standard for computable clade definitions the Phyloreference Exchange Format (Phyx).pdf
Size:
281.76 KB
Format:
Adobe Portable Document Format
Description:
Published version