Applying Machine Learning to Investigate Long Term Insect-Plant Interactions Preserved on Digitized Herbarium Specimens
dc.contributor.author | Meineke, EK | |
dc.contributor.author | Tomasi, C | |
dc.contributor.author | Yuan, S | |
dc.contributor.author | Pryer, KM | |
dc.date.accessioned | 2020-12-01T23:02:11Z | |
dc.date.available | 2020-12-01T23:02:11Z | |
dc.date.issued | 2020 | |
dc.date.updated | 2020-12-01T23:02:10Z | |
dc.description.abstract | <jats:title>Abstract</jats:title><jats:sec><jats:title>Premise of the study</jats:title><jats:p>Despite the economic importance of insect damage to plants, long-term data documenting changes in insect damage (‘herbivory’) and diversity are limited. Millions of pressed plant specimens are now available online for collecting big data on plant-insect interactions during the Anthropocene.</jats:p></jats:sec><jats:sec><jats:title>Methods</jats:title><jats:p>We initiated development of machine learning methods to automate extraction of herbivory data from herbarium specimens. We trained an insect damage detector and a damage type classifier on two distantly related plant species. We experimented with 1) classifying six types of herbivory and two control categories of undamaged leaf, and 2) detecting two of these damage categories for which several hundred annotations were available.</jats:p></jats:sec><jats:sec><jats:title>Results</jats:title><jats:p>Classification models identified the correct type of herbivory 81.5% of the time. The damage classifier was accurate for categories with at least one hundred test samples. We show anecdotally that the detector works well when asked to detect two types of damage.</jats:p></jats:sec><jats:sec><jats:title>Discussion</jats:title><jats:p>The classifier and detector together are a promising first step for the automation of herbivory data collection. We describe ongoing efforts to increase the accuracy of these models to allow other researchers to extract similar data and apply them to address a variety of biological hypotheses.</jats:p></jats:sec> | |
dc.identifier.issn | 2168-0450 | |
dc.identifier.uri | ||
dc.publisher | Cold Spring Harbor Laboratory | |
dc.relation.ispartof | Applications in plant sciences | |
dc.relation.isversionof | 10.1101/790899 | |
dc.title | Applying Machine Learning to Investigate Long Term Insect-Plant Interactions Preserved on Digitized Herbarium Specimens | |
dc.type | Journal article | |
duke.contributor.orcid | Pryer, KM|0000-0002-9776-6736 | |
pubs.begin-page | e11369 | |
pubs.issue | 6 | |
pubs.organisational-group | Trinity College of Arts & Sciences | |
pubs.organisational-group | Biology | |
pubs.organisational-group | Duke Science & Society | |
pubs.organisational-group | Duke | |
pubs.organisational-group | Initiatives | |
pubs.organisational-group | Institutes and Provost's Academic Units | |
pubs.volume | 8 |
Files
Original bundle
- Name:
- meinekeAps20.pdf
- Size:
- 1001.54 KB
- Format:
- Adobe Portable Document Format
- Description:
- Published version