Statistical Advances in Data Linkage and Model Evaluation

dc.contributor.advisor

Reiter, Jerome P.

dc.contributor.author

Binette, Olivier

dc.date.accessioned

2025-01-08T17:44:40Z

dc.date.issued

2024

dc.department

Statistical Science

dc.description.abstract

This dissertation is about statistical contributions to data linkage and model evaluation. The two subjects fall at the extremities of traditional model development, with data linkage used to enrich data fed into downstream models and analyses, and evaluation used to maximize the utility of deployed models. We report on five research projects where we developed generalizable statistical methodologies to solve important practical problems in these areas. This includes the evaluation of statistical models for the quantification of modern slavery, methods to estimate and monitor the generalization performance of entity resolution systems, a novel F-score optimization algorithm for bipartite record linkage, and the introduction of an estimands framework to improve the validity and practical usefulness of AI/ML evaluations.

dc.identifier.uri

https://hdl.handle.net/10161/31936

dc.rights.uri

https://creativecommons.org/licenses/by-nc-nd/4.0/

dc.subject

Statistics

dc.title

Statistical Advances in Data Linkage and Model Evaluation

dc.type

Dissertation

duke.embargo.months

2

duke.embargo.release

2025-03-08T17:44:40Z

Files

Original bundle

Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
Binette_duke_0066D_18152.pdf
Size:
2.06 MB
Format:
Adobe Portable Document Format

Collections