Statistical Advances in Data Linkage and Model Evaluation
| dc.contributor.advisor | Reiter, Jerome P. | |
| dc.contributor.author | Binette, Olivier | |
| dc.date.accessioned | 2025-01-08T17:44:40Z | |
| dc.date.issued | 2024 | |
| dc.department | Statistical Science | |
| dc.description.abstract | This dissertation is about statistical contributions to data linkage and model evaluation. The two subjects fall at the extremities of traditional model development, with data linkage used to enrich data fed into downstream models and analyses, and evaluation used to maximize the utility of deployed models. We report on five research projects where we developed generalizable statistical methodologies to solve important practical problems in these areas. This includes the evaluation of statistical models for the quantification of modern slavery, methods to estimate and monitor the generalization performance of entity resolution systems, a novel F-score optimization algorithm for bipartite record linkage, and the introduction of an estimands framework to improve the validity and practical usefulness of AI/ML evaluations. | |
| dc.identifier.uri | ||
| dc.rights.uri | ||
| dc.subject | Statistics | |
| dc.title | Statistical Advances in Data Linkage and Model Evaluation | |
| dc.type | Dissertation | |
| duke.embargo.months | 2 | |
| duke.embargo.release | 2025-03-08T17:44:40Z |
Files
Original bundle
- Name:
- Binette_duke_0066D_18152.pdf
- Size:
- 2.06 MB
- Format:
- Adobe Portable Document Format