Toward Trustworthy Machine Learning with Blackbox and Whitebox Methods

dc.contributor.advisor

Li, Hai

dc.contributor.author

Qiao, Ximing

dc.date.accessioned

2023-06-08T18:22:04Z

dc.date.available

2023-06-08T18:22:04Z

dc.date.issued

2023

dc.department

Electrical and Computer Engineering

dc.description.abstract

With the growing applications of machine learning (ML) in high-stake areas such as autonomous driving, medical assistance, and financial prediction, building trustworthy ML models with reliable performance in novel situations becomes increasingly important. While most existing ML methods achieve good averaged performance on standard test data, their worst-case performance on adversarial or out-of-distribution data, both common in real-world scenes, can be arbitrarily bad. This dissertation discusses blackbox and whitebox methods, as short-term and long-term solutions respectively, to the trustworthy issue.The blackbox methods consider immediate remedies to existing ML systems, treat such systems as black boxes, and aim to wrap them with an extra layer of protection against common adversaries. Two specific attack settings are discussed, where attackers either modify images with small stickers, or poison a small portion of training data to inject backdoors. The proposed solutions include a neural-guided sticker reverse engineering technique and an ensemble training method based on a novel backdoor detection code. While being universal to all types of ML systems, the blackbox methods also require strong assumptions of the attack. Next, the whitebox methods explore new families of ML models that mimic human's reasoning capability, generalize to open domains, and are trustworthy by design. A novel neural representation of probabilistic programs extends existing neural networks to capture complex probabilistic knowledge of the world and perform inference. A reinforcement learning-inspired inference algorithm addresses the efficiency issue in a single input-output setting. Although still difficult to handle real-world high-dimension signals, the initial results demonstrate the potential of such methods as a long-term solution to fundamentally address the challenging trustworthy problem.

dc.identifier.uri

https://hdl.handle.net/10161/27675

dc.subject

Computer engineering

dc.subject

Machine learning

dc.subject

Neural Backdoor

dc.subject

Probabilistic programming

dc.subject

Trustworthy

dc.title

Toward Trustworthy Machine Learning with Blackbox and Whitebox Methods

dc.type

Dissertation

Files

Original bundle

Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
Qiao_duke_0066D_17260.pdf
Size:
4.39 MB
Format:
Adobe Portable Document Format

Collections