Advancing Deep Learning through Probability Engineering: A Pragmatic Paradigm for Modern AI

dc.contributor.advisor

Chen, Yiran

dc.contributor.author

Zhang, Jianyi

dc.date.accessioned

2025-07-02T19:03:52Z

dc.date.available

2025-07-02T19:03:52Z

dc.date.issued

2025

dc.department

Electrical and Computer Engineering

dc.description.abstract

Recent years have witnessed the rapid progression of deep learning, pushing us closer to the realization of AGI (Artificial General Intelligence). Probabilistic modeling is critical to many of these advancements, which provides a foundational framework for capturing data distributions. However, as the scale and complexity of AI applications grow, traditional probabilistic modeling faces escalating challenges, such as high-dimensional parameter spaces, heterogeneous data sources, and evolving real-world requirements, which often render classical approaches insufficiently flexible.

This paper proposes a novel concept, “Probability Engineering,” which treats the already-learned probability distributions within deep learning as engineering artifacts. Rather than merely fitting or inferring distributions, we actively modify and reinforce them to better address the diverse and evolving demands of modern AI. Specifically, Probability Engineering introduces novel techniques and constraints to refine existing probability distributions, improving their robustness, efficiency, adaptability, or trustworthiness.

We showcase this paradigm through a series of applications spanning Bayesian deep learning, Edge AI (including federated learning and knowledge distillation), and Generative AI (such as text-to-image generation with diffusion models and high-quality text generation with large language models). These case studies demonstrate how probability distributions—once treated as static objects—can be engineered to meet the diverse and evolving requirements of large-scale, data-intensive, and trustworthy AI systems. By systematically expanding and strengthening the role of probabilistic modeling, Probability Engineering paves the way for more robust, adaptive, efficient, and trustworthy deep learning solutions in today’s fast-growing AI era.

dc.identifier.uri

https://hdl.handle.net/10161/32780

dc.rights.uri

https://creativecommons.org/licenses/by-nc-nd/4.0/

dc.subject

Artificial intelligence

dc.subject

Computer science

dc.subject

Statistics

dc.title

Advancing Deep Learning through Probability Engineering: A Pragmatic Paradigm for Modern AI

dc.type

Dissertation

duke.embargo.months

0.01

duke.embargo.release

2025-07-08

Files

Original bundle

Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
Zhang_duke_0066D_18522.pdf
Size:
19.05 MB
Format:
Adobe Portable Document Format

Collections