Improving Natural Language Understanding via Contrastive Learning Methods

Cheng, Pengyu

Improving Natural Language Understanding via Contrastive Learning Methods

Files

Cheng_duke_0066D_16267.pdf5.63 MB

Cheng_duke_0066D_17/Pengyu_Cheng_s_Thesis_supp_Part2_Part1.pdf798.79 KB

Date

2021

Authors

Cheng, Pengyu

Advisors

Carin, Lawrence

Repository Usage Stats

342
views

777
downloads

Abstract

Natural language understanding (NLU) is an essential but challenging task in Natural Language Processing (NLP), aiming to automatically extract and understand the semantic information from raw text or voice data. Among the previous NLU solutions, representation learning methods have recently become the mainstream, which maps textual data into low-dimensional vector spaces for downstream tasks. With the development of deep neural networks, text representation learning has achieved state-of-the-art performance on plenty of NLP scenarios.

Although text representation learning methods with large-scale network encoders have shown significant empirical gains, many essential properties of the text encoders remain unexplored, which hinders models' further application into real-world scenarios: (1) the high computational complexity of the large-scale deep networks limits text encoders to be applied on a broader range of devices, especially on low calculation-ability resources; (2) the mechanic of networks is agnostic, limiting the control of the latent representations for downstream tasks; (3) representation learning methods are data-driven, lead to inherent social bias problems with unbalanced data.

To address the problems above in deep text encoders, I proposed a series of effective contrastive learning methods, which supervise the encoders by enlarging the difference between positive and negative data sample pairs. In this thesis, I first present a theoretical contrastive learning tool, which bridges the contrastive learning methods and the mutual information in information theory. Then, I apply contrastive learning into several NLU scenarios to improve the text encoders' effectiveness, interpretability, and fairness.

Type

Dissertation

Department

Electrical and Computer Engineering

Subjects

Computer engineering, Contrastive Learning, Information theory, Machine learning, Natural Lauguage Processing, Neural network

Permalink

https://hdl.handle.net/10161/23119

Citation

Cheng, Pengyu (2021). Improving Natural Language Understanding via Contrastive Learning Methods. Dissertation, Duke University. Retrieved from https://hdl.handle.net/10161/23119.

Collections

Dissertations

Full item page

Dukes student scholarship is made available to the public using a Creative Commons Attribution / Non-commercial / No derivative (CC-BY-NC-ND) license.

Improving Natural Language Understanding via Contrastive Learning Methods

Files

Date

Authors

Advisors

Journal Title

Journal ISSN

Volume Title

Repository Usage Stats

Abstract

Type

Department

Description

Provenance

Subjects

Citation

Permalink

Citation

Collections