Automated Learning of Event Coding Dictionaries for Novel Domains with an Application to Cyberspace

dc.contributor.advisor

Ward, Michael D

dc.contributor.author

Radford, Benjamin James

dc.date.accessioned

2017-01-04T20:34:59Z

dc.date.available

2017-01-04T20:34:59Z

dc.date.issued

2016

dc.department

Political Science

dc.description.abstract

Event data provide high-resolution and high-volume information about political events. From COPDAB to KEDS, GDELT, ICEWS, and PHOENIX, event datasets and the frameworks that produce them have supported a variety of research efforts across fields and including political science. While these datasets are machine-coded from vast amounts of raw text input, they nonetheless require substantial human effort to produce and update sets of required dictionaries. I introduce a novel method for generating large dictionaries appropriate for event-coding given only a small sample dictionary. This technique leverages recent advances in natural language processing and deep learning to greatly reduce the researcher-hours required to go from defining a new domain-of-interest to producing structured event data that describes that domain. An application to cybersecurity is described and both the generated dictionaries and resultant event data are examined. The cybersecurity event data are also examined in relation to existing datasets in related domains.

dc.identifier.uri

https://hdl.handle.net/10161/13386

dc.subject

Political science

dc.subject

Cyber Conflict

dc.subject

Cybersecurity

dc.subject

Event Data

dc.subject

International relations

dc.subject

Machine learning

dc.subject

Natural language processing

dc.title

Automated Learning of Event Coding Dictionaries for Novel Domains with an Application to Cyberspace

dc.type

Dissertation

Files

Original bundle

Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
Radford_duke_0066D_13727.pdf
Size:
2.78 MB
Format:
Adobe Portable Document Format

Collections