Automated Learning of Event Coding Dictionaries for Novel Domains with an Application to Cyberspace
dc.contributor.advisor | Ward, Michael D | |
dc.contributor.author | Radford, Benjamin James | |
dc.date.accessioned | 2017-01-04T20:34:59Z | |
dc.date.available | 2017-01-04T20:34:59Z | |
dc.date.issued | 2016 | |
dc.department | Political Science | |
dc.description.abstract | Event data provide high-resolution and high-volume information about political events. From COPDAB to KEDS, GDELT, ICEWS, and PHOENIX, event datasets and the frameworks that produce them have supported a variety of research efforts across fields and including political science. While these datasets are machine-coded from vast amounts of raw text input, they nonetheless require substantial human effort to produce and update sets of required dictionaries. I introduce a novel method for generating large dictionaries appropriate for event-coding given only a small sample dictionary. This technique leverages recent advances in natural language processing and deep learning to greatly reduce the researcher-hours required to go from defining a new domain-of-interest to producing structured event data that describes that domain. An application to cybersecurity is described and both the generated dictionaries and resultant event data are examined. The cybersecurity event data are also examined in relation to existing datasets in related domains. | |
dc.identifier.uri | ||
dc.subject | Political science | |
dc.subject | Cyber Conflict | |
dc.subject | Cybersecurity | |
dc.subject | Event Data | |
dc.subject | International relations | |
dc.subject | Machine learning | |
dc.subject | Natural language processing | |
dc.title | Automated Learning of Event Coding Dictionaries for Novel Domains with an Application to Cyberspace | |
dc.type | Dissertation |
Files
Original bundle
- Name:
- Radford_duke_0066D_13727.pdf
- Size:
- 2.78 MB
- Format:
- Adobe Portable Document Format