For many cybersecurity problems, extracting information (e.g., entities, events and relations) from text and populating a knowledge base with the extracted information is a fundamental problem. So far, there have not been many resources (e.g., data and tools) specifically designed to support information extraction (IE) for cybersecurity. The lack of data and tools may significantly hinder our ability in detecting, analyzing and reporting emerging security risks effectively.
This project is specifically designed to address this challenge by focusing on three key areas: (1) curating a comprehensive data repository to support IE for cybersecurity; (2) developing advanced software tools to support IE and semantic analytics; and (3) developing a prototype system to demonstrate the utility of the resources we create.