Intelligence Value
The BETTER program aims to dramatically compress the information discovery cycle for Intelligence Community analysts by designing systems that extract personalized, mission-relevant semantic information from text and leverage this information to substantially improve search capabilities
Summary
Massive volumes of multilingual, unstructured text present an overwhelming challenge for analysts to classify, triage, and examine. Tools and methods are needed to help analysts automatically map entities and events in text documents and use this information to perform rapid search and triage. The BETTER program is leveraging entity and event identification, text search, and human-in-the-loop computation to create technologies that enable analysts to extract and retrieve highly-detailed, personalized knowledge across diverse domains and languages. BETTER’s new tools and methods will also automatically tag incoming information, according to the analysts’ pre-determined knowledge, and broaden the search criteria based on previously identified entities, events and relationships between entities, as well as human feedback. One objective is to use existing knowledge to compress the discovery cycle. Another objective is to minimize the amount of English training data needed to develop functional systems, with no training data provided in the foreign languages covered.
Information retrieval capabilities will be evaluated by the systems’ ability to return a diverse set of relevant documents in an order that optimizes the user’s experience, as well as identify the critical events and entities in them. Systems incorporate human feedback to drive system personalization. BETTER was launched on October 1, 2019 and is a 42-month long program (end date: 3/31/2023).
Publicly Available Program Artifacts
The BETTER program has created several natural language processing datasets and applications, advancing what’s possible in multi-language information extraction and information retrieval. For an 11x17 PDF of the listings of publicly available resources released by BETTER Performers, click on the image below.
You can also find a direct link to datasets on NIST here: Access publicly released BETTER datasets