HIATUS

Human Interpretable Attribution of Text using Underlying Structure

Intelligence Summary

The HIATUS program aims to develop novel human-usable systems for attributing authorship and protecting author privacy. Authorship attribution capabilities address many Intelligence Community (IC) needs, including combating sophisticated malicious information campaigns online and identifying counterintelligence risks. Authorship privacy capabilities protect authors whose writing, if attributed, could place them in danger.

Summary

Humans and machines produce vast amounts of text content every day. Text contains linguistic features that can reveal author identity. To support and protect the IC mission, the HIATUS program’s objective is to develop multi-language-capable tools to attribute authorship and protect author privacy. These tools must implement novel explainable Artificial Intelligence techniques to provide trustworthy and verifiable results to human users regardless of author background or document genre, topic, and length.

The HIATUS program casts authorship attribution and privacy as different aspects of the same underlying challenge: understanding author-level linguistic variation by elucidating stable identifiers of individual authors across diverse types of text. The program places Performers’ authorship attribution and privacy systems in competition with one another. Performer teams compete to generate higher-fidelity representations that capture individual authors’ unique linguistic fingerprints.

Performer systems are submitted to the HIATUS Testing & Evaluation (T&E) teams for blind evaluation against opponent team systems on a sequestered dataset of multilingual documents representing diverse text and author characteristics. Attribution systems are evaluated on their ability to match items by the same author in large collections, while privacy systems are evaluated on their ability to thwart attribution systems. System explainability will be evaluated using a protocol developed by Performers, T&E teams, and Government partners at the beginning of the program.
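As an illustration of the matching task described above, the following is a minimal sketch of how an attribution system might be scored on ranking same-author documents. It is not the HIATUS evaluation protocol or any Performer's method: the character n-gram features, the cosine similarity, the mean reciprocal rank metric, and all function names are assumptions chosen for simplicity.

```python
from collections import Counter
from math import sqrt


def char_ngrams(text: str, n: int = 3) -> Counter:
    """Count overlapping character n-grams (an assumed, simple stylometric feature)."""
    text = text.lower()
    return Counter(text[i:i + n] for i in range(len(text) - n + 1))


def cosine(a: Counter, b: Counter) -> float:
    """Cosine similarity between two sparse count vectors; 0.0 if either is empty."""
    dot = sum(count * b[gram] for gram, count in a.items())
    norm_a = sqrt(sum(c * c for c in a.values()))
    norm_b = sqrt(sum(c * c for c in b.values()))
    return dot / (norm_a * norm_b) if norm_a and norm_b else 0.0


def rank_candidates(query: str, candidates: dict) -> list:
    """Return candidate document IDs ordered from most to least similar to the query."""
    q = char_ngrams(query)
    scores = {doc_id: cosine(q, char_ngrams(text)) for doc_id, text in candidates.items()}
    return sorted(scores, key=scores.get, reverse=True)


def mean_reciprocal_rank(queries: dict, candidates: dict, authors: dict) -> float:
    """Average 1/rank of the first same-author candidate per query.

    queries:    query_id -> (author, text)
    candidates: doc_id -> text
    authors:    doc_id -> author
    A query with no same-author candidate contributes 0.
    """
    if not queries:
        return 0.0
    total = 0.0
    for author, text in queries.values():
        ranking = rank_candidates(text, candidates)
        for position, doc_id in enumerate(ranking, start=1):
            if authors[doc_id] == author:
                total += 1.0 / position
                break
    return total / len(queries)
```

Under the same assumptions, a privacy system could be scored by rewriting the query texts and measuring how far the attribution score drops relative to the unmodified queries.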

The HIATUS program began in late 2022 and has a duration of 45 months. The program comprises three phases: an initial 21-month phase followed by two 12-month phases.

HIATUS Diagram

The HIATUS vision: A combined authorship attribution and privacy system that can be trusted and audited by human operators.

Development dataset

The HIATUS development dataset is available for research purposes and can be requested from ARLIS UMD by contacting hiatus_data@umd.edu. Official documentation can be accessed here.

VIRTUAL PROPOSERS' DAY INFORMATION:

Sam.Gov Reference

HIATUS Teaming Information

Related Publications

To access HIATUS program-related publications, please visit Google Scholar.

Contact Information

Program Manager

Dr. Timothy McKinnon

timothy.mckinnon@iarpa.gov

301-243-2084

Broad Agency Announcement (BAA)

Link(s) to BAA

IARPA-BAA-22-01

Solicitation Status

CLOSED

Proposers' Day Date

January 19, 2022

BAA Release Date

February 25, 2022

Proposal Due Date

April 18, 2022

Program Summary

HIATUS Summary

Testing and Evaluation Partners

Lawrence Livermore National Laboratory (U.S. Department of Energy)
Pacific Northwest National Laboratory (U.S. Department of Energy)
University of Maryland’s Applied Research Laboratory for Intelligence and Security

Prime Performers

Raytheon BBN
University of Southern California, Information Sciences Institute
University of Pennsylvania

Additional Information

IARPA-BAA-22-01 Q&A (round 1)

IARPA-BAA-22-01 Q&A (round 2)

IARPA-BAA-22-01 BAA Amendment 001

IARPA-BAA-22-01 BAA Amendment 002