Cache

PNNL partnered with the U.S. Department of the Treasury and Amazon Web Services (AWS) to develop Cache, a cloud-based tool that allows the Treasury’s disparate data to be easily searched, translated, extracted, linked, and analyzed.

Collab

The U.S. Department of the Treasury is the preeminent entity in the U.S. government dealing with financial and monetary issues. As such, its expertise in these areas is bolstered by the vast amounts of financial (and other) data in its holdings.

Data comes in different forms and from different entities.

  • The Financial Crimes Enforcement Network collects information from banks when suspicious activities are detected.
  • The Office of Terrorist Financing and Financial Crime is home to Treasury attaches in embassies abroad who collect and send back information from partner nations.
  • The Office of Foreign Assets Control deals with implementing and enforcing sanctions, among other things.

The Treasury’s power has been used by many presidential administrations as an effective and measured approach when dealing with terrorists, nuclear proliferators, drug cartels, rogue leaders, and nations or individuals engaged in abhorrent behavior. However, there is much more to Treasury and its mission, including excellent research and analysis by the Office of Intelligence and Analysis. 

All told, the one thing all entities in Treasury have in common is the need for fast, reliable, and easily accessible data.

Adopting AWS Technology

In 2019, Treasury leadership decided to give its employees the IT systems, data, and tools they needed to be even more effective in their national security roles. This began with discussions with Congress, PNNL, and AWS.

Congress saw the need to augment Treasury’s capabilities and authorized funding to modernize Treasury’s processes. AWS’s role was to get data into a central location so it could be better utilized and where people could leverage AWS tools.

Cache-ing In on a New Tool 

PNNL’s tasks were more specific. After canvassing the landscape for what Treasury employees needed, PNNL began creating a tool called Cache. With Cache, Treasury employees will be able to search for and integrate needed data to accomplish their mission.

At the same time, PNNL vetted and procured relevant datasets for Treasury’s use. This included major financial datasets, including cryptocurrency holdings. Cache allowed these disparate data (with billions of records) to be easily searched. Other benefits of this tool include language translation, multilingual entity extraction, entity linking, record linkage, and graph analytics. PNNL has also been working directly with Treasury data scientists and analysts to provide specialized algorithms and workflows. This collaboration accomplishes mission needs while giving current Treasury employees better skills and knowledge.

Training the Next Generation 

Further, PNNL has created several training programs to provide the next cohort of data scientists with experience in this methodology. In particular, the Data Science Training program and the Distinguished Graduate Research Program were created in coordination with the University of Washington and North Carolina State University, respectively. The goals of both programs are similar—to support data science students who would like to work on real-world problems while gaining their advanced degrees and to expose them to national security challenges. Participants in these programs have opportunities to work on real Treasury challenges and possibly work with Treasury, or other U.S. government entities, in the future as interns or staff members.