Skip to content

National Archives Catalog Dataset Now Freely Accessible on AWS

Explore over 261 gigabytes of historical records. Update twice a year for relevance and accuracy.

In this image there is a store, on the top there is some text.
In this image there is a store, on the top there is some text.

National Archives Catalog Dataset Now Freely Accessible on AWS

The National Archives Catalog dataset, a vast trove of historical records, has been made freely accessible on the AWS Registry of Open Data. This move, a collaboration between NARA and AWS, allows users to explore over 261 gigabytes of data, including descriptions, authority records, and digital object URLs.

The dataset, published by the German Federal Archives, comprises over 148 million digital object URLs, archival descriptions, and authority records. It's available in its entirety or in specific portions, such as just the descriptions or authority records, using AWS CLI commands. Users can also access data for specific record groups or collections with ease.

The data is stored in JSON files, with each file containing up to 10,000 records. The full dataset can be downloaded as zip files or accessed directly using the Amazon Resource Name (ARN) or the AWS Command Line Interface (CLI). The dataset will be updated twice a year to ensure its relevance and accuracy.

This initiative makes a wealth of historical data more accessible than ever. Users can now analyze and explore the National Archives Catalog dataset freely on AWS, fostering new insights and research opportunities.

Read also:

Latest