If you work with the collections of the National Archives of Australia, you might find the RecordSearch section of the GLAM Workbench helpful. I’ve just updated the repository to add new options for running the notebooks, including 1-click installation on Reclaim Cloud. There’s also a few new notebooks.
New notebooks and datasets
- Harvest details of all series in RecordSearch – get details of all series registered in RecordSearch, also generates a summary dataset with the total number of items digitised, described and in each access category
- Exploring harvested series data – generates some basic statistics from the harvest of series data
- Summary data about all series in RecordSearch (15mb CSV) – contains basic descriptive information about all the series currently registered on RecordSearch (May 2021) as well as the total number of items described, digitised, and in each access category

Updated
I’ve started (but not completed) updating all the notebooks in this repository to use my new RecordSearch Data Scraper. The new scraper is simpler and more efficient, and enables me to get rid of a lot of boilerplate code. Updated notebooks include:
- Harvest items from a search in RecordSearch – save the results of an item search in RecordSearch as a downloadable dataset, you can also save images and PDFs from digitised files (PDF saving is new!)
- Harvest files with the access status of ‘closed’ – find out what we’re not allowed to see by harvesting details of ‘closed’ files
Other updates include:
- Python packages updated
- Integration with Reclaim Cloud allowing 1-click installation of the whole repository and environment
- Automatic creation of Docker images when the repository is updated
- Updated README and repository index with list of all notebooks
- Notebooks intended to run as apps now use Voila rather than Appmode for better integration with Jupyter Lab
-
requirements-unpinned.txt
added to repository for people who want to develop the notebooks in their own clean environment
Hope you find these changes useful! #dhhacks
This is a companion discussion topic for the original entry at https://updates.timsherratt.org/2021/05/24/naa-recordsearch-section.html