Repositories
A guide to the repositories that store ocr-project data.
A repository is a collection of files and folders, like a file cabinet. All data for the Nineteenth-Century Knowledge Project resides in one of eight repositories.
name | location | contents |
---|---|---|
archive | HDD | Image files from multiple scans of different Encyclopedia Britannica editions |
eb03 | Google Drive | ocr-project files |
eb07 | Google Drive | ocr-project files |
eb09 | Google Drive | ocr-project files |
eb11 | Google Drive | ocr-project files |
information | Google Drive | General information used by the Knowledge Project |
metainfo | Google Drive | Informational material relevant to generating metadata for the entries |
outputs | GitHub | Output from the OCR process and from every later step in creating the master file, plus derivatives, code, and analytics |
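
For scripts that need to know where a given repository lives, the table above can be mirrored as a small lookup. This is only a sketch: the repository names and locations are copied from the table, but the `REPOSITORIES` mapping and the `group_by_location` helper are hypothetical names introduced here for illustration and are not part of the project's code.

```python
from collections import defaultdict

# Repository name -> storage location, copied from the table above.
# The mapping itself is a hypothetical convenience, not an existing project file.
REPOSITORIES = {
    "archive": "HDD",
    "eb03": "Google Drive",
    "eb07": "Google Drive",
    "eb09": "Google Drive",
    "eb11": "Google Drive",
    "information": "Google Drive",
    "metainfo": "Google Drive",
    "outputs": "GitHub",
}


def group_by_location(repos):
    """Return a dict mapping each location to the list of repositories stored there."""
    grouped = defaultdict(list)
    for name, location in repos.items():
        grouped[location].append(name)
    return dict(grouped)


if __name__ == "__main__":
    # Print one line per location, listing the repositories it holds.
    for location, names in group_by_location(REPOSITORIES).items():
        print(f"{location}: {', '.join(names)}")
```

Grouping by location makes it easy to see at a glance which repositories require access to the hard drive, to Google Drive, or to GitHub.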