Automated Metadata Procedure

How we create subject headings for every entry file.

We rely on several Python scripts and the HIVE2 vocabulary server to generate subject terms for each entry automatically. The scripts assume that the main repository for each edition contains three parallel directories that share the same internal structure:

directory   description
entry       TEI files, one for each encyclopedia entry.
metadata    TXT and CSV files used to generate subject terms for each entry.
master      TEI files with their subject terms written into the TEI Header.

Once we establish the edition and letter to be processed, the script walks the tree inside whichever of these three directories holds the files it needs.
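
The path lookup described above might be sketched roughly as follows. This is an illustration only, not the project's actual script: the function names (parallel_paths, entry_files, counterpart), the per-letter subdirectory inside each of the three directories, and the *.xml file pattern are all assumptions made for the example. Only the directory names entry, metadata, and master come from the table.

```python
from pathlib import Path

# The three parallel directory names listed in the table.
PARALLEL_DIRS = ("entry", "metadata", "master")


def parallel_paths(edition_root, letter):
    """Map each parallel directory name to its folder for one letter.

    Assumed (hypothetical) layout: <edition_root>/<directory>/<letter>
    """
    root = Path(edition_root)
    return {name: root / name / letter for name in PARALLEL_DIRS}


def entry_files(edition_root, letter, pattern="*.xml"):
    """List the entry files for one letter, sorted by path.

    The "*.xml" pattern is an assumption about how TEI files are named.
    """
    entry_dir = parallel_paths(edition_root, letter)["entry"]
    return sorted(entry_dir.glob(pattern))


def counterpart(entry_file, edition_root, name):
    """Return the path at the same relative position in another parallel directory."""
    root = Path(edition_root)
    relative = Path(entry_file).relative_to(root / "entry")
    return root / name / relative
```

Because the three directories mirror one another, a single relative path is enough to locate an entry, its metadata, and its master file, which is why the sketch derives every location from the edition root and the letter.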