Entry Files

Procedures for converting single pages into Encyclopedia entries.

ABBYY FineReader generates a separate file for each page of the Encyclopedia. After we convert these files to TEI (saved in the 3-tei folder), each one still holds a single page. The project, however, needs one file per entry, not one file per page.

To solve this problem, we run a Python script that concatenates the individual page files into a single file, segments that file by entry, and writes each entry to its own new file in the digital-edition/xml folder.
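
The concatenate-and-segment step described above can be sketched roughly as follows. This is a minimal illustration, not the project's actual script. It makes several assumptions: that page files sort into reading order by filename, that every entry begins with an opening tag of the form <div type="entry" (a hypothetical marker), and that output files are named entry-0001.xml and so on (also hypothetical). It also ignores any per-page wrapper markup such as TEI headers.

```python
import re
from pathlib import Path

# Assumption: every entry starts with this opening tag. The real marker
# used in the project's TEI files may differ.
ENTRY_START = re.compile(r'<div type="entry"')


def concatenate_pages(page_texts):
    """Join the text of the page files, in order, into one string."""
    return "\n".join(page_texts)


def split_entries(text):
    """Cut the combined text at each entry marker; return one string per entry."""
    starts = [m.start() for m in ENTRY_START.finditer(text)]
    bounds = starts + [len(text)]
    # Anything before the first marker is dropped; each slice runs from one
    # marker up to the next (or to the end of the text).
    return [text[a:b].strip() for a, b in zip(bounds, bounds[1:])]


def build_entry_files(pages_dir, out_dir):
    """Read page files from pages_dir and write one file per entry to out_dir."""
    pages = sorted(Path(pages_dir).glob("*.xml"))  # assumes sortable filenames
    combined = concatenate_pages(p.read_text(encoding="utf-8") for p in pages)
    out = Path(out_dir)
    out.mkdir(parents=True, exist_ok=True)
    entries = split_entries(combined)
    for i, entry in enumerate(entries, start=1):
        # Hypothetical naming scheme for the entry files.
        (out / f"entry-{i:04d}.xml").write_text(entry, encoding="utf-8")
    return len(entries)
```

Under these assumptions, a call such as build_entry_files("3-tei", "digital-edition/xml") would read the page files and write the entry files. Because the pages are joined before they are split, an entry that runs across a page break ends up whole in a single file.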

These new entry files need to be carefully checked and validated. To do this, we use Oxygen XML Editor.
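
Before opening a batch of entry files in Oxygen, it can help to flag any that are not even well-formed XML. The sketch below is an optional pre-check and not part of the workflow described here. It uses Python's standard xml.etree.ElementTree parser, and the function name is hypothetical. Well-formedness is a weaker test than validation against the TEI schema, so it does not replace the Oxygen check.

```python
import xml.etree.ElementTree as ET


def well_formedness_error(xml_text):
    """Return None if xml_text parses as XML, otherwise the parser's message."""
    try:
        ET.fromstring(xml_text)
    except ET.ParseError as err:
        # The message includes the line and column of the first problem found.
        return str(err)
    return None
```

A file whose text makes this function return a message has a structural problem, such as an unclosed tag, that is worth fixing first.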