Jump to main content
What is the Nineteenth-Century Knowledge Project?
About
Acknowledgements for all contributors.
With hundreds of thousands of files, the Nineteenth-Century Knowledge Project needs a clear means of organizing its data. We use specific naming conventions for all files and folders to order the material.
Edition-Section System
File organization depends on two basic folder types
archive Repository
Long-term storage of image files
kp1 Repository
The folder structure of kp1
kp2 Repository
The folder structure of kp2.
Setting Up the Repositories
Create local copies of the remote repositories
The procedures we use to get the best quality text recognition in ABBYY Fine Reader.
AFR Interface
Learn about the main elements of the program interface
Create an Image Collection
Organize image files for scanning.
Create an OCR-Project
How to create and manage an OCR-Project.
Settings
Recommended settings for all options in ABBYY FineReader
Draw Boxes
Manually creating text recognition boxes improves accuracy
Page Recognition
Excellent page recognition depends on preparing pages properly.
Save and Output
How to output your OCR results.
This introduction to Oxygen XML Editor shows you how to navigate the interface and perform standard procedures on the Encyclopedia files.
Oxygen Interface
An introduction to the main components of the Oxygen interface.
Create an XML-Project
Using Oxygen XML Editor to organize files.
Transform DOCX to TEI
How to convert DOCX files to TEI in Oxygen.
Procedures for converting single pages into Encyclopedia entries.
Convert Page to Entry Files
Before page files can be converted to entry files, we need to do some housekeeping.
Validate Entry Files
Use Oxygen to validate the entry files.
Entry-Inventory File
Document the filenames of every entry in a section using the entry-inventory file.
Check the Entry Terms List
Compare our list of recognized entry terms with the print pages.
Reference information on file/folder names, TEI-encoding standards, and unicode characters.
Editorial standards
The following editorial principles are employed in creating this digital edition.
Image Sources
Bibliographic information on print editions and image repositories.
Naming Conventions
Lists the naming conventions we use for editions, sections, folders, and files.
TEI Style Manual
All TEI encoding must follow these guidelines.
Unicode Characters
List of unicode characters and entities used frequently in the Encyclopedia and not on the standard US keyboard.

Project Director Peter Logan