This folder contains several data for using with OCRopus. Parts of it might not be available in packaged release since some of the subfolders are quite big. The subfolders are: lines -- line images for testing models -- trained models, e.g. neural networks pages -- page images with transcription in ASCII samples -- several samples in different languages, quality and scripts words -- word lists for language modelling