VITA Toolkit Help

Multipage Text Documents and Volumes 6.4, Sept 2022, p. 11

The following text may have been generated by Optical Character Recognition, with varying degrees of accuracy. Reader beware!

On the next screen, the tool needs to process the single PDF into multiple pages, extract the text and create the Thumbnails and other associated files for each page. Note: It takes approximately 1 minute per page to generate all the display images as well as OCR'ing the page content and applying to coordinates that will produce the hit highlighting on the page text when a search is performed. As well, it will take up to 20 minutes for the full text and hit highlighting to be active on your site depending on when the new material enters the indexing cycle. Begin processing the individual pages of the PDF. Continue ... Wait for all the files to be processed completely and then click the button at the bottom of the screen: Microsoft Word - GHPL Digitization Project Tips Manual doc 25 Af infringement by safeguarding 28 processed yr 16:06:07 Continue er On the File/Tech screen, you'll see the files generated from your PDF, including the Thumbnail and Regular image files, every page with extracted text snippets, and a complete PDF that is automatically associated with the record. You can make the full PDF public or not, depending on whether you want to allow downloading. 11

Powered by / Alimenté par VITA Toolkit
Privacy Policy