V8 OCR Capability Does it allow indexing of the text in Secured PDFs?

  • V8 OCR Capability Does it allow indexing of the text in Secured PDFs?

    Apologies if there is already a "How To" or FAQ on this; if so, I would appreciate a pointer to it.
    V8 has OCR capabilities.

    I have several PDF documents that were locked by the authoring group, to prevent image and text extraction, copying, etc. and the credentials to unlock are unknown.

    One option would be to screenshot them with MWSnap or another high grade screenshooter, then OCR the resulting images.

    Looking through the Zoom8 indexing menus, I don't see any obvious hint as to how to OCR documents that consist of images of text.
    The indexing logs just indicate that the documents are secured.

    Does the security that prevents copying/extraction of images and text also prevent the Zoom8 OCR capability from picking up the text in them, or do I need to tweak some other menu setting?

    [internal, restricted website]

    Thanks

  • #2
    Yes, images need to be extracted from the PDF files before they can be OCR'ed. Embedded images are a different thing compared to encrypted text (text is not an image unless the original document was made by scanning a bunch of documents, or photographing them).
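
To see which of the two cases a given file falls into, a rough triage can help. The following is only a minimal sketch and is not how Zoom itself inspects PDFs: it scans the raw bytes for an `/Encrypt` entry (present in the trailer of secured PDFs) and counts image objects (`/Subtype /Image`). The function name `triage_pdf_bytes` and the result keys are hypothetical, and the byte scan is a heuristic that can miscount on unusual files.

```python
import re

# Heuristic patterns over the raw PDF bytes (assumption: no real PDF parser).
ENCRYPT_RE = re.compile(rb"/Encrypt(?![A-Za-z])")   # trailer entry of secured PDFs
IMAGE_RE = re.compile(rb"/Subtype\s*/Image")        # image XObject dictionaries


def triage_pdf_bytes(data: bytes) -> dict:
    """Return a rough summary of a PDF: secured or not, and how many images."""
    return {
        # The %PDF- header must appear near the start of the file.
        "is_pdf": b"%PDF-" in data[:1024],
        # An /Encrypt entry means the document is secured in some way.
        "has_encrypt_entry": ENCRYPT_RE.search(data) is not None,
        # Image objects are the only parts where OCR could apply.
        "image_objects": len(IMAGE_RE.findall(data)),
    }


if __name__ == "__main__":
    import sys

    # Usage: python triage.py file1.pdf file2.pdf ...
    for path in sys.argv[1:]:
        with open(path, "rb") as fh:
            print(path, triage_pdf_bytes(fh.read()))
```

A file that reports an `/Encrypt` entry and zero image objects most likely holds ordinary (encrypted) text, so OCR would have nothing to work on. A file with many image objects is more likely a scan.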
