Search and Tagging

Home | Search and Tagging
Reports

Reports

An excellent collection of reports available by default.


View/Edit OCR Text

View/Edit OCR Text

An inbuilt OCR engine that can extract text from image files such as GIF, TIFF, PNG, etc. with the ability to edit/correct extracted text.



Custom Meta Data

Custom Meta Data

Store identifying information of any type about your documents. Customize your meta-data to suit your business processes. You can also use advanced meta-data to better organize and index your Doccept content library.


Optical character recognition (OCR)

OCR

Optical character recognition (OCR) is the process of converting images containing typed or clearly written text into machine-encoded text that is editable / searchable. Typical sources of images with text would be scanners, digital faxes, or screen captures / photographs of papers containing text.

OCR is a built-in feature within Doccept Enterprise edition. Unlike most other solutions which require an external plug-in that costs extra, Doccept has an in-built OCR engine that can extract text from images. While performing the OCR, Doccept stores the extracted text separately, and ensures that the original files are not modified in any-way. Doccept also allows you to edit the extracted text, so you can make future searches more relevant and accurate.

OCR feature of Doccept provides the following:

  1. Extraction of text from files containing images with text. These could be the typical image files such as JPG, GIF, BMP, and TIFF or PDF files containing images.
  2. Ability to automatically run the OCR on these files as they are imported, or in manual mode.
  3. Ability to search and edit the extracted text, while keeping the original file intact.
  4. Run OCR manually, as needed, even on files on which it was already run (by changing settings).
  5. Ability to plug-in external OCR engines such as ABBY, Kofax, etc. as needed.

For best results when performing OCR on scanned files, scanner setting should be set to capture at 300 dpi.


Zonal OCR

Zonal OCR

Perform OCR on specific sections or “zones” within your documents