Search icon
Arrow left icon
All Products
Best Sellers
New Releases
Books
Videos
Audiobooks
Learning Hub
Newsletters
Free Learning
Arrow right icon
Intelligent Document Capture with Ephesoft, Second Edition - Second Edition
Intelligent Document Capture with Ephesoft, Second Edition - Second Edition

Intelligent Document Capture with Ephesoft, Second Edition: Automate the processing of scanned and digital documents by improving accuracy using web-based open and modern intelligent document capture software, Second Edition

€25.99 €17.99
Book Aug 2015 164 pages 2nd Edition
eBook
€25.99 €17.99
Print
€32.99
Subscription
€14.99 Monthly
eBook
€25.99 €17.99
Print
€32.99
Subscription
€14.99 Monthly

What do you get with eBook?

Product feature icon Instant access to your Digital eBook purchase
Product feature icon Download this book in EPUB and PDF formats
Product feature icon Access this title in our online reader with advanced features
Product feature icon DRM FREE - Read whenever, wherever and however you want
Buy Now

Product Details


Publication date : Aug 24, 2015
Length 164 pages
Edition : 2nd Edition
Language : English
ISBN-13 : 9781783558582
Category :
Table of content icon View table of contents Preview book icon Preview Book

Intelligent Document Capture with Ephesoft, Second Edition - Second Edition

Chapter 1. A Quick Tour of Ephesoft

As an introduction to Ephesoft, we will first walk you through the user interface and then examine the installation folder. The locations of certain files and folders within the Ephesoft installation are important because an administrator must make changes here to enable some features.

In this chapter, we will examine the following aspects of Ephesoft:

  • The user interface

  • The installation folder

The user interface


After logging in, users can access Ephesoft's features from an automatically hiding menu of navigation items that we will refer to as the side navigation. To display this menu, simply move your mouse cursor to the left-hand side of the browser window.

Ephesoft has organized this side navigation so that administrative features are separate from the common functions that operators use. Operators typically submit batches and review and validate Ephesoft's output, supplying additional information about the document images being processed.

Administrators enable these activities by defining the operations to be performed on each type of batch. Administrators also monitor and control the processing of the batches.

Ephesoft's navigation menu

Administrative features


The side navigation provides links to five areas of the system that are commonly used by administrators:

  • Batch class management

  • Batch instance management

  • Folder management

  • System configuration

  • Reports

Batch class management

A batch class defines a set of operations that should be performed on the page images that are provided as input. A batch class consists of document types, document fields, batch class fields, e-mail configuration, and workflow/plugin configuration. The Batch Class Management interface allows administrators to create, modify, edit, and delete batch classes.

Ephesoft's batch class management user interface

The batch class management interface displays a list of batch classes. Administrators can open a batch class to configure the following:

  • Document types: The documents that will be processed in the batch class are configured here. Each document type is described by a distinct set of properties called fields. Rules can be configured to extract information from the document into the fields, thereby automating the process of indexing the document.

  • Modules: Modules are the major steps in the processing of documents. Each module is implemented by a series of plugins.

  • E-mail configuration: In this portion of the administrative interface, users can provide connection information for an e-mail account, and Ephesoft will process e-mails sent to this address. Ephesoft processes both the e-mail body and the attached documents.

  • Scanner profiles: This is where administrators can associate one or more scanner configurations with each batch class. These profiles are available in the web scanner.

  • CMIS import: CMIS is a standard protocol for communicating with document repositories. Ephesoft can use CMIS to monitor a standards-compliant document repository for input.

  • Batch class fields: Ephesoft can associate information with a batch (the group of page images that are processed together) as a whole. Each piece of information associated with a batch is called a batch class field. Batch class fields are applied to batches and should not be confused with document fields, which contain information that applies to individual documents.

Batch instance management

A batch instance is a set of page images processed together. The terms batch and batch instance are usually interchangeable. This area within the administrative interface allows administrators to see the status of batches, reprioritize batches, and restart batches in a previous processing step.

Ephesoft's batch instance management user interface

Folder management

The folder management interface allows the administrator to upload files for batch class configuration. These files are also accessible from the installation folder, but this is often a more convenient way to manipulate these files.

Ephesoft's folder management user interface

System configuration

This administrative interface allows users to manage Ephesoft in ways that are not specific to a batch class or instance.

Ephesoft's system configuration user interface

System configuration allows the modification of the following features:

  • Regex pool: The regular expression pool is a library of regular expressions that administrators can access when creating extraction rules for a batch class.

  • Workflow management: Ephesoft's features are implemented in components called plugins. The workflow is the sequence in which these plugins are executed. This portion of the user interface allows an administrator to specify what plugins are available when configuring the workflow for a batch class.

  • Connection manager: The connection manager allows you to create and test database connections. These connections are used by plugins to access databases.

  • License details: This allows administrators to see the expiration date of the license and the features that are enabled.

Reports

Reporting can be enabled to provide administrators with statistics on the system and throughput. The administrator can filter reports by criteria such as batch class or start date. Advanced reports are also available, including correction reporting. Correction reporting identifies when operators made corrections to Ephesoft's automated processing. This information can be used to optimize the configuration over time.

Ephesoft's reporting user interface

The operator user interface


The side navigation provides links to the following four areas of the system that are commonly used by operators:

  • Batch list

  • Review validate

  • Web scanner

  • Upload batch

Batch list

The batch list shows the batch instances that require review or validation.

The review process involves documents that could not be identified as being of a certain type. In Ephesoft, as with most image capture systems, we say that these documents could not be classified. The review interface allows operators to split and merge pages of documents and specify the document type.

The validation process involves fields for which data could not be extracted from the document, or fields where the extracted values do not comply with the previously specified rules.

Ephesoft's batch list user interface

Review validate

The review validate screen will present the operator with the next available batch for processing according to priority and batch date.

Ephesoft's review validate user interface

Web scanner

Ephesoft is capable of capturing content from a scanner attached to the user's workstation. What is unique about the web scanner is that no software needs to be installed on the workstation; Ephesoft uses a Java applet to send content directly to the server from any TWAIN-enabled scanner.

Ephesoft's web scanner user interface

The first time a user logs into the operator interface and selects the Web Scanner link on the side navigation, the user will have to choose a scanner. When the user selects the Source button, the user will be shown all TWAIN devices that have been installed on the user's workstation. Once the scanner is selected, the user can select the batch class to be used for processing and start the scan job.

Upload batch

Operators can submit PDF and TIF files directly to Ephesoft for processing by using the upload batch feature. Once the documents are selected and uploaded, the operator can select the appropriate batch class and start the batch processing.

Ephesoft's upload batch user interface

File system


The following are some important folders that are created when Ephesoft is installed. These are subfolders beneath the Ephesoft installation folder:

  • Apache 2.2: Apache can be used in front of Ephesoft for load balancing and failover. It is included in the installation but not configured.

  • Application: The Ephesoft Java web application is installed in this folder.

  • Application/i18n, themes: These folders contain files to customize and localize the Ephesoft application.

  • Application/native/RecostarPlugin: This plugin provides the image OCR functionality.

  • Application/WEB-INF/classes/META-INF: System configuration property files are stored in this folder.

  • Dependencies/gs, ImageMagick: Applications that Ephesoft uses for image manipulation are installed here.

  • Dependencies/licence-util, licensing: These folders contain tools to collect the information needed to generate and install license keys.

  • Dependencies/luke: Luke is a tool that helps troubleshoot problems with Lucene indexes.

  • JavaAppServer: This folder contains the Tomcat configuration for Ephesoft.

  • JavaAppServer/conf: This is where the contexts are defined for Ephesoft; it is where URLs are bound to java code.

  • EphesoftReports: The configuration and binaries for reporting are stored here.

  • SharedFolders/BC99: The configuration for each batch class is stored here. The contents of the batch class folder can be modified through the Folder Management interface by a batch class or system administrator.

Summary


In this chapter, we looked at the administrative and the operator functionality of Ephesoft. We also looked at the installation folder on the filesystem. It's time to put Ephesoft to work.

In the next chapter, you'll learn how to train the system to recognize your documents, extract content from them, and test the configuration.

Left arrow icon Right arrow icon

Key benefits

What you will learn

Discover the benefits of using intelligent document capture in your work place Learn to capture, classify, and separate any type of document Extract important information from your documents Transfer the documents and data into your content management system Customize Ephesoft to meet your unique business requirements Understand the integration techniques using the Ephesoft web services API Convert your paper archive to electronic records efficiently Automate business processes that depend on documents in paper, fax, or email attachment format Implement distributed capture for mailroom automation

What do you get with eBook?

Product feature icon Instant access to your Digital eBook purchase
Product feature icon Download this book in EPUB and PDF formats
Product feature icon Access this title in our online reader with advanced features
Product feature icon DRM FREE - Read whenever, wherever and however you want
Buy Now

Product Details


Publication date : Aug 24, 2015
Length 164 pages
Edition : 2nd Edition
Language : English
ISBN-13 : 9781783558582
Category :

Table of Contents

14 Chapters
Intelligent Document Capture with Ephesoft Second Edition Chevron down icon Chevron up icon
Credits Chevron down icon Chevron up icon
Foreword Chevron down icon Chevron up icon
About the Authors Chevron down icon Chevron up icon
About the Reviewers Chevron down icon Chevron up icon
www.PacktPub.com Chevron down icon Chevron up icon
Preface Chevron down icon Chevron up icon
1. A Quick Tour of Ephesoft Chevron down icon Chevron up icon
2. Creating a Batch Class Chevron down icon Chevron up icon
3. Core Ephesoft Features Chevron down icon Chevron up icon
4. Ephesoft's Advanced Features Chevron down icon Chevron up icon
5. Tips Chevron down icon Chevron up icon
References Chevron down icon Chevron up icon
Index Chevron down icon Chevron up icon

Customer reviews

Filter icon Filter
Top Reviews
Rating distribution
Empty star icon Empty star icon Empty star icon Empty star icon Empty star icon 0
(0 Ratings)
5 star 0%
4 star 0%
3 star 0%
2 star 0%
1 star 0%

Filter reviews by


No reviews found
Get free access to Packt library with over 7500+ books and video courses for 7 days!
Start Free Trial

FAQs

How do I buy and download an eBook? Chevron down icon Chevron up icon

Where there is an eBook version of a title available, you can buy it from the book details for that title. Add either the standalone eBook or the eBook and print book bundle to your shopping cart. Your eBook will show in your cart as a product on its own. After completing checkout and payment in the normal way, you will receive your receipt on the screen containing a link to a personalised PDF download file. This link will remain active for 30 days. You can download backup copies of the file by logging in to your account at any time.

If you already have Adobe reader installed, then clicking on the link will download and open the PDF file directly. If you don't, then save the PDF file on your machine and download the Reader to view it.

Please Note: Packt eBooks are non-returnable and non-refundable.

Packt eBook and Licensing When you buy an eBook from Packt Publishing, completing your purchase means you accept the terms of our licence agreement. Please read the full text of the agreement. In it we have tried to balance the need for the ebook to be usable for you the reader with our needs to protect the rights of us as Publishers and of our authors. In summary, the agreement says:

  • You may make copies of your eBook for your own use onto any machine
  • You may not pass copies of the eBook on to anyone else
How can I make a purchase on your website? Chevron down icon Chevron up icon

If you want to purchase a video course, eBook or Bundle (Print+eBook) please follow below steps:

  1. Register on our website using your email address and the password.
  2. Search for the title by name or ISBN using the search option.
  3. Select the title you want to purchase.
  4. Choose the format you wish to purchase the title in; if you order the Print Book, you get a free eBook copy of the same title. 
  5. Proceed with the checkout process (payment to be made using Credit Card, Debit Cart, or PayPal)
Where can I access support around an eBook? Chevron down icon Chevron up icon
  • If you experience a problem with using or installing Adobe Reader, the contact Adobe directly.
  • To view the errata for the book, see www.packtpub.com/support and view the pages for the title you have.
  • To view your account details or to download a new copy of the book go to www.packtpub.com/account
  • To contact us directly if a problem is not resolved, use www.packtpub.com/contact-us
What eBook formats do Packt support? Chevron down icon Chevron up icon

Our eBooks are currently available in a variety of formats such as PDF and ePubs. In the future, this may well change with trends and development in technology, but please note that our PDFs are not Adobe eBook Reader format, which has greater restrictions on security.

You will need to use Adobe Reader v9 or later in order to read Packt's PDF eBooks.

What are the benefits of eBooks? Chevron down icon Chevron up icon
  • You can get the information you need immediately
  • You can easily take them with you on a laptop
  • You can download them an unlimited number of times
  • You can print them out
  • They are copy-paste enabled
  • They are searchable
  • There is no password protection
  • They are lower price than print
  • They save resources and space
What is an eBook? Chevron down icon Chevron up icon

Packt eBooks are a complete electronic version of the print edition, available in PDF and ePub formats. Every piece of content down to the page numbering is the same. Because we save the costs of printing and shipping the book to you, we are able to offer eBooks at a lower cost than print editions.

When you have purchased an eBook, simply login to your account and click on the link in Your Download Area. We recommend you saving the file to your hard drive before opening it.

For optimal viewing of our eBooks, we recommend you download and install the free Adobe Reader version 9.