Term
|
Acronym
|
Definition
|
Automated
Data Entry |
-
|
Process
of validating documents, checking for missing documents, and
assigning indexes or keywords, prior to storing them in a document
management system. |
Automatic Document Feeder
|
ADF
|
A
scanner accessory which feeds a paper into the scanner. |
Black
Border Removal |
-
|
A
software or hardware algorithm which removes the border around a
scanned document. A
slight border of black may be created during scanning because
scanners may read over the edge of the document.
Black Border Removal corrects thisand is an important
feature of a tools category called Image Clean Up. |
Bar
Code Recognition |
BCR
|
A
graphical representation of lines of varying widths representing
data. Bar Codes are
not readable by the human eye, but provide a high degree of
recognition using Bar Code scanning technology. |
Business
Process |
BP
|
A
set of rules or definitions, written or un-written, governing
information flow. |
CCITT
Group 4 Compression |
G4
|
A
two dimensional compression algorithm that uses patterns to
compress binary images, black and white text and line art. |
Certified
Document Imaging Architect |
CDIA
|
Industry
certification for document management and imaging professionals.
Certification requires passing a test covering concepts on:
capture, systems sizing, protocols, archival, storage and
performance. |
Charge-Coupled
Device |
CCD
|
Sensor
used in scanners to read documents being scanned into memory. |
Color
Dropout |
-
|
The
use of specially colored lamps during scanning so the color form
disappears leaving only data. |
Compression |
-
|
An
algorithm which runs via software or hardware reducing a
document's file size. Compression
is used to reduce storage space. |
Compression
Ratio |
-
|
The
ratio of uncompressed to compressed files size.
Theoretical limit of G4 compression is 20 to 1.
Typical results vary based on the document type.
10-15 to 1 is common. |
Database
Management System |
DBMS
|
Software
offering database services, such as Oracle and Microsoft SQL
Server. |
Decompression |
-
|
The
opposite of compression, decompression is an algorithm extracting
the compressed document's data. |
Deskew |
-
|
A
software or hardware algorithm correcting scanned or slightly
rotated documents. Deskew
is an important feature of a tools category called Image Clean Up. |
Despeckle |
-
|
Similar
to copying except an electronic document image is created and
stored on a computer. |
Digitizing |
-
|
The
act of processing documents through the scanner creating digital
files or images of the paper being processed. |
Document
Capture |
-
|
Similar
to copying except a document's electronic image is created and
stored on a computer. |
Document
Management |
DM/EDM
|
A
system storing and retrieving both digital document images and
electronic documents such as spreadsheets and word processing
files. Document
Management systems often possess ‘librarian’-type functions
such as check in/out or version control. |
Document
Preparation |
-
|
Pre-scanning
operation removing staples, notes, clips, bindings then sorts and
inserts separator pages. |
Dots
Per Inch |
DPI
|
A
measurement of scanner resolution relative to the original
document. Each dot
represents a pixel. The
more dots in a given inch, the better the clarity and higher the
resolution. Resolution is also a function of file size.
Because compression algorithms are
based on patterns, higher resolution documents compress
less, resulting in larger file sizes. |
Duplex |
-
|
A
scanner capable of reading both the front and back sides of
document simultaneously. This
decreases the effort to capture both images. |
Duty
Cycle |
-
|
Recommended
use level by the scanner manufacture measured in pages per month.
This is not the same as rated speed. |
Endorser |
-
|
Scanner
accessory which stamps documents with a number or data key either
pre or post scan. |
Fat
Client |
-
|
Client
software installed at each computer which communicates with
another location's server. The
term "fat" is used because all client components are
installed at the computer and the user typically operates a vendor
specific interface. The
opposite of "Thin Client". |
Forms
Processing |
-
|
The
act of extracting information from documents after being captured.
Forms Processing software uses histogram technology to
determine the form being processed, then applies a form specific
template to aid the data extraction from the form.
Data extraction may be via OCR for machine print,
Intelligent Character Recognition (ICR) for hand written data and
Optical Mark Recognition (OMR) for check boxes. |
Full
Page OCR |
-
|
The
process of converting graphical character representation on a page
to text (ASCII) data. See
also: Optical Character Recognition |
Grayscale |
-
|
A
scanner feature resulting in outputting multiple shades of gray.
While file size increases, it may create a more readable
document. |
Image
Clean Up |
-
|
A
software or hardware algorithm correcting a variety of issues
encountered as a result of the scanning process.
Typical features include: Desckew, Despeckle, Black Border
Removal, Shading Removal, Line Removal. |
Image
Enhancement |
-
|
A
software or hardware algorithm correcting a variety of issues
encountered as a result of the scanning process.
Typical features include: Deskew, Despeckle, Black Border
Removal, Shading Removal, Line Removal. |
Imprinter |
-
|
Scanner
accessory which stamps documents with a number or data key either
pre or post scan. |
ISIS |
-
|
A
scanner communication standard developed by Pixel Translations. |
Joint
Photographic Experts Group |
JPEG
|
A
compression algorithm primarily used for color documents.
Currently the dominant compression technique used on the
internet and web today. |
Landscape |
-
|
Scanning
or display of documents rotated 90 degrees.
That is, 11" wide by 8.5" tall. |
Line
Removal |
-
|
A
software or hardware algorithm removing vertical or horizontal
lines from scanned documents, such as boxes on forms.
Line removal is an important feature of a category of tools
called Image Clean Up. |
Open
Database Connectivity |
ODBC
|
Microsoft
standard for communicating with databases.
Applications supporting ODBC communicate with any database
management system that is also ODBC compliant. |
Operating
System |
OS
|
Computer
software that communicates with other devices and networks
connected to a computer and also provides an interface for users
to operate. Such as Windows 98, NT, 2000 or XP. |
Optical
Character Recognition |
OCR
|
Conversion
of a character image representation to ASCII text representation. |
Optical
Mark Sense |
OMR
|
A
technology used by Forms Processing applications to specific boxes
checked on a from. A
checked box's data value may be "yes," and a blank box's
data value may be "no." |
Oracle |
-
|
A
commercial database engine many software applications integrate
with for database services. |
Pages
Per Minute |
PPM
|
Unit
of scanner performance. Pages
or documents scanned in a given minute, either simplex or duplex.
See also: Rated Speed |
Patch
Code |
-
|
A
unique graphic, developed by Kodak,
placed on separator sheet to indicate document separation
and other scanning functions. |
Pixel
Element |
Pixel
|
The
smallest unit of measure represents all on-screen data.
Each character, image, picture or color is made up of a
collection of pixels. |
Portable
Document Format |
PDF
|
Document
file format developed by Adobe storeing textual and graphical
information in a single format.
While it is a proprietary format, the viewer is available
for free. |
Portal |
-
|
A
website functioning either as part of an intranet or the internet
consolidating information from a variety of sources unbeknownst to
the user. This
popular interface provides one location for finding information
and is familiar to the user. |
Portrait |
-
|
Scanning
or display of documents in typical orientation.
That is, 8.5" wide by 11" tall. |
Rated
Speed |
-
|
Measurement
of scanner performance based on scanning 200 DPI images through
the scanner landscape measured in pages per minute. |
Resolution |
-
|
The
density of pixels in a scanned document.
See also: Dots Per Inch |
Retrieval |
-
|
The
act of finding a document in a document management or portal
solution, equivalent to running a specific criteria search to
locate the information required. |
Scanning |
-
|
Processing
documents through the scanner to create digital files or images of
the paper being processed. |
Separator
Sheet |
-
|
A
page inserted at the time of document preparation indicating the
start of a new document to the scanning software. |
Shade
Removal |
-
|
A
software or hardware algorithm removing shaded areas on scanned
documents. Shade
Removal is an important feature of a category of tools called
Image Clean Up. |
Simplex |
-
|
A
scanner capable of reading only one side of a document at a time.
Simplex scanners can scan double sided documents, but the
document stack must be flipped, scanned and collated.
Presently, nearly all production level scanners are duplex. |
Small
Computer Systems Interface |
SCSI
|
Industry
standard for connecting and controlling peripheral devices such as
scanners, printers and tape drives. |
SQL
Server |
-
|
Microsoft's
commercial database engine many
software applications integrate with for database services. |
Structure
Query Language |
SQL
|
Standard
language to communicate with database servers such as Oracle and
SQL Server. |
Tagged
Image File Format |
TIFF
|
Standard
file format used by most document management systems.
While standard, there are many variations of this file
format. The most
popular being a multi-page G4 TIFF file. |
Thin
Client |
-
|
Web
based application that communicates with a server.
The term thin is used because all that is needed on the
client is the web browser, versus all client components. |
TWAIN |
TWAIN
|
A
software interface for controlling scanners and digital cameras.
Competes with ISIS technology. |
Virtual
ReScan |
VRS
|
Technology
developed by Kofax greatly improving the quality of post-scan
documents using advanced enhancement techniques. |
Workflow |
-
|
An
automated electronic document-routing system enabling users to
quickly process work with greater efficiency and faster accuracy
than traditional paper processing. Workflow instantly and
electronically transmits documents through the business process as
each increment or user work is completed. Workflow can be
scheduled to run at the most convenient time and run completely
unattended. |
Zonal
Optical Character Recognition |
OCR
|
The
process of converting graphical character representation on a page
to text (ASCII) data. See
also: Full Page OCR |