Glossary of Terms
Need help deciphering the jargon used in this site or within our product graphical user interfaces? You can consult the list below for an extensive library of terms and acronyms used throughout this website.
Adaptive Pattern Recognition Processing (APRP)
The technology used specifically by Convera Technologies (embedded in PowerRetrieve software suite) to retrieve information from documents that have been processed through OCR. The software looks for patterns in the binary code that make up ASCII text, while performing full-text searches. Instead of comparing text to libraries of information, the software looks for matching binary code patterns. This methodology for searching is generally considered much faster and more efficient than standard full-text searches.
American Standard Code for Information Interchange (ASCII)
ASCII is the most widely used and accepted method for small computers to convert characters, numbers, punctuation, and control codes into digital format. ASCII characters can be recognised and understood by other computers and by communication devices. ASCII generally consists of minimal formatting, such as bold, italics, and so forth.
Annotation
The process of attaching information to a scanned image. This information can be in many various forms, such as highlighting key areas, adding a 'post-it' note, drawing lines, or blanking out areas. Annotation functions are available in PowerBook, PowerFilm and PowerRetrieve Search.
Application
A generic term referring to any software program that carries out some useful task for an end-user. For example, word processing and presentation graphic packages are considered applications.
Application Programming Interface (API)
A generic term referring to a programmable format that assists one program to communicate with another. For example, an imaging vendor can provide an API that enables programmers to integrate their imaging system with a workflow package, or to customize the user interface.
Auto Masking
Masks any heavy black borders on the image, which result from having space between the edges of the document and the boundaries of the scan area. When Auto Masking is enabled the black areas are replaced with white areas.
Automatic Document Feeder (ADF)
An ADF is an element on a scanner that enables automatic feeding of pages into the scanner. ADF’s provide an excellent platform for unattended batch scanning. Generally, they have a maximum capacity of 50 pages.
Base Directory
An element of PowerBook and PowerFilm. Situated on the Batch Control Tab of the main user interface, here users can specify the output folder to be used to store their scanned documents. It is possible to specify any local or network path on which to store scanned output. This destination will be used to store the image file (e.g TIFF File) and the output file type, for example CDPublish IDX File. The inclusion of Configurations means that users can simply select different profiles to specify different output directories. This is less arduous than having to manually modify the output path each time a different path is required.
Batch
A batch can refer to either a physical stack of paper documents ready for scanning, or a group of electronic images generated by scanning a batch of paper documents.
Bitonal
Bitonal documents are documents comprised of Black and White dots, or pixels.
Bitmap
A graphic picture stored as a series of pixels. Each pixel represents the colour at a location in the picture. File extension format is .BMP.
Black and White
An image type that creates an image comprised of black and white dots, or pixels. Black and White images will be bitonal. When this option is selected, the scan file type is automatically set to TIFF and image compression defaults to Group 4 2d Fax.
Black Crop
The process of removing any black areas around an image, resulting in a clearer image, which can improve OCR results. This is a feature of InfoCAP Technologies' book scanning software, PowerBook, and microfilm/fiche digitization application, PowerFilm
Blip
see Image Mark
Blipped Film
Film that contains an Image Mark. Image marks are beneficial as they enable image searching. Users can specify the initial film position and then browse the film to get to the required frame.
Boolean Logic
A text searching tool that identifies documents by the relationships of words within them. Using logical operators like AND, OR, NOT and WITHIN to narrow the scope of a text search. PowerRetrieve Search uses Boolean search technology to assist with querying.
Brightness
Sets the reflectivity of the image, including both light and dark areas. Generally, lower Brightness settings will lighten the overall image. Brightness comes as an option available on the Image Tab of the PowerBook and PowerFilm user interfaces.
Browse
This term can have two specific meanings. For example, in PowerFilm, a user may initiate a browse command to scroll through the film to view each sheet of microfilm in the sequence that they were added to the film. In PowerRetrieve Search, users can double click a Document Folder to browse the entire contents of that folder. They may then choose to sort the Hit List by a number of specific parameters to make the browsed data more pertinent to their requirements.
Button
Part of the graphical user interface (GUI). Each button reflects a real life button in texture and shape and acts accordingly in the electronic environment of the application. Buttons are clicked in order to carry out specific commands. Their physical appearance changes to represent their current status.
Capture
A term referring to scanning (sometimes referred to as "digitization") in paper documents and creating an electronic representation of that document. Capture may also refer to the act of creating an electronic image based upon scanning a roll of film or microfiche, for example. Most capture applications offer the opportunity to scan in documents and create a TIFF image file. Index Field data can be applied at capture time which can then be used in the future to publish and retrieve upon, for instance. InfoCAP's microfilm digitization software, PowerFilm, enables users to build databases with CDPublish and then search them using CDRetrieve.
Cartridge Film
16-mm microfilm encased in an ANSI or 3M cartridge.
CDPublish
CDPublish is a database creation package that allows users to build/publish a database from images and index information created by PowerFilm, PowerBook or PowerGate. (Or other image capture systems that can conform to the CDPublish input specification - documented in the manual). Databases are built on hard drives and can easily be distributed on CD.
CDRetrieve
CDRetrieve is the software package that allows users to quickly search their database and retrieve the desired images. CDRetrieve searches the index field values associated with each image (or batch of images) in order to return the required feedback. Users open a published database from hard disk or CD.
Centering
Automatically centres the scanned image inside the defined image size. Auto Masking must be enabled before Centering can be selected.
Channels A and B
Cartridge film is divided into two channels along the film feed direction. The lower channel is called channel A (A-ch) and the upper channel is called channel B (B-ch).
Compact Disc (CD)
A standard optical medium for storage of digital data in machine-readable form, accessible with a laser based reader. Data is written on directly accessible tracks, making them faster to access. Data is recorded directly into binary code, making the data more accurate.
Compact Disc-Read Only Memory (CD-ROM)
A type of optical storage device that is physically the same as CD, but contains computer data. Storage capacity varies but is usually 650-700MB. CD-ROMs are designed to be read from only. Data can not be deleted from CD-ROMs. CD-ROMs are interchangeable between different types of computers.
Compression
A process that "shrinks" images so they occupy less storage space. This facilitates quicker transmission of data. Generally accomplished by exploiting the redundant nature of images. Group IV compression is the most recent and widely accepted standard for (TIFF) image compression as defined by the CCITT. The Group IV standard supersedes the Group III standard
Computer
A device that allows you to input data and have it stored, processed, or otherwise manipulated quickly and efficiently. Also see PC.
Configuration
A set of specific parameters designed to work with one specific document type. See Profile.
Configuration Profile
A feature of PowerRetrieve Capture, containing a set of specific parameters designed to work with one specific document type. PowerRetrieve Capture supports multiple Configuration Profiles to handle mulitple document types. See Profile for more information.
Consultative Committee for International Telegraphy and Telephony (CCITT)
Also called the International Telegraph and Telephone Committee. A committee within the International Telecommunications Union (ITU) that issues recommendations for data communications standards. In terms of document imaging, the CCITT helped standardize the TIFF format into more specific Group III or Group IV formats (among others).
Contrast
Allows the scanner to detect subtle differences in greyscale levels at scan time. Higher contrast levels generally produce cleaner images but lose some detail. Low contrast levels produce noisy images, or images that appear smudged, but preserve the details. Contrast comes as an option available on the Image Tab of the PowerBook and PowerFilm user interfaces.
CSV File
A comma delimited data file that contains the index information for your image or batch of images. The CSV (Comma Separated Values) file is compatible with Microsoft Excel. The index fields are exported in the following order: image file name (no path), date (in Windows long date format), index field 1, index field 2, index field 3, etc.
Data
The term data can be used in various ways. It can mean large quantities of standard information used for reference. It may also refer to readable information used as a reference or record located in a database. In addition, it could mean a large quantity of bits and bytes that help make up the programs you use every day.
Database
In its most stripped-down form, a series of tables, forms, macros that contain series of data all of which constitute a database - or in other words; an electronic filing system. PowerRetrieve deploys a Microsoft SQL database in order to store and access information.
Deskew
A process that improves or eliminates the slant or angle of an image. Images are often scanned in at an angle, which affects the accuracy of the OCR process. Deskewing an image can aid the OCR process and improve readability.
Despeckle
An image cleanup feature that removes unwanted dots or speckles from an image. This provides a cleaner image and can improve the readability of the image (for example, for OCR processing).
Digitization
The process of converting an image into binary code. In the case of document imaging, the scanner captures the paper and the image is then converted into binary code. "Digitizing" is often used interchangeably with the term "scanning."
Document
A collection of data, organized into some order, and placed in some type of storage, presumably for future retrieval. The document length is user-defined and varies from one page to many pages. In a document imaging system, documents refer to the scanned batches of paper that are put into the imaging system, stored, and presumably retrieved at a later date.
Dots Per (square) Inch (DPI)
The number of dots a scanner can physically distinguish in each vertical and horizontal inch of an original image. If a page is scanned at 300 dpi, each one square inch contains 300 x 300 = 90,000 dots.
DSV Scanners
See 2400DSV and 3000DSV for more.
Duo Film
Film on which images are arranged in one of two channels up to the end of the channel and the remaining images are arranged in the other channel in the reverse direction (starting at the end of the first channel).
Duplex (or Dual-Sided) Scanning
The process of scanning both sides of a page at the same time. This functionality is determined by the hardware specification of the scanner.
Duplex Film
Film on which front images of documents are arranged in one of two channels and rear images of the same documents are arranged in the other channel.
Short for Electronic Mail. The email function sends a command to your Windows interface to launch your default simple MAPI client in order to send a document as an attachment to your mail recipient.
Fax
Fax, or Facsimile transmission. The fax function sends a command to your Windows interface to launch your associated fax software application in order to fax a document directly to your fax recipient. Fax printing requires separate installation of third party Windows Fax software that integrates with Windows applications via the use of a Windows Fax/Print driver.
Fiche
See Microfiche.
FileMaster+
The FileMaster+ Book scanner is ideal for scanning bound media up to A2 in size. Large format books no longer need to be positioned face down causing damage to book bindings. It comes equipped with a SCSI interface and printer port and in combination with the available software "PowerBook" the FileMaster+ scanner can be easily integrated into digital imaging solutions.
Film
See Microfilm.
Flatbed
An element on some scanners that enables pages to be read by a scanner one by one. Typically, documents are laid face down, one at a time. Flatbed scanners enable scan operators to scan in documents that have trouble being fed into an ADF. For instance, a heavily creased document using an ADF may cause a paper jam, but the same document can be read easily using the flatbed element of the scanner. Flatbed scanning is slower as only one page can be read at a time.
Frame
An area on cartridge film where a microphotographed image is contained. Each frame is associated with an image mark or a combination of image marks which determines the position of the frame on the film.
Full Text Search
The ability to search ASCII text files for occurrences of specified words, digits, sentences, or patterns of characters. Scanned documents are often OCR'ed to create an associated ASCII text file for the purpose of full-text searching.
Fuzzy Logic
A type of full-text search (see Full Text Search) that allows the user to adjust the tolerance levels for how specifically to search on a given criteria. Fuzzy logic searches often make use of Boolean logic to expand the tolerance of the search. See Adaptive Pattern Recognition Processing.
Graphical User Interface (GUI)
The visual content of a Windows application. The GUI is what the user sees and interacts with in a software application. It allows the user to perform functions by "clicking on" buttons/icons on the screen, choosing menus, dragging and dropping items, etc.
Greyscale
The range of shades of black within an image. These black pixels are assigned values (0=black through 255=white). This is in contrast to thresholded images in which each pixel is either black or white with no shading between.
Hardware
The physical (i.e., electronic, electrical, or mechanical) elements of a system not including the software programs that control the system's operation or any other applications. All the tangible parts of the computer system, both the computer and the peripheral devices are considered to be hardware.
Header
The part of an image file format that contains information associated with the image. It consists of fixed fields (provided by the host computer) and scanner-determined dynamic fields (image sequence number, image size in bytes, image level).
Icon
Part of the graphical user interface (GUI). Icons are small pictures used to represent the function of the underlying button. For instance, an icon might display a printer on it, or a floppy disk. This immediately informs the user that they may Print or Save their work by clicking on the appropriate icon. Icons are good examples of how a small picture can help the user estimate what the function of the button is.
IDX File (Index File)
An ASCII file that contains the Indexing information for your Images. IDX files are compatible with the StageWorks (from PowerScan, Inc.) suite of Document Capture software as well as used for database creation and searching via CDPublish and CDRetrieve.
Image
In terms of Paper Scanning - A bitmapped representation of a page, in which the page is represented as a pattern of dots, rather than as a collection of alphanumeric characters.In terms of Microfilm Digitization - Characters and/or photo contained in a frame of cartridge film.An image cannot be edited or altered. While technology exists which offers the ability to write "over" an image file, annotate, or otherwise change the appearance of an image, the actual image file itself is not altered.
Image Address
An address indicating the position of a frame on a film.
Image Cleanup
The process of cleaning up scanned images to enhance their quality and increase the accuracy of the OCR engine. Typically, document level scanning solutions incorporate Deskew, Despeckle and Image Crop features. These aid the OCR process as they increase the readability of the image.
Image Crop
An Image Enhancement process where Images are reduced in size by specifying that all black border around an image (Black Crop) or all white border around an image (White Crop) is removed at scan time. This can reduce the file size of an image, thus creating more space for further images.
Image Enhancement
The process of cleaning up scanned images to enhance their quality and increase the accuracy of the OCR engine. Typically, document level scanning solutions incorporate Deskew, Despeckle and Image Crop features. These aid the OCR process as they increase the readability of the image.
Image Mark
A square mark recorded in channel A or B of each frame. Image marks are used for image search. Image marks on film are classified into a maximum of three types according to their sizes (large-size, medium-size, and small-size image marks). Also known as a Blip.
Index Fields
These are placeholders that enable users to enter information about images. These can be used to associate certain key words with specific images, e.g. author name, document date, etc… information that will mean something to users. This information can be vital for speedy and efficient retrieval.
Indexing
The process of entering descriptive keywords that are associated with a document for later retrieval purposes. Indexing may also refer to the PowerRetrieve Index process whereby documents are added to the File Room and information is extracted from those documents.
Intelligent Character Recognition (ICR)
Recognition of hand-printed characters from a bit mapped image.
Interface
A set of properties and methods that provide access to a component providing a means by which a developer can use components in an application.
Joint Photographic Experts Group (JPEG)
The dominant compression technique for colour and other multi-bit-per-pixel images is the JPEG technique. This technique derives its name from the Joint Photographic Experts Group of the American National Standards Institute (ANSI), the group charged with creating and maintaining this standard. JPEG compression relies on the manner in which the human eye perceives colour, and can achieve truly astounding compression (on the order of 75:1 or better) while still rendering the uncompressed images in a nearly realistic fashion. However, one must be aware that, in order to achieve its results, the JPEG technique very carefully and selectively discards data (pixels and/or depth of colour) in such a manner that its loss upon decompression goes (virtually) unnoticed by the viewer.
Landscape
The page orientation where width exceeds length. In document imaging systems, the orientation of the page (Landscape or Portrait) affects the OCR process, indexing, and other functions based on the horizontal orientation of the text rather than the vertical orientation of the text.
Local Area Network (LAN)
Local Area Network. A topology that provides a common connection for high-speed transmissions from computers and peripherals for a distance of up to one mile. LANs enable computer resources such as PCs (workstations), file servers, printers, etc., to be shared. LANs can be connected by a variety of cabling including twisted pair, coax, or fiber optic cabling.
MAPI
MAPI or Messaging Application Programming Interface. This is the API that the software application (i.e. PowerBook, PowerFilm or PowerRetrieve Search) interacts with in order to launch your default Windows mail client when the email command is received.
Mask White
See Redaction.
Microfilm
A strip of film on which books or documents can be recorded in miniaturized form.
Microfiche (or fiche)
A sheet of film for storing publications in miniature form.
MFD (Multi Functional Device)
MFDs or Multi Functional Devices are hardware peripherals that offer a wide range of functions handily contained within one device. Typical MFD functions include the ability to Print, Fax, Photocopy and Scan. MFDs are available to basic home PC users right through to catering for the upper tier of the document management industry, with manufacturers such as Konica and Minolta at the forefront. Due to their size and flexibility more and more are being used as a cost cutting alternative in the document management sector.
MFP (Multi Functional Peripheral)
see MFD.
Noise
The small dots or specks that appear on an image. These specks increase file size and usually contain no image information. Noise can also degrade the performance of OCR.
Optical Character Recognition (OCR)
The process of analysing a scanned image, determining what each character is and producing an ASCII text file containing those characters. Imaging systems that utilise full text searching use OCR hardware and software to convert document images into text to be searched and retrieved. OCR technology is developing rapidly and is currently centred in neural network-based or feature extraction technologies. OCR devices vary widely in terms of performance, features, platforms and price.
Optical Disk
A storage medium that is written and read by a laser. Optical disks can be "rewritable, " which means the disk can be written and overwritten, or "WORM" Write Once Read Many, which can be written to, but cannot be overwritten. CD-ROMs, and Laser disks are all examples of optical disks. These disks vary widely in storage capacity, price, and intended use.
Orientation
The relative direction of a page, either horizontal (known as landscape) or vertical (known as portrait).
Output Device
Any peripheral device that takes computer data and transforms it into readable information (i.e., printers, video screens, microfilm, etc.).
Page, File, Block
Terms page, file and block come from the concept of filing. A roll of film which is assumed to consist of multiple pages each containing a frame of image, multiple files are consisting of one or more pages, or multiple blocks each consisting of one or more files. Image marks are used to indicate a page, the first page of a file, or the first page of a block according to their sizes. The largest mark indicating the block and the smallest indicating a page.
Pages Per Minute (ppm)
The speed at which a scanner or printer can process pages. This measurement does not include the time it takes for human processing or for software/driver access to the peripheral device before scanning or printing takes place.
PC
Also known as a single-user computer, PC is short for Personal Computer and is a term commonly used in the computer industry to describe an IBM or IBM compatible computer even though the term PC can refer to any personal computer, IBM, IBM compatible, or Apple Macintosh.
Pixel
Short for "pixel element" and is a relative measuring term. Pixels have no absolute size; they are simply square units of equal size. Since a pixel has no specific size, a monitor’s pixels may be a different size from an image’s scanned pixels. However, one 300 pixel per inch measurement is the same as another 300 per inch since the "per inch" qualifies it with an absolute scale.
Portrait
The page orientation where length exceeds width. In document imaging systems, the orientation of the page (Portrait or Landscape) affects the OCR process, indexing, and other functions based on the horizontal orientation of the text rather than the vertical orientation of the text.
PowerBook
PowerBook is a high performance digital image processing system for the Kodak FileMaster and FileMaster+ Book Scanners. PowerBook provides a friendly graphical user interface for converting bound media, as thick as 5cm and up to A2 in size, into electronic image files. See here for more.
PowerCD
See PowerRetrieve.
PowerFilm
PowerFilm is a high performance digital image processing system for the ABR3000D & 3000DSV and 2400DS & 2400DSV Universal Microfilm Digitizers. PowerFilm provides a friendly graphical user interface for converting fiche, Jackets, aperture cards, 16mm cartridges, and 16mm and 35mm roll film, into electronic image files. See here for more.
PowerRetrieve
Formerly known as PowerCD, PowerRetrieve is an intelligent global knowledge management solution that will manage your paper and electronic documents together in the same File Room. See here for more.
PowerRetrieve Capture
PowerRetrieve Capture allows users to capture electronic documents (via scanning or importing) and process them accordingly (for example carry out Property Field Assignment and Optical Character Recognition) so that they are compatible with the rest of the PowerRetrieve suite of products. See here for more.
PowerRetrieve Filer
PowerRetrieve Filer allows users to take ordinary electronic documents (such as Word Documents, Excel Spreadsheet, etc...) and place them into Folders in PowerRetrieve File Rooms. Each document is assigned Property Field values and the final output format is compatible with the rest of the PowerRetrieve suite of products. See here for more.
PowerRetrieve Index
PowerRetrieve Index allows users to create multiple File Rooms containing mulitple levels of Folders and Document Folders. take ordinary electronic documents (such as Word Documents, Excel Spreadsheet, etc...) and place them into Folders in PowerRetrieve File Rooms. Each document is assigned Property Field values and the final output format is compatible with the rest of the PowerRetrieve suite of products. See here for more.
PowerRetrieve Search
PowerRetrieve Search allows users to retrieve information stored in File Rooms via content and/or index field (Property Field) label searches. PowerRetrieve Search utilizes Convera's award winning pattern recognition technology, and its many different search types provide a very accurate retrieval tool. File Rooms are built and maintained in PowerRetrieve Index, and PowerRetrieve Search allows users to browse or search (query) the contents of each File Room. See here for more.
The print function sends a command to your Windows interface to launch the standard Windows Print Dialog Window. From here you can select which printer you wish to print out the selected image or multi page TIFF document to. You can set the Print range and also specify the number of copies you wish to print. Other options are available, such as double-sided (duplex) printing, however, these options are printer specific.
Printer
A hardware device responsible for taking computer data and generating a hard copy of that data. These days printer functions are also embedded into much more sophisticated devices known as MFDs or Multi Functional Devices.
Profile
A set of saved and named settings. Saving settings to a profile is a convenient way to have special-purpose settings available. You can have multiple profiles to deal with all sorts of scenarios, for example, different paper sizes. Selecting a specific profile is far quicker than having to manually modify the current settings to match your requirements. Sometimes referred to as a Configuration or Configuration Profile.
Property Fields
These are placeholders that enable users to enter information about images. These can be used to associate certain key words with specific images, e.g. author name, document date, etc… information that will mean something to users. This information can be vital for speedy and efficient retrieval.
Redaction
A process whereby unwanted or sensitive areas of an image are removed (or masked white), prior to the image being printed, faxed or emailed.
Resolution
A measure of how many dots per inch (dpi) or pixels per inch (ppi) make up a scanned or printed image. Higher resolution images produce higher quality images as they include more dots per inch. However, users should be aware that high resolution images will require more storage space.
Retrieval
The process of recovering files stored in a repository. Retrieval engines can use embedded indexing information; the information stored in Property Fields to recover the desired document. If the application is sophisticated enough, then it can use the text file (created via the OCR process) in order to recover documents. PowerRetrieve Search, offers an intelligent retrieval capability that incorporates advanced pattern and concept searching, as well as standard search methods.
Rotation
The process of rotating an image. Typically, document scanning solutions enable users to re-align an image by either 90, 180 or 270 degrees.
Scan (or Scanning)
The process of electronically capturing a paper document. The resultant outcome is known commonly as an image or batch of images. The paper document can be scanned in through an ADF or via a Flat Bed. The quality of the scanned output will depend largely on the specifications of the scanner.
Scanner
A device that converts paper document into digitized images. Scanner speeds are usually measured in pages per minute (ppm) for throughput. They can scan single-sided (simplex) or dual-sided (duplex) pages, and have parameters similar to copy machines (contrast, paper size, etc.). Scanner speeds can range from 2 pages per minute to 120+ pages per minute. Pricing also varies according to the features offered.
Search
The act of looking for something in order to achieve a desired goal. In PowerRetrieve Search, the term 'Search' is often interchanged with the term 'Query' in order to depict an action whereby data is interrogated using a number of techniques in order to return the most pertinent information. PowerRetrieve Search utilizes the power of Adaptive Pattern Recognition Processing (APRP) to provide a fluid retrieval system.
Simplex (or Single-Sided) Scanning
The process of scanning a single side of a page only.
Simplex Film
Film on which images are arranged in a row.
Skew
Crooked image appearance which occurs when the bit mapped image is not square with the page. Generally this is caused by slippage or misfeeds during the scanning process.
Small Computer Systems Interface (SCSI)
A standard data pathway used mostly for connecting peripheral devices like hard drives and CD-ROM drives, but also a common interface for scanners and sometimes printers. SCSI defines both software and hardware industry standards for communication between the computer and its peripheral devices. Devices designed to meet SCSI standards should be able to work together. Currently one of the most flexible methods of interfacing with hard drives. It comes in two major varieties: SCSI-2 or WIDE SCSI. Both can now be Ultra, which doubles the speed.
Tagged Image File Format (TIFF)
An image file format originally developed by Aldus Corporation. Most document imaging systems save scanned images in this format. There are several variations and standards for governing the specifications for TIFF files. File extension is TIF.
Threshold
A scanner setting used to convert greyscale originals into bitonal images. The setting's values can range from 0 to 255. The setting determines how faint a pixel can be and still be assigned the value black. Higher settings produce darker images by setting a wider range of pixels to black.
Toolkit Without An Interesting Name (TWAIN)
A standard software protocol and Application Programming Interface (API) for communication between software applications and image acquisition devices. Designed to make all scanner manufacturers adhere to a fixed interface, eliminating the need for distinct scanner drivers for each type of scanner.
Unblipped Film
Film that does not contain an Image Mark (blip). Search capability of this type of film is available if edge detection is switched on.
White Crop
The process of removing any white areas around an image, resulting in a clearer image, which can improve OCR results. This is a feature of InfoCAP Technologies' book scanning software, PowerBook, and microfilm/fiche digitization application, PowerFilm.
Wide Area Network (WAN)
A topology that provides a common connection enabling high-speed transmission of data over an extended distance, usually over one mile. WANs enable computer resources such as PCs (workstations), file servers, printers, etc., to be shared over a wide area. WANs can be connected by a variety of cabling including twisted pair, coax, or fiber optic cabling.
Zoom
A viewing option available from the main graphic user interface that allows the user the chance to zoom into or out of an image, depending on the requirement. Users may also zoom to a specific selection – they must "rubber band" an area to zoom into, and choose "zoom to selecion" to zoom in on the area they’ve have defined. Icons are included to zoom in or out of a displayed TIFF image, as well as zoom to selection.
Misc
Back to Index1-Level
Image marks are used for image search on a roll of film which is assumed to consist of multiple pages. Each image mark is assumed to indicate a page which contains an image, regardless of the mark size.
2-Level
Image marks of two different sizes are used for image search on a roll of film which is assumed to consist of multiple files. Each smaller image mark is assumed to indicate a page containing an image and each larger image mark the first page of a file consisting of one or more pages.
3-Level
Image marks of three different sizes are used for image search on a roll of film which is assumed to consist of multiple blocks each consisting of one or more files. Each small-size image mark is assumed to indicate a page containing an image, each medium-size image mark the first page of a file consisting of one or more pages, and each large-size image mark the first page of a block consisting of one or more files.
2400DSV
The 2400DSV Universal Microfilm Digital Scanner is an A4 screen 'state-of-the-art' digital microfilm scanner which combines the benefits of analogue printers with the clean 'easy-to-use' digital nature to today's applicational requirements. The 2400DSV with our PowerFilm application software provides a 'gateway technology' with fax and e-mail options within the Windows Operating Systems NT4, 2000 and XP.
3000DSV
The 3000DSV is the ideal tool for service bureaus or end users wishing to convert a wide range of microfilm formats to a digital one. The 3000S series universal A3 viewing screen scanners have an option of our 'world exclusive and developed' PowerFilm software application for Windows NT4, 2000 and XP operating systems.









