Scott
Kehoe, Technology Consultant - Northeast
Massachusetts Regional Library System (NMRLS)
scott@nmrls.org /
978-762-4433x16 / AIM-bibliotechy / del.icio.us/bibliotechy
| Bit, Bit-depth & Grayscale | Pixel | Resolution & DPI | Interpolation |
| TWAIN | JPEG or JPG | TIFF or TIF | Megapixel |
| OCR
(Optical Character
Recognition) |
CD & DVD discs | RESOLUTION RULES OF THUMB | Web Links |
Bit, Bit-depth & Grayscale – determines the maximum number of colors available in a digital image. The more bits, the more color. Or the more depth of color as perceived by the human brain. The following are standard bit-depth ratios:
|
24 bit image |
8 bit image |
8 bit Grayscale image |
|
An image can contain up to 16.7 million colors. |
An image can contain up to 256 colors. |
An image can contain up to 254 shades of gray, black and white. |
|
|
|
|
![]()
Pixel
(picture element) - Digital
images are made up of individual pixels.
Each pixel conveys a different color that is dependant on
the available
bit-depth of the image. The
more pixels
captured when scanning or digitally photographing an image the better.
![]()
A
longer explanation is
available below, but suffice it to say that for everyday use of your
scanner,
“what you don’t know won’t hurt you.”
But, what you should know are the … Resolution Rules
of Thumb
|
Printing
photos / OCR
text = 300dpi Digital Master/ “Archival” image = at least 600dpi
|
Interpolation or Interpolated Resolution - Enhanced resolution created by software. This is additional resolution above what a scanner is optically able to produce. Interpolation adds what it thinks is appropriate data to an image in an attempt to achieve a higher resolution. Note that most Library, Museum, and Archive digitization projects rely on the optical resolution of their equipment to determine standards and best practices. For instance, NMRLS Digital Library Initiative standards call for a non-interpolated resolution of 600dpi for digital master images.
TWAIN - it’s an acronym that doesn’t mean anything, or as some report it, it means “Technology Without An Interesting Name.” Essentially TWAIN software is what allows a computer and scanner talk to each other. You need TWAIN drivers installed on your computer for the computer to recognize and utilize your scanner
OCR (Optical Character Recognition) - process that converts text in an image file to a word processing document . For example a TIFF image file of an invoice in run through OCR software to produce a Microsoft Word document that can be fully edited. As a TIFF image file Microsoft Word would treat it as a picture and you would not be able to correct misspelling, typos, etc. Once run through OCR software this former image becomes a document that can be treated as any other word processing document.
This is a very useful process for older documents that have no digital or computer version (type-written letters, newspapers). It is not a foolproof process, different fonts, word spacing, and foreign languages can create word processing documents that may need a great amount of proof-reading and correction. Unfortunately, it is not a process that is particularly useful for hand-written documents as the variations in hand-writing styles is still too great for OCR software to overcome at this time. Most new home-use flatbed scanners come rudimentary OCR software which can handle most simple type-written documents.
Lossless & Lossy Compression– terms refereeing to an image file formats’ way of compressing data.
Lossless files compress image data and do not lose any image information when an image is saved and/or compressed. But lossless image formats can create very large files.
Examples of lossless image file formats: TIFF, PNG
Lossy files compress image data and do lose image information every time an image is saved and compressed. Over time images may appear grainier and grainer as an image is saved and re-edited and saved again. Lossy compression is not effected each time a file is opened and closed for viewing, only when the image is opened and edited and saved in an image editing software program like PaintShop Pro.
Examples of lossy image file formats: JPEG, GIF
Megapixel
–
a digital camera term referring to
a resolution of 1 million pixels.
A
camera with the ability to take photographs at a resolution of 2.1
megapixels camera
translates to a cameras with the resolution of 1200 high by 1800 pixels
wide
(1200 x 1800 = 2,160,000). This
term has
become the yardstick by which camera manufacturers use to measure their
camera’s capabilities against their competitors.
The higher the megapixels, the more pixels,
the more colors, the more detail, and the better the image.
a note about DVDs
…
DVD-R and DVD+R formats hold up to 4.7 gigabytes of information! And prices for DVD readers (players), writers (burners), and the discs themselves keep falling.
Care
and Handling of CDs and DVDs: A guide for Librarians and Archivists, Fred
R Byers.
Council on Library and Information
Resources / National
(CLIR pub21). FREE DOWNLOAD (PDF format): http://www.clir.org/pubs/reports/pub121/pub121.pdf
Updated:
3/27/08 Scott Kehoe,
Technology Consultant, NMRLS