Although the term scanning can refer to any process—be it analog, digital, or strictly optical—it will be used in the following article (and the related cross-references) to referr to the process of capturing images electronically for conversion to digital form, primarily to obtain color and black-and-white images for prepress or multimedia purposes. Scanning is performed using a device called a scanner, which essentially uses transmitted or reflected light to capture an image in tiny increments (pixels) of an image. The size of these increments is one of several ways that a good scan is separated from a bad scan.
There are those who consider scanning to be merely a bridge or transitional technology, linking earlier photographic processes to nascent digital camera technologies. That may very well be the case, but for now scanning is an integral part of the prepress and imaging processes.
==PRINCIPLES OF SCANNING==There are several different types of scanners, each of which operate somewhat differently (see below), but they all use different means to do the same thing: convert analog (i.e., optical) data into digital data.
'Bit-Depth'. All digital information exists as bit's, or combinations of 1 and 0. Thus, each pixel (or individual dot making up an image) is represented or described using combinations of these two numbers. The number of bits per pixel that a scanner can store thus determines how much color information it can recognize. For example, in a simple black-and-white scanner, the device can only handle one bit: black or white, "image" or "no image." This is fine for line work or even text, but is inadequate for photographs or other continuous tone images. Scanners thus can store increasing numbers of bits per pixel, the more the better. Two bits per pixel can add light gray and dark gray to the basic black-and-white dichotomy, increasing somewhat the level of detail possible. (Increasing the number of bits increases exponentially the number of levels of gray or colors that can be captured, since one bit can be one of two values. For example, one bit can express pixels that are either black or white. This is represented by taking the number 2, the number of possible values per pixel, to a power equal to the number of bits the device can store (e.g., 21 which equals 2). Two-Bit systems would be represented as 22, or 4, equal to black, white, light gray, or dark gray. 4-Bit would be represented as 24, and so on. See 1-Bit Color, Two-Bit Color, 8-Bit Color, etc.)
Because of the number of gray levels the human eye can detect, a scanner needs to be able to capture a high numnber of gray levels in order to capture photographic detail and render it with acceptable accuracy. The human eye can differentiate a little over 200 shades of gray, so using less than that results in the phenomenon of banding, or the visible detection of the discrete gray levels. Thus, most grayscale scanners utilize eight bits per pixel, which results in 256 levels of gray, which is more than adequate to create smooth blends in grayscale images.
Thus, when a scanner scans a photograph (popularly referred to as black-and-white but technically considered "grayscale"), the light source generated by the machine's optics move along the image pixel by pixel, measuring the level of gray and storing it as a particular value, or combination of bits. A light gray, for example, may be represented by 01110110. When the image ultimately appears on the computer screen after scanning, those pixel descriptions will be converted back to the appropriate gray level.
In color scanning, eight bits per pixel is inadequate to represent the vast range of colors available. In many multimedia applications, however, 8-Bit color is the most a system can handle, but for high-quality color prepress it is of too low a quality to be useful. A color scanner generally makes use of 24-Bit color. It does this by making three passes over the original color image, each pass using a different filter to capture one of the three additive color primaries: red, green, or blue. It stores the color information for each of these passes as 256 gray levels, or eight bits per pixel per color. Since, say, red, in some color models consists of a pure hue with varying degrees of lightness or grayness, each gray level captured for red is equivalent to one of 256 shades of red. This is true for the blue and the green components as well. When the final scanned image is ultimately displayed on screen, the program being used to display and process the image creates a composite image of these three grayscale images. (In Adobe Photoshop, for example, each of these channels can be viewed and adjusted separately.) This process is very similar to the prinmciple behind process color printing, in which three or four different plates, each of which contains only shades of a single color, are overprinted to yield a much wider color spectrum.
Many scanners now, in the interest of efficiency, make only one pass with the scanner mechanism, which successively captures the three colors per pixel. This is called strobing, and it is one way of eliminating color drifts caused by tracking errors which can creep in during successive passes of the scanner optics. This doesn't always avoid color drifts, however, and some degree of color correction is often needed following scanning.
'Resolution'. One important means of measuring the performance of a scanner is by resolution. Resolution is defined as the number of pixels per linear unit, in this case, pixels per inch (popularly but somehat incorrectly referred to as ["dots per inch [dpi]"]). The greater the number of pixels per inch, the smaller and less distinguishable those pixels are, which increases the detail which is captured and hence increases image quality. In the case of scanners, resolution is a function of sampling rate. A pixel captured by a scanner is called a "sample," and the frequency with which a sample is taken determines the pixel size and how many of those pixels occur in a unit of linear space. A scanner with a maximum resolution of 1200 ppi can fit 1200 pixels in an inch or, in other words, takes a sample every 1/1200th of an inch.
There is a difference between a scanner's optical resolution—or the resolution that can be achieved by the scanner's optics alone—and its interpolated resolution. Scanner software uses interpolation to "artificially" increase the resolution captured by the device. Interpolation routines average the grayscale values of sampled pixels and places new pixels between them. Thos works more or lss well, depending on the image. Although interpolation doesn't add any detail to a scan, it does tend to smooth out transitions between grays. In some cases, the resolution of an image can be doubled by interpolation. A caveat concerning resolution is that more is not always better. Capturing an image at a resolution substantially greater than the highest resolution possible by the output device will not improve image quality, and will require more time to process an image and higher levels of processing power, RAM, and disk storage space.
'Dynamic Range'. In addition to resolution, a scanner's dynamic range is important to consider in a choice of scanner. Dynamic range is defined as the ability of any reproduction to capture both highlight and shadow detail, or in other words detail in the lightest and darkest portions of an image, respectively. Dynamic range is measured logarithmically; a highlight that is ten times as bright as a shadow is said to have a dynamic range of 1.0. One that is 100 times as bright would have a dynamic range of 2.0. The higher the dynamic range, the greater the level of detail that is captured at both ends of the image spectrum. Desktop scanners have a dynamic range of about 1.8 to 3.8, while high-end scanners can go as high as 4.0. Dynamic range is a function not only of the inherent capabiolities of the scanner optics, but also of the bit depth per pixel. In many images, even eight bits per pixel per color is not enough to capture as much shade gradation and detail as the image warrants; some systems can go as high as 16 bits per pixel per color, which can eventually be distilled by the software down to 8-Bit. Some scanner protocols work with a total of 48-Bit-color rather than 24-Bit. As of this writing, PostScript-based output systems can only handle 24-Bit color, so a 48-Bit image needs to be distilled just prior to output.
TYPES OF SCANNERS
Just as there are a wide variety of imaging requirements, so too there is a wide variety of scanner configurations, which run the gamut from small, hand-held units used for scanning business cards to high-end scanners costing hundreds of thousands of dollars. Generally speaking, however, color scanners can be distinguished in four ways:
'Flatbed Scanners'. At the low-end of the price and performance spectrum are the flatbed scanners. These cost about $1,000, sometimes a little more, and sometimes, increasingly, a little less. Also known as CCD scanners, these use as their light detectors an array of charge-coupled devices, which are very small photocells. Their small size is what enables the devices utilizing them to fit on a desktop. The drawback, however, is that they are less sensitive to subtle gradations of color and gray values than larger photomultiplier tubes. Flatbed scanners are most often used for scanning reflection copy (or prints, rather than transmission copy such as slides and transparencies), although some models do have attachments for transmission copy. An image is placed face down on a glass platen, while the scan head (comprising a light source and a CCD array) moves down the length of the original, measuring the intensity of the reflected light pixel-by-pixel. The drawback to flatbed scanners—in addition to low dynamic range and maximum optical resolutions of under 1000 ppi—is that they cannot optically enlarge an image. This presents no problems if an image is going to be reduced in size or used at 100%, but loss of detail will occur at enlarged sizes. With reflection copy, this may not be a problem, since prints can be very large to begin with. When using transparencies (the standard size of which is 4 x 5 inches) or the even smaller 35mm slides, the original size is already very small and upsampling to the desired dimensions may degrade image quality unacceptably.
'Slide and Transparency Scanners'. A slide scanner works on the same basic principle as the flatbed scanner—a light source illuminates the image and sends it to a CCD array, which converts it to electrical signals which in tun can be translated to digital information—but is specifically designerd to overcome the transmission copy limitations found on conventuional flatbed scanners. In this case, the light source shines through the image onto the CCD array. These may soon become more popular than flatbed scanners, as transmission copy in general is capable of a much greater dynamic range than is reflection copy. Slides and transparencies are also much easier to handle, store, and transport than prints. Many slide scanners can capture images at up to 16 bits per pixel per color and at a resolution high enough for quality reproduction.
'Desktop Drum Scanners'. A drum scanner is a high-end device which uses photomultiplier tubes (PMTs) to record and capture color information. A transparency is mounted on a rotating (or stationary, depending on the type of scanner) transparent drum. As it rotates (or the optics rotate), a light source illuminates the image, where the PMTs detect it and send it to the computer. The use of PMTs makes these devices very sensitive to gradations in shade and color. Desktop drum scanners—descendents of much superior and more expensive high-end drum scanners—are becoming widely used for high-quality color work. They lack many of the "frills" and accessories of the high-end devices, many of which are not really needed any longer.
'High-End Drum Scanners'. The high-end scanners—which can be the size of a piano and cost even more—were the first scanners developed and continue to be used as part of a high-quality color electronic prepress system. Not only do these scanners have the best possible imaging systems, many also include built-in color computers which can convert from color space to color space (such as from RGB to CMYK) on the fly, but many also output directly to film using built in film recorders. These were widely used for making color separations, and still are, although more and more often captured images are output to a computer for additional manipulation and incorporation into electronic pages. (See Scanner.)
(The above devices are all used for color scanning of images, and are thus known as image scanners. There are other types of scanners, however. For example, OCR scanners are used in optical character recognition, which uses an optical scanner—often the same type of scanner used for capturing images, only running different software—to capture a page of text. The software can then recognize the characters that have been scanned and convert it to ASCII text for importation into a word processing program or database. OCR scanners and software are used in lieu of keyboarding, saving time. There are also bar-code scanners which are used in retail shops and elsewhere to read bar codes. See Optical Character Recognition and Bar Code.)
SCANNING SOFTWARE AND IMAGE ENHANCEMENT
Most scanners come bundled with some type of software which is used to control the scanning process, adjust contrast, set resolution and ultimate image size, crop the image, etc., prior to making the actual scan. Many flatbed scanners come with either full-fledged or "limited edition" versions of popular photo manipulation programs such as Photoshop. Many scanning software programs can function as plug-ins to programs like Photoshop, which means that images can be scanned directly into those programs. Many programs now also are compatible with the TWAIN standard, which allows the use of different scanners without requiring a variety of different device drivers. There are also a variety of third-party scanning software utilities available which allow enhanced image calibration and color correction prior to scanning.
'Resolution and File Size Revisited'. An important consideration, as was mentioned above, is to scan at the proper resolution. Most scanners and scanning software utilities allow the user to adjust the resolition (and image size) in advance. Some programs allow user-selectable resolution, which lets users enter a specific number (i.e., 221 ppi), while others have preset resolutions in increments (i.e., 200, 300, etc.). When scanning line art, it is best to scan at the resolution of the output device. For best results, between 600 and 1,000 ppi is the best resolution for text and line art. For halftones, it is in general good practice to scan at a resolution which is double the screen count of the halftone screen which will be used. Thus, if a 150-line screen is going to be used, a 300-ppi scan would be required. This is not a hard and fast rule, though; sometimes a resolution of 1.6 to 1.8 the screen count will produce high-quality images, as well.
However, when images are to be enlarged, the situation changes. The resolution will need to be doubled in proportion to the enlargement ratio. So if an image is to be enlarged by a factor of 2:1 (or, in other words, doubled) the resolution should be twice what it would be at 1:1. So the 300-ppi halftone image should now be scanned at 600 ppi. It is easy to see that depending on the respective sizes of the original and the desired image, the resoluition may need to go very high indeed. Enlarging a 4 x 5 transparency, for example, to a 24 x 30 poster is an enlargement of 6:1. If a 150-line screen were still going to be used, the scanning resolution would need to be about 1800 ppi. Greater enlargements (such as those made from 35mm slides) and higher screen counts require even higher scanning resolutions. Up to 3,000 ppi is not unusual.
As one would expect, file size increases as resolution (and image size) increases. File sizes are measured in units known as bytes. One byte equals eight bits. One thousand bytes (actually 1,024 bytes) equals one kilobyte (abbreviated KB) while one million bytes (actually 1,048,576 bytes) equals one megabyte (abbreviated MB). Let's use this example to determine file size: a grayscale image scanned at 100% at 300 ppi at 8 bits per pixel. The image is thus 2400 pixels wide (300 ppi x 8 in.) by 3000 pixels deep (300 ppi x 10 in.), having a total of 7,200,000 pixels (2400 x 3000). At eight bits (one byte) per pixel, it takes a total of 7,200,000 bytes to describe this image, or 7.2 megabytes. If this were an RGB color image at 24 bits per pixel, it would require 21.6 MB (7.2 x 3) to describe. If it were converted to CMYK, it would require 28.8 MB. And that was only at a relatively low resolution of 300 ppi.
Consequently, it is easy to see the pitfalls of scanning at a much higher resolution than will actually be needed. Large files take a very long time to process, even on fast computers, and take up a lot of disk space, which may be in short supply. Generally, though, it is better to scan high and have to reduce than to scan low and have to increase. Increasing the resolution (in Photoshop or other such program) after scanning rarely results in very good-looking images.
'Scanning Originals'. The manual process of placing an original in or on a scanner for scanning has its own share of considerations. Needless to say, flatbed scanners should have their glass platens as free of dust, dirt, and other detritus as possible. Transparencies and prints should also be inspected for dust, scratches, or other visible problems that may be magnified by the scanning process. When attaching a transparency to a drum scanner, it is important that all parts of the image be flat against the drum; if any part of the image varies in distance from the scanner optics as the rest of the image, distortions in the scanned image will be evident. Sometimes, oil mounting is performed so as to eliminate an optical problem known as Newton's rings, or haloes of color caused by refraction of light passing through a transparency. Adhering the transparency to the drum by means of a clear oil can reduce this problem.
'Image Correction and Enhancement'. There are a variety of ways of fixing and correcting scans. Depending on the scanner and tghe scanning software, it may be possible to do this prior to or during scanning. Often, however, especially with flatbed scanners, such processes can only be handfled after scanning, in an image manipulation program such as Photoshop.
Some of the most common activities include sharpening, variously known as edge enhancement or unsharp masking. In the latter designation, abbreviated USM, the scanner includes a separate photomultiplier tube which captures a slightly out-of-focus signal. This somewhat blurry (or "unsharp") signal is added to the sharp signal. The effect of this combination—used for many years in photography—is to sharpen the contrast at the edges of boundaries between separate portions of an image. (When USM is performed after scanning in a program such as Photoshop, it is effedcted by calculating the differences between the values of adjacent pixels and increasing the contrast between them.) Too much unsharp masking, however, can produce excessive noise and distortion in an image.
Tonal adjustments can also be made in a scanned or to-be-scanned image. This can take the form of adjusting the endpoints of an image (i.e., whitest white and blackest black, or highlight and shadow, respectively) or adjusting the midpoint of the image or the distribution of tones in the image. Similarly, color correction may be needed, dpending on the quality of the scanner. Sometimes, a scanner will impart a color cast to an image, and at other times a few of the colors in the image will be off. Global correction is the correction of the color throughout the entirety of the image, shich can consist of darkening all the reds, for example. Local correction is the changing of the color of one partocular portion of an image, such as only the red of a fire hydrant present in the image.
Depending upon the nature of the image and the context in which it is ultimately to appear, further types of manipulations may be required, including formaing collages, removing elements from the image, inserting elements in the image, etc.
There is no hard and fast rule to these adjustments, of course; most good software and scanning programs have "preview" functions which allow the user to see what the effects of a partocular adjustment will be before they are actually made. The best judge of any image or color correction operation is the human eye.
BEYOND SCANNING
It has been suggested that scanning may ultimately be replaced by other forms of imaging, especially digital cameras, which capture images directly in digital form. Tere is widespread popularity and ethusiasm for thewse devicxes, but so far quality and price issues have impeded their widespread use. But they are gaining ground. The popularity of the Photo CD, which many perceive to be a transitional medium, is an indicator that prepress departments and other users of digital images would like to eliminate the scanning phase as much as possible.