Scanning Tips and Guidelines

Typed Documents

Typed pages are typically going to be black and white images so there is no need to scan them in full color, grayscale will do just fine. Scanning in grayscale also makes it easier to make the background a pure white and create better contrast. It is also one third or less the size of the equivalent color image. Scanning resolution also depends on whether you want to OCR the document or just want to preserve an image of it. Typed documents are usually not as important as an image, rather the information that is presented on the page - thus preserving the information is more important than the look. This being the case, the is nothing wrong with using a compressed file format, such as JPG, to store the file. Storing as a JPG will save a huge a mount of space as well.

For advanced users, you can even convert a typed document to 4-bit grayscale and save as a GIF, which typically yields even smaller file sizes than JPG and usually with better visual quality.

For OCR the purpose is to use a temporary image of the document so a computer can convert the image to an actual editable text file. Because the information is transitory, grayscale works great for the color depth and you definitely want a smaller file size so use JPG for the file type. The resolution is higher, however, because the computer needs a good image of each letter on the page.

Document Preservation
Scan Resolution: 150-200 dpi
Color Depth: Grayscale (8-bit) or True Color (24-bit), depending on your desire.
File Format: TIF or JPG

OCR
Scan Resolution: 200-300 dpi
Color Depth: Grayscale (8-bit)
File Format: JPG

Original Photographs

Original photographs are definitely worth preserving, especially since most photographs starting in the 1950s fade over time - especially color photographs. These are the highest resolution scans that we recommend - a full 600 dpi. We also recommend storing the images in the TIF format, which does not loose any detail but also takes up a lot of room. A typical 8 x 10 photograph scanned at 600 dpi in full color stored as a TIF will take about 120 MB.

Color (includes sepia tone)
Scan Resolution: 600 dpi
Color Depth: Full color (24-bit)
File Format: TIF

Black & White
Scan Resolution: 600 dpi
Color Depth: Grayscale (8-bit)
File Format: TIF

Tintypes and Daguerreotype

Tintypes and Daguerreotypes were typically very small, sometimes measuring only 1 inch square. To really preserve these, you should scan at a minimum of 600 dpi but can go higher if you desire. Also, there was no color so scanning in grayscale will save you some space.

Scan Resolution: 600 dpi or higher
Color Depth: Grayscale (8-bit)
File Format: TIF

Artifacts

Pictures of artifacts or artifacts them selves can be scanned for preservation purposes. And for this, the field is pretty wide open to what you want to do with the image. Try and match your usage to one of the above-listed applications and follow those guidelines.

Posting on a Website

Web sites require much smaller images than are typically obtained from scanning. Except for special cases, most images you want to put on the Internet should be about 500-600 pixels in width. If you are scanning a 4 x 6 photograph, you would use between 80-100 dpi. Normally, however, you would want a high-resolution scan done of the photo or image, then resize that image to 500-600 pixels using your favorite graphics manipulation program.

You really only have two choices in file format when posting an image to a website: JPG and GIF. There are differences between the two, each one excelling in a particular usage. As a general guideline JPG is best for photographs, GIF is best for documents.


Digital Cameras & Megapixels

Digital cameras are very popular now and many people are using them instead of traditional film cameras. Digital cameras, as a general rule, always store their images a JPG (meaning they are compressed and some information is lost). But the tradeoff is you can fit many times more JPG files in your camera than you would in an uncompressed format.

Megapixels is a measurement, undoubtedly invented by someone in marketing, of how large an image the camera can take (or rather, how much resolution the image has). The following table might help you compare megapixels to scanning and printing.

Megapixels
Image Size
Avg. File Size
Raw / JPG
4 x 6 Photo Comparison
Native Print Size
(300 dpi)
1.3 1280 x 960 3.7 MB / 0.5 MB 200 dpi 4.25″ x 3.3″
2.1 1600 x 1200 5.7 MB / ?? MB 266 dpi 5.3″ x 4″
3.0 2048 x 1536 9 MB / ?? MB 341 dpi 6.8″ x 5.1″
4.0 2400 x 1600 12 MB / 1.0 MB 375 dpi 8″ x 5.3″
5.0 2560 x 1920 15 MB / ?? MB 426 dpi 8.5″ x 6.4″
6.3 3088 x 2056 19 MB / ?? MB 514 dpi 10.25″ x 6.8″
8.0 3264 x 2448 24 MB / ?? MB 544 dpi 10.9″ x 8.1″
11.1 4046 x 2074 25 MB / ?? MB 647 dpi 13.5″ x 9″
12 4256 x 2848 36 MB / ?? MB 709 dpi 14.2″ x 9.5″
14 4536 x 3024 41 MB / ?? MB 756 dpi 15.2″ x 10″
16.6 4080 x 4080 50 MB / ?? MB 680 dpi 13.3″ x 13.6″
20 5248 x 3712 58 MB / ?? MB 875 dpi 17.5″ x 12.3″
30 6560 x 4640 91 MB / ?? MB 1093 dpi 21.8″ x 15.5″
41 7648 x 5408 124 MB / 15.8 MB 1274 dpi 25.5″ x 18″

 

 
© 2004 John Marriott Historical Society, All rights reserved.