Stereographic Theory - Terry Blackburn on the Web!

STEREOGRAPHIC IMAGING
IN
THEORY AND PRACTICE
by
Terry Blackburn

Note: This document is not complete. Many reference figures are missing and some sections are incomplete. I will finish them as I have time and material. I promise! tb!

TABLE OF CONTENTS:
i. Introduction

What is Stereographic Imaging?
- History...(really brief)
- How does it work?
Stereoscopic Theory
- Why do we see in 3D? The perception of depth...
- How far away is infinity? Separation anxiety...
- Rules of Composition Let's complicate matters further...
- Oh, by the way... Stupid eye tricks
- FAQ:
  - I have trouble seeing through those funny colored glasses. When I close one eye, I can see fine. Am I seeing 3d then?
  - I have this really cool picture I took last summer. Can you make it into a 3d picture?
  - I don't get it. What's the big deal?
Stereographic Presentation Methods
- Drawbacks and limitations
- Stereo Pairs - free view or viewer aided
  - parallel view
  - crossed eye view
- Stereo Projection
- Anaglyphs
- Lenticular
- The Pulfrich Effect - http://dogfeathers.com/java/pulfrich.html
- Other methods - lenticular, LCD shutters
- Drawbacks and limitations.
Creating Stereographic Images for Distribution via the Internet
- Pairs
- Anaglyphs

Introduction:

Stereographic imaging is at once art, science and practiced technique. We take three-dimensional imaging so much for granted that we sometimes forget the intricacies and difficulties associated with presenting 3d images. This document is designed to present an overview of the theory and techniques of stereographic imaging. To the veteran, the examples presented are standard fare, easily observed images. To the first timer, it is impossible to view the examples without instruction in the equipment and techniques necessary to realize the images. It's a two-edged sword. The examples in the section on stereographic theory depend on information only provided much later in the document; specifically, the viewing of anaglyphs and stereo pairs. If you are new to stereographic imaging, you may wish to initially skip ahead to the section on Stereographic Presentation Methods. This section will help you acquire the necessary knowledge to view the examples presented in the section on Stereoscopic Theory. If you find this document interesting and helpful, you're welcome!

What is Stereographic Imaging?

History

Stereographic Imaging is almost as old as photography itself. From the very earliest times, photographers have sought to reproduce the world around them in as realistic a manner as possible. Examples of stereographic daguerreotypes exist from the mid 1800's. The stereopticon with its cards were popular for several decades after the turn of the century. In the 1950's, Viewmaster reels were the medium of choice for the public distribution of 3D material. Holograms, which are beyond the scope of this dissertation, achieved visibility with the advent of laser technology in the 1960's.

How does it work?

We have two eyes. Each eye has its own unique viewpoint. Very simply put, take a picture. Move 2.5 inches to the right and take a second one. Look at the first picture with your left eye, look at the right picture with the right eye. Voila - 3d! Why does it work? Read on...

Stereoscopic Theory - (The technical stuff...)

Why do we see in 3D? The perception of depth...

The perception of depth has a number of contributing factors. The relative size of adjacent objects can contain depth proximity clues. For example, figure 1 displays a person beside a building. Even though we cannot see the bottom of the building or the feet of the person, we assume the person is closer because past experience tells us buildings are larger than people, therefore the person is much closer. The limitations of this type of depth perception are obvious. The successful interpretation of depth clues are based on past experience and can be easily subverted. The building in figure 1 is actually a 4' model and is standing 16 inches in front of the person!

Moving images can also provide depth clues. Objects in the foreground are observed to have greater displacement relative to those further away at a given speed. Here again, the effect is largely a conditioned response to previously observed phenomenon. Only under highly specific circumstances can absolute depth information be extracted from moving images. These circumstances are explained later under the topic 'Pulfrich Effect' below.

So, while some depth clues may contribute to the overall visual experience, true depth perception exists exclusively in the stereoscopic domain. True stereoscopic imaging contains depth information completely independent of learned spatial relativity. Even abstract objects can be perceived in absolute spatial relationship to each other. This would be an opportunity to present another figure in support of this portion of the discussion. However, if you are new to the concept of stereographic imaging, you probably don't have the tools or experience necessary to view the example. Figure 2 illustrates this idea. Please return here when you are ready to view the example.

The perception of depth is the result of image convergence disparity, measured very precisely by the brain. Human eyes, spaced approximately 2.5 inches (6.25 cm) apart, each capture an image very slightly, but significantly, different from each other. Figure 3 demonstrates these seemingly minute differences in a rather dramatic way. In this example, two images of the same subject, taken from two very slightly varying perspectives (less than 2 inches of separation), are superimposed over each other, one in red, the other in blue. The red image displays significant foreshortening, suggesting that it was taken from a perspective to the right of the blue image.

A number of operations must occur in order to resolve the differences that result from your two eyes viewing the same object from different perspectives. First, the center of interest must be identified. Both eyes move to center this area of the scene on the optic receptors at the back of the eye. If a double image is detected, resulting from the object of interest being focused on different areas of the optic receptor matrix (retina), the eyes adjust horizontally until a single, or converged, image results. At this moment, it may be possible to detect that other objects within the same field of view do not produce converged images. As the attention is focused from one object to another, the eyes repeat the process of tracking horizontally until a converged image is obtained. Objects at a great distance require the horizontal alignment of the optic system to be parallel. Closer objects require increasing angular deviations. Angles greater than 15 degrees can be uncomfortable, and a 30 degree angle can be painful.

A simple experiment demonstrates convergence quite clearly. Focus on an object several feet distant. Raise you hand with one finger extended into your field of view. The resulting double image, two extended fingers rather than one, is quite obvious. Now, focus your attention on your finger and observe the distant object as divergence occurs. Focus on your finger as you move it closer to your face. Observe the increasing divergence occurring in the background.

Depth is then measured by interpreting the angular deviation necessary to produce a converged image. Objects at varying depths in a given scene have varying relative horizontal placements, depending on the perspective of the optical unit, requiring varying alignment of the optical system to achieve convergence.

Separation anxiety... How far away is infinity?

Traditionally, forever is a long, long time. From a practical standpoint, there really is a limit, a point beyond which no perceptible change can be detected. In stereoscopics, infinity is a function of the baseline, or separation, between the stereo pair, and the distance to the object being observed. The 'functional infinity' of the human optical system is considered by ophthamologists¹ to be 20 feet (7 meters)! At 20 feet, the angle of convergence, the angle that your eyes deviate from completely parallel, is only six tenths (.59) of a degree. Surprisingly, it is possible to detect convergence out as far as 120 feet (40 meters) where the angle is 0.1 degrees. However, at this point the effect is of more trivial value than practical. Beyond a distance of a dozen or so yards (meters) the 2.5 inch baseline provided by our eyes makes it impossible for depth to be perceived through convergence measurement. Beyond that distance all objects converge at the same visual angle. At this point, the optical system alignment is essentially parallel, perpendicular to the baseline. When you stop and think about it, the difference between convergence angles of an object 100' away and one 200' away is .06 of a degree! Yet, it is still possible to clearly discern the space between the objects.

How we perceive depth is closely tied to the baseline of our internal optical system. If that baseline is altered, a unique psychological phenomenon can be observed. Under normal conditions, an object 10' away requires a convergence angle of 1.2 degrees, using the standard 2.5 inch baseline. If the baseline is extended to 5 inches, the convergence angle becomes 2.4 degrees, the same angle that would normally indicate an object 5 feet away on a 2.5 inch baseline. Additionally, objects 50 feet away produce convergence angles of those only 25 away. This results in an enhanced sense of depth. The interpretation of this information by the brain is very interesting. Past experience cannot be ignored. If the objects presented are familiar, non abstract items, the brain interprets the distances based on the convergence angle, and psychologically reduces the size of the object to fit into that space. Figure 4 presents a series of images of the same scene with increasing baseline separation.

Rules of Composition (the practical stuff...)

In addition to the standard rules of photographic composition, stereographic photography introduces another set of basic rules to be considered in addition. A basic rule of stereographic imaging is to include foreground objects in the picture to optimize the characteristics of the technique. Otherwise, there's no point in taking a 3d picture, is there? Foreground objects are important because the effect of depth falls off dramatically as distance to the subject increases. In general, 3d scenery pictures shooting off to the horizon are ineffective. Sometimes the desire to capture a scene is so overwhelming that we ignore good judgment and snap the picture anyway, and are disappointed with the results later.

A simple technique can be employed to enhance depth and turn a less than ordinary shot into a spectacular demonstration of depth. Estimate the distance to the closest object in the scene. Then, determine the ideal placement of the object in the scene. Suppose the closest object is at a distance of 100'. Ideally, an object within 5' to 10' feet is desired. Let's place the object at 7 feet. By extending the baseline we increase the convergence angle and reduce the apparent distance to the object. Using simple ratios we can compute a new baseline using the following equation - 100 / .21 = 7 / x. The value .21 is 2.5 inches converted to feet. 100 / (.21 * 7) = 1.47 feet or slightly less than 18 inches. The psychological implications of this technique can be extrapolated further by comparing the new baseline to your own height. If you are 5'10" looking at an object 7 feet away, then an object 100 feet away with the same convergence angle would proportionately make you 83 feet tall!! That's why the objects in the image appear smaller with an increase in separation.

This technique will drive purists nuts, and my apologies. Art only imitates nature. It doesn't necessarily reproduce it verbatim. There are obvious drawbacks to this technique as well. Placing a very large object virtually too close within the observation space will cause convergence difficulties. The brain is well experienced at quickly guessing and presetting the convergence angle based on the image contents. Too much stretching of virtual depth can result in 'fishing' as the eyes search horizontally attempting to converge at expected depth rather than the actual virtual depth. This is not a good thing. Techniques for avoiding this will be discussed later.

Oh, by the way...

Your eyes can be trained to do all sorts of amazing tricks. There are, however, a few things your eyes could do for which there is no practical application. And, in fact, if they do some of these things, a trip to your local ophthamologist might be in order. Your eyes are designed to work together on the same horizontal plane. If one eye were to look up and the other look down, for instance, really bad and probably painful things are happening. Your eyes only do things that make images converge, and this situation just doesn't occur in nature.

Divergence, or 'wall eyed' behavior is anomalous as well. It is actually possible to force a small degree of divergence under controlled conditions, but more than a few moments in this situation will result in a massive headache. It is not recommended even for brief experimentation.

Why do I mention these issues? Because in designing and presenting 3d images it is important NEVER to expect these things to happen. An image that is not properly aligned both horizontally and vertically will cause fatigue, pain, and general ocular discomfort. It is important to have a solid understanding of the mechanics involved so your images can be fine tuned for optimal viewing effect AND comfort.

Frequently Asked Question:

I have trouble seeing through those funny colored glasses. When I close one eye, I can see fine. Am I seeing 3d then?

NO. Both eyes are required for stereoscopic imaging. Viewing with one eye closed totally defeats the purpose of stereographic reproduction. Some practice may be required to master the viewing of anaglyphs. It can take several seconds for the eyes to adapt to the filtration used in anaglyphic glasses. Usually a little patience is all that's necessary. Individuals with colorblindness issues may have difficulty as well.

I have this really cool picture I took last summer. Can you make it into a 3d picture?

NO. Stereoscopic imaging requires two views of a scene taken from different perspectives. Unless you took two pictures at the time, that scene is lost to the three dimensional world forever. Sorry... Now, if you were driving down the road with a movie camera pointed out the side window... maybe...

I don't get it. What's the big deal?

A percentage of the population simply have limited ability to perceive depth. While uncommon, it's not rare. Various uncorrected or uncorrectable ophthalmicconditions can result in difficulty with image convergence. Some people are colorblind, others are tone deaf, oh, well...

Stereographic Presentation Methods

Drawbacks and Limitations

Despite the relative ease at which our own stereoscopic optical system (eyes) collect and analyze three dimensional scenes, capturing and reproducing these scenes is not always convenient. With the possible exception of lenticular images and holograms, the presentation of stereographic images requires either special equipment, special viewing techniques, or both. Below is a basic rundown of various popular presentation methods and some of the issues surrounding the use of each.

Stereo pairs generally require external optical systems, or viewers, to present individual images to each eye with proper focus and alignment. Examples of stereo pair systems include ViewMaster and the older Stereopticon. With training and practice it is possible to view stereo pairs without the aid of a viewer. Free view images can be presented as either crossed eye or parallel view pairs. Difficulty can be encountered if the images are too large or too small.

Anaglyphs require the use of special filters covering each eye, usually one red and one blue. Without these filters, the image is jumbled and difficult to decipher. Individuals with colorblindness may not be able to view them at all. There is no amount of special training or practice that will allow the viewing of these images without the filters. Because of the filtration necessary, color rendition is generally poor, and even under the best of conditions some cross image bleed-through produces a ghosting effect.

Projecting stereo pairs combines the viewing optical system and filtration devices to allow large numbers of people to view projected stereo images. The viewing system in this case is a projector utilizing polarized filters. The dual projection system superimposes the stereo pairs through filters polarized at 90 degree angles to each other. To extract the discreet images from the superimposed composite on the screen again requires the use of polarized filters at 90 degree angles over the eyes. The screen must be specially manufactured to ensure a high coefficient of polarized reflectivity. The head must be held level while viewing images to prevent cross image bleed-through.

Lenticular images require precise cutting of images into vertical strips, re-assembled alternating left and right images, and displayed behind a special viewing matrix. Unfortunately, image clarity and resolution suffer.

The Pulfrich effect again requires filtration. This time, only a single dark filter is placed over one eye. This effect only works with moving pictures (movies). There are additional restrictions on camera movement. If the camera stops moving, the image disturbingly reverts to flat. Moving too fast creates exaggerated depth.

Additionally, in recent years, techniques have been derived to take advantage of computer video technology. A discreet stereo pair are interlaced on the video monitor, each image displayed on alternating scans of the video raster. The images are de-coded, if you will, by an electronic shutter placed over each eye. The shutter for the left eye opens as that image is traced on the monitor. The left shutter closes, and the right shutter opens as the right image is traced on the screen. The shutters use electronic LCD technology to quickly darken and lighten the viewer 'shades'. Currently, this technology is expensive and not well standardized. Proprietary software and hardware are required to view images.

Stereo Pairs

Stereo pairs are the most common and straight-forward method of presenting stereographic images. Image quality is uncompromised because no filtering or other image manipulation is necessary to produce the 3d effect. There are two basic methods for viewing stereo pairs - parallel and crossed-eye. While the viewing of pairs is usually accomplished utilizing optical systems, it is possible to view these images without the aid of optics. This technique, called 'free viewing' requires practice and can result in eye strain headaches initially. Free viewing is most effective with crossed-eye images and is described in that section below.

Parallel view is the most common stereo pair viewing method. Utilized extensively in commercial applications, it is the method of choice in all types of viewers from the stereopticon to the ViewMaster reels still being produced today. In this viewing system, the left image is presented to the left eye, and the right image is presented to the right eye. The optical systems utilized can produce vivid, full field of view images with striking clarity and realism. Care must be taken to ensure that they eyes are never forced beyond a parallel position into a position of divergence. This technique is most suited for viewing with the aid of a carefully aligned optical system.

Crossed eye view, to my knowledge, is exclusively used for free viewing. This technique presents the left eye image on the right, and the right eye image on the left. The three dimensional image is realized by 'manually' crossing ones eyes, forcing the left eye to view the right side image, and the right eye to view the left side image. With careful practice, it is possible to converge the two images into a single, three-dimensional image. Crossed eye viewing demonstrates several advantages over parallel view for free viewing. With parallel viewing, if the two images exceed more than 2.5 inches in horizontal size, the eyes are forced to an obtuse angle to converge the images. This is painful, at best. This small image size limits the amount of detail that can be preserved. The eyes are far more flexible when forced inward, so the crossed eye technique allows greater image size.

The most significant challenge to free viewing is focal/angular displacement. When parallel viewing, the eyes are positioned for viewing at infinity. Usually, the image being viewed is placed 12 to 18 inches from the eyes. Significant and conscious effort must be exerted to allow the eyes to focus closely independently from the parallel driven urge to focus at infinity. Conversely, crossed eye viewing forces the eyes into an acute angle that would normally indicate an object at very close proximity. Consequently, considerable effort must be expended to force the eyes to focus at a distance somewhat beyond the angular cues being supplied to the brain.

Free viewing requires practice. Figure 5 contains several images to help you practice crossed eye free viewing. Take your time. This is not an easy technique. Younger people tend to have the greatest success. There's something about the flexibility of youth. I learned free viewing when I was in my mid teens. I drive my optometrist nuts! I can converge and focus on impossibly misaligned images that are used to diagnose serious vision deficits. Then I have to take several minutes to explain the techniques that I have practiced and developed to a very high degree. This technique cannot damage your eyes. When your mom told you to 'stop looking cross-eyed or your eyes will stick like that' she was merely trying to ensure you didn't look like a moron when you went to visit grandma. Sure, if practiced to excess initially you'll get a splitting headache. Take it easy. Try it for a few minutes, take a break and come back to it an hour or two later. It can take months to develop comfortable and flexible control of your viewing mechanism. I believe it's worth the effort, and certainly it's at least worth a try.

Stereo Projection

Stereo projection is an extension of stereo pair viewing, combined with filtering techniques designed to separate two images projected on a common surface. Typically, a dual projection system consisting of parallel optical channels are passed through polarizing filters oriented at 90 degrees to each other. The projection surface (screen) must be capable of reflecting polarized light with very little dispersion or depolarization. The individual viewing the projected image must wear glasses with polarized lenses over both eyes oriented to match the polarization of the image desired for each eye. One notable example of this method of stereographic presentation is Disney's Epcot Center presentation of Michael Jackson in 'Captain EO', now replaced by the 3d thriller "Honey, I shrunk the Audience."

Anaglyphs

Anaglyph Theory

Like all other methods of stereographic presentation, anaglyphs are simply another method of presenting discreet images to each eye. Because anaglyhs present as superimposed images, special appliances are required to separate the encoded images.

Traditionally anaglyphs have been encoded using red and blue filters. Images encoded using red and green can be successful as well. Standard convention also dictates the viewing apparatus be constructed with the red lens on the left, the blue or green on the right. The notable exception to this standard is the propensity of the movie industry to reverse this convention and place the red on the right. Thus, if you have acquired a pair of cardboard 3D glasses for a network television presentation of an old 3D movie, you will need to reverse them to successfully view most web based 3D presentations.

The final presentation of the stereographic image is accomplished by encoding the image intended for the left eye in blue and the right eye in red. In this scenario, the image coded in red becomes invisible to the eye covered by the red filter. (see footnote)

Historically, left and right components were encoded from black and white images. Today, more is being done to create anaglyphs with limited true color rendition. This is accomplished by replacing the red channel of an RGB image with the red channel from the other half of a full color stereo pair. Success varies depending on the color content of the image. Unfortunately, this practice can result in excessive 'bleed through' resulting in undesirable 'halos' around some objects.

The actual technique of creating anaglyph images is almost elementary with tools such as Adobe® PhotoShop™ and similar photo editing software. This process will be described in detail later.

Anaglyph Registration and Virtual Depth Placement

Now we introduce a third element into the composition mix. After composing a pleasing image and adjusting the virtual depth to capture the maximum 3D effect we now must decide how to place the image in relation to the surface of the display medium. Do we want the image to protrude from the surface of the page or screen, recess entirely within it, or traverse both planes? The difference of effect can be quite dramatic.

The alignment, or registration, of anaglyph components is a 'soft' science. The eye can adjust within a broad range of parameters. The least forgiving is vertical alignment. The red and blue images should be as closely aligned on the horizontal plane as possible. A very slight misalignment will result in eye-strain and degradation of image quality. More severe misalignment will prevent image resolution.

Horizontal alignment controls the placement of objects within the virtual depth of the scene. The angular deflection of the eyes from parallel is an indication of the virtual distance to the subject. The point at which the red and blue images coincide is perceived to also coincide with the surface of the display media. Portions of the image where the left-eye is to the right of the right-eye image are perceived to be forward of the media plane.

There is much discussion as to the 'proper' placement in virtual space. Analyze the image you are preparing for presentation. Where is the center of attraction. How do you pre-judge the placement of the main subject prior to putting on your glasses. The tendency is to focus on the surface of the medium, whether it's the printed page or a computer screen, to search for initial convergence. Place something on the surface to catch the eye and give 'perspective' to the placement of the rest of the image.

Ambiguous virtual placement results in 'fishing,' or convergence searching. A double image will appear to oscillate horizontally until either the eyes converge on the image or stabilize with a double image. An individual with significant experience with stereographic image manipulation will notice these problems less due to the visual flexibility developed over time. One must put conscious effort into these issues if the images are to be viewed and appreciated by the less proficient. Figure 5 presents the same image with various virtual depth placements. Be aware of any issues you observe as you view these images.

Creating your own Anaglyphs using Adobe® Photoshop™

Anaglyphs can easily be created and manipulated using a variety of graphic tools. The basic technique, using RGB image processing, is this -

Replace the red channel of the left image with the red channel from the right image.
Accomplish registration by moving the red channel until optimum placement is achieved.
Crop image to optimize presentation.

Easy as 1,2,3(4,5). If you'd like step by step instructions, click here to learn how to create your own anaglyphs using Adobe Photoshop, version 7.0. The same technique applies to earlier version of the product but some of the screens may look different.

The Pulfrich Effect -

Named for the German physicist Carl Pulfrich, who discovered this little known ophthalmic phenomenon, the Pulfrich effect is a highly unique and specialized form of 3d imaging. The Pulfrich effect works only with moving images. If the action stops, the image reverts to a flat scene. If action moves the wrong direction, the 3d effect is reversed, giving an inside-out appearance to the scene. Here's how it works.

Equipment - Filtration is required for the Pulfrich effect. A single, dark tinted lens must be placed over one eye. Which eye? It depends. Let's talk first about how it works...

How it works - Our friend Carl discovered that lower levels of illumination require additional time for visual perception to occur. The dimmer the image, the more time required. This delay can be as much as several hundredths of a second.

In practical application, a moving image displays a continuous stream of visual information from a constantly varying perspective. If one eye is covered with a darkening filter, it registers the scene a frame or two later than the uncovered eye. This image will be from a different perspective than the other eye is currently registering, thus potentially containing full 3-dimensional information. To maximize the effect and produce predictable and high quality 3D effects, constantly moving the camera from right to left during filming and viewing with the right eye filtered will produce impressive results. Obviously, the camera must always move the same direction to be effective. If the direction is reversed, the filter must be moved to the other eye. Requesting your audience to switch the filter to the other eye in the middle of a presentation is not considered to be good form. FigureX is a short (yet 4 meg) video clip shot from the window of a moving train to demonstrate the Pulfrich effect. View by covering the right eye with a sun glass lens. FigureY is the return trip. Filter over your left eye this time. If you don't have the bandwidth to withstand the very large video files, this simple java application will demonstrate the effect rather clearly: http://dogfeathers.com/java/pulfrich.html

Other methods - lenticular, LCD shutters

Lenticular images, once popular in storybooks and postcards, are less popular today. The results are definitely discreet three dimensional images. Image quality suffers from the mechanical processing necessary to produce the pictures. Stereo pairs are cut into very narrow strips and mounted behind a fresnel type lens. The strips are arranged in such a way that each eye views only the appropriate strips. The images tend to be a little fuzzy but have their own unique and ethereal charm.

LCD shutters are one of the most exciting developments in 3D imagery in many years. The viewer wears special glasses equipped with LCD lenses. These lenses are alternately darkened in coordination with the refresh rate of the video equipment. A constant stream of alternating left and right video images are fed to the display. The LCD lenses darken to prevent the wrong eye from viewing an inappropriate image and lighten when the appropriate image is displayed.

LCD systems tend to be expensive and proprietary. Development tools are generally aimed at commercial markets. Compatibility between vendors is limited as well.

Creating Stereographic Images for Distribution via the Internet

Pairs

Anaglyphs

¹I asked my friend Ralph. He is an ophthamalogist.