The Video Image
A Primer from Aspect Ratios to
Frame Rates
By Ron Seifried and
Craig Rosenzweig, courtesy B&H Video
Everything
you ever wanted to know about video aspect
ratios, resolution, scanning (interlaced vs
progressive), deinterlace, frame rates
- THE guide for the professional video maker
in just a single web page)
Every image tells a story. The subjects,
colors, and composition of an image contain
a wealth of information. Even if we removed
these attributes, a digital image still
retains vital information necessary for any
video editor. The following article covers
the primary terminology required to
understand the wide range of resolutions,
fields, and frame rates. The learning curve
for High Definition can seem large at first,
but is easily understood after learning
these basic concepts.
Aspect Ratio
There are several different aspect ratios,
but in relevance to video we will cover the
two most important. The image aspect ratio
is width divided by height. The 4:3 ratio
has been around since the beginning of
television in the late 1940's, and is also
commonly used in still photography. The HDTV
standard, however, is 16:9, also known as
widescreen. The "letterbox" format, most
commonly recognized on standard-def TV's as
"black bars" on the top and bottom portions
of the screen, is the result of converting a
widescreen film to a standard-def TV. The
opposite is true when playing standard 4:3
video on a 16:9 HD screen- "black bars"
appear on the left and right sides of the
screen, which is called "pillarboxing.".
Generally, today's HD cameras capture in the
16:9 aspect ratio.
 |
 |
|
The 16:9 image on the left has
approximately 30% more picture than
the 4:3 on the right, which is lost
on the right and left hand side of
the screens |
Resolution
It is important to note that the word
"resolution" has more than one meaning. This
article will cover resolution as it pertains
to television and TV specs, where the
resolution of an image is determined by the
number of pixels or scanned lines within it,
and how they are arranged. In digital video,
this is measured in pixels per inch, and in
analog video it is measured in horizontal
lines per height of screen. For example, in
digital video 720 x 480 is 720 columns of
pixels wide by 480 rows of pixels high. When
naming a particular video format's
resolution, we refer to its vertical number
of rows of pixels high. Therefore, 1920 x
1080 refers to a digital sampling structure
of 1920 horizontally (width) and 1080
vertically (height) and is called "1080."
Pixels (picture elements) can be described
in a variety of formats, including computer
displays and digital cameras. In the world
of standard-def video, pixels are
non-square, or rectangular. Therefore, the
horizontal and vertical lines are not equal,
making images look up to 10% wider than
they're supposed to when displayed on a
computer or HD monitor. With high-def video,
pixels are square, eliminating the
possibility of stretching video or text that
is sometimes inherent in standard-def video.
For a reference, DV video is displayed in
non-square pixels, while images from a
computer are viewed in square pixels. When
converting video for YouTube, for instance,
it's a good idea to de-interlace the video
so it displays square pixels.
 |
 |
When it comes to the two most common HD
resolutions (1920x1080 and 1280x720),
capturing and displaying with square pixels
is not a problem. But for HDV, the
resolution is 1440x1080, necessitating
squeezing the pixels so a skinny picture can
be displayed on an HD monitor. HDCAM also
has a 1440x1080 frame, but outputs to
1920x1080. In the past, post-production
artists had to be especially aware of the
elements they were working with. Today, most
editing and compositing applications make it
easy to mix different media formats in the
same project.
Scanning Method
Interlaced:
A technology that has been around since the
dawn of television, interlaced video scans
all the odd lines of the first field of a
frame, and then scans the even lines of a
field. When putting these two fields
together, or "interlacing" them, you get a
complete frame. Beginning with the upper
left corner of the display and ending at the
bottom right corner, each field contains
half the lines required to complete one
frame. Interlaced video is limited to 525
horizontal lines of standard-def TV's.
One main disadvantage of interlaced video is
that because two different fields are
capturing one moment in time, fast-motion
artifacts will appear on the image when
playing interlace video on a progressive
display monitor. This can somewhat be
corrected through a process called
deinterlacing, which converts interlace
video to a non-interlace progressive format.
The increased vertical resolution to a
progressive frame reduces fast-motion edge
artifacts because the progressive image has
higher resolution. Although shooting fast
movement with interlaced or progressive will
lead to fast motion artifacts, due to "
"shutter drag", interlaced captures are
still better when shooting fast-motion like
documentaries and sporting events. The
interlace picture capture rate gives a story
teller freedom to move the camera; it has a
temporal advantage over a progressive
capture rate.
Progressive:
Progressive video is the method by which all
lines (whole frame) are captured at the same
instant, representing a single moment in
time. The advantage it has over an
interlaced method is that each frame is
scanned in sequence, utilizing twice the
bandwidth and resolution. Even though the
number of resolution lines is the same
between interlaced and progressive, when the
number is followed by a "p" (i.e., 1080p)
more detail - with no artifacting - will be
displayed at any given moment. Because of
the higher bandwidth required for
progressive, component video or HDMI cables
are needed to transfer the signal. It is
also important to note that all progressive
formats are available only in 16:9 aspect
ratio.
Sony's latest
solid-state camera, the
PMW-EX1,
is the first reasonably-priced camera that
supports 1920 x 1080 progressive recording
rather than the 1440 x 1080 interlaced you
find in HDV cameras. Available with a ½-inch
CMOS chip, rather than the traditional 1/3
chip, video is saved on flash memory cards
rather than tape, and is easily switchable
to progressive and interlaced scanning
methods at various capture rates.
 |
|
Chart illustrating the relative
frame dimensions of different video
formats in Progressive and Interlace |
Resolution Breakdown
480i
is full-frame, standard-definition NTSC
video. (NTSC is the analog television system
used in the United States, Canada, Japan,
Mexico, and other countries, originally
adopted by the National Television System
Committee, for whom it is named.)
480p,
a progressive mode scanned at 480 lines, is
found primarily in EDTV sets
(Enhanced-Definition Television).
576i/p
is the PAL television format (used
throughout Europe, parts of Africa, the
Middle East, the Indian Subcontinent, the
Pacific Rim countries, the Pacific Islands
as well as Australia, New Zealand, and
Tasmania), available in both interlaced and
progressive.
720p
is a progressive rate found in HD. The 16:9
widescreen format has a horizontal
resolution of 1280 pixels (1280x720), and is
one of the two most common HD resolutions in
use today. The other is 1080i.
1080i
is a common HD format, interlaced, that
scans 1080 lines vertically for a frame
resolution of 1920x1080. Despite being of a
higher resolution than 720p in spatial
resolution (pixels per frame), the
interlaced scanning introduces artifacts
during fast-moving shots, but shows more
detail in stationary video.
1080p
is a progressive rate with 1080
vertically-scanned lines (1920x1080). Most
commonly found in high-end HDTV's, Blu-Ray,
and HD-DVD formats, 1080p produces a sharper
image than the aforementioned formats, but
is currently not available from
broadcasters.
2048 (2k)
Designed for Digital Cinemas, the 2k
(2048x1080 at 24fps) format is primarily
scanned from 35mm films for distribution to
digital-cinema theaters.
4096 (4k)
The optimal resolution (4096x2160 at 24fps)
available today, 4k has been the hot buzz
phrase in the industry, thanks to the
availability of the RED One Digital Video
Camera.
Ultra High-Definition
Video is in
development, with a proposed resolution of
7,680 × 4,320.
 |
 |
|
The top image roughly demonstrates
what takes place with interlaced
scanning, with the progressive image
below |
De-interlace
What if you want to display an interlaced
image on a progressive-scan monitor? This
situation will occur when you attempt to
edit interlaced video on a computer. If the
video is intended to be played back on a
computer, you can convert or de-interlace
the video using your editing program or
capture device. However, if the final video
will be played on a standard television via
DVD or tape, you need to maintain
interlacing while you edit.
Three current, popular
HD camcorders in the Canon HD series, the
XH-A1,
XH-G1,
and the
XL-H1
use a very intuitive de-interlacer. For the
de-interlacing process, a progressive video
stream is extracted from an interlaced one.
The benefit of the Canon camcorder is that
the de-interlace feature gives you the best
of both worlds - interlaced shooting for
fast motion content like sports and
documentary shooting, and progressive for
higher-resolution dramatic content such as
character-driven movies. Canon's Cinema-mode
24F clocks the CCD at 24Hz, which is 48i,
rather than the traditional 60i. The Canon
derives 24F from 48i, and as we elaborate
below, 48 interlaced is equal to 24
full-frame captures, or 24 frames per
second. Please read on to learn more about
capture rates.
Frame or Field Rate (Capture Rate)
Capture rate is a term used to describe the
number of times per second that a picture is
taken or captured in an imaging system. In a
progressive system, the capture rate is
equal to the frame rate, while in an
interlaced system the capture rate is double
the frame rate because only one field (a
half-resolution image) is acquired at each
interval. It takes two fields to make a
complete, interlaced frame. It is standard
practice to refer to the capture rate of an
image, as well as how it is captured when
describing it, instead of the frame rate -
i.e., 60i (60 captures, 30 frames per
second), 30P (30 captures, 30 frames per
second) and 60P (60 captures, 60 frames per
second).
In theory, television has a frame rate of 30
fps, with interlaced video displaying two
fields (half resolution) per frame, and
progressive capture rate equaling one frame
per displayed image. Film has a frame rate
of 24 fps, and in video, 24p is captured in
the progressive mode. Many videographers are
buying camcorders with 24p for a more
"cinematic" experience. The focus on
improving video has always been to narrow
the gap between video and film quality,
making video more "film-like." With 24-frame
HDTV, Digital HD now has achieved basic
imaging attributes that greatly narrow the
gap between video and film. 24-frame HDTV,
now more than ever, has similar cinematic
qualities to a feature film release you
would see on a theater screen.
|
NTSC Capture Rate
|
How it is captured
|
|
60i |
interlaced |
(60 captures, 30 frames per second) |
|
59.94i |
interlaced |
(59.94 capture, 29.97 fps) |
|
60P |
progressive |
(60 capture, 60 fps). |
|
59.94P |
progressive |
(59.94 capture, 59.94 fps.) |
|
30P |
progressive |
(30 captures, 30 fps) |
|
29.97P |
progressive |
(29.97 captures, 29.97 fps) |
|
24P |
progressive |
(24 captures, 24 fps) |
It is debatable whether or not a "film-like"
image can be achieved with either 24-Frame
HDTV or 60-Field HDTV. Some "progressive"
thinkers maintain that the 60-Field HDTV can
compete with film in terms of the tonal and
color reproduction, exposure latitude, and
picture sharpness of traditional film
production. The progressive scan found in
24p will increase the vertical resolution
and reduce fast-motion edge artifacts,
making 24p frame rate capture more sharp,
but proponents of 60-field capture say that
the picture sharpness is not that essential;
often, filtration is used to decrease
picture sharpness for certain scenes anyway.
60i
This interlaced format has been the standard
frame rate used in TV for years. One of the
technical limitations of a 24-frame capture
is a visual effect called "staccato judder."
Staccato judder is a visible flaw in
24-frame video that occurs when the camera
is panned quickly. The high 60i picture
capture rate gives a storyteller freedom to
move the camera quickly, without worrying
about artifacts.
60p
Although not commonly used in productions
today, this progressive format works with
the 1080p format.
50i
50 interlaced fields per 25 frames. This is
the standard for PAL and SECAM (used in
France, Russia, and certain western African
countries as well as Madagascar) television.
30p
Progressive rate that produces 30 frames per
second
25p
Popular before the 24p video revolution of
the last few years, this progressive format
produces 25 frames per second and comes from
the PAL standard.
24p
Although the 1080/60i has high-resolution
frames, the 60i field rate displays reduced
vertical resolution due to interlacing.
24-frame capture has been around for
decades, and ostensibly allows a greater
creative dynamic and a more subtle picture
portrayal. The 24 fps film-camera capture
rate was developed in the 1920's, coinciding
with the arrival of film sound. 24 fps was
the lowest frame rate that could
satisfactorily reproduce audio on the film's
optical tract. 24 fps is technically a
product of sound; however, it did render
motion picture more accurately than what
filmmakers created during the silent era.
Therefore, it worked. 24 fps has a more
"artistic" appeal than 60i. The benefit of
shooting 1080/24p is that this format has
the resolution, scanning method, frame rate,
and aspect ratio closest to motion picture
film and is commonly used for converting
video into film.
Conclusion
What capture and editing specs you choose
for your productions will have an important
effect on the final look. This article
merely touches upon the surface of what
needs to be understood for capturing and
editing video. In the future, we will expand
on each technical definition for a more
comprehensive understanding of video
resolutions, frame rates, and aspect ratios.