Network+ Guide to
Networks, Chapter 12 Review
Voice and Video over IP
In Chapter 1, you learned that convergence is
the use of one network to simultaneously carry voice, video, and data
communications. For most of the twentieth century, voice and data signals
traveled over separate networks. The PSTN (Public Switched Telephone Network),
based on Alexander Graham Bell’s circuit-switched model, carried telephone
calls and fax transmissions. Packet-switched networks, such as the Internet,
took care of e-mail, Web pages, file transfers, and access to other data
resources. In the latter part of the twentieth century, the two types of
networks began intersecting. However, this intersection is not necessarily
seamless or efficient. In some cases, it requires modems to convert digital
data into analog signals and vice versa. Networks achieve more unified
integration, however, by packetizing voice—that is, digitizing the voice signal
and issuing it as a stream of packets over the network. In the last 15 years,
telecommunications carriers, network service providers, data equipment
manufacturers, and standards organizations have focused on ways to deliver
voice, video, and data over the same networks. These converged networks, as
they are called, may be cheaper and more convenient, but they also require new
technology. This chapter describes a variety of voice and video-over-IP
applications, plus the protocols and infrastructure necessary to deliver them.
Terminology
In discussions of convergence, the use of multiple terms to
refer to the same or similar technologies is common. This is partly a result of
a market that developed rapidly while many different vendors marketed their own
solutions and applied their preferred terminology. The terms used throughout
this chapter are those most frequently cited by standards organizations such as
the ITU and IETF. Before you learn how voice and video-over-IP services work,
it’s useful to understand the meaning of these terms. One important term is IP
telephony, the use of any network (either public or private) to carry voice
signals using the TCP/IP protocol. IP telephony is more commonly known as VoIP
(Voice over IP). VoIP can run over any packet-switched network. Virtually any
type of data connection can carry VoIP signals, including T-carriers, ISDN,
DSL, broadband cable, satellite connections, Wi-Fi, WiMAX, HSPA+, LTE, and
cellular telephone networks. When VoIP relies on the Internet, it is often
called Internet telephony. But not all VoIP calls are carried over the
Internet. In fact, VoIP over private lines is an effective and economical
method of completing calls between two locations within an organization. And
because the line is private, its network congestion can be easily controlled,
which often translates into better sound quality than an Internet telephone
call can provide. But given the Internet’s breadth and low cost, it is
appealing to consider the Internet for carrying conversations that we currently
exchange over the PSTN. Voice is not the only nondata application that can be
carried on a converged network. Other applications include IPTV (IP
television), in which television signals from broadcast or cable networks
travel over packet-switched networks. Videoconferencing, which allows multiple
participants to communicate and collaborate at once through audiovisual means,
is another example of using networks to carry video information. Streaming
video refers to video signals that are compressed and delivered in a continuous
stream. For example, when you choose to watch a television show episode on the
Web, you are requesting a streaming video service. You don’t have to download
the entire episode before you begin to see and hear it. When streaming videos
are supplied via the Web, they may be called Webcasts. One way to distribute
video signals over IP is multicasting. As described in Chapter 4, in
multicasting, one node transmits the same content to every client in a defined
group of nodes, such as a subnet. 
IPTV, videoconferencing, streaming video, and IP multicasting
belong to the range of services known as video over IP. Over time, voice and
video services over packet-switched networks have matured, and as a result more
users rely on them. These users, in turn, have demanded better integration with
traditional data services, such as e-mail and Web browsing. In Chapter 1, you
learned that unified communications (sometimes called unified messaging) is a
service that makes several forms of communication available from a single user
interface. In unified communications, a user can, for example, access the Web,
send and receive faxes, e-mail messages, voice mail messages, instant messages,
or telephone calls, and participate in videoconference calls—all from one
console. This overview of the terms used when discussing converged services
gives you a sense of how many applications fall into this category. Now you are
ready to learn how they work.
VoIP Applications and Interfaces
VoIP (pronounced “voyp”) has existed in various forms for over a decade.
Although organizations were slow to adopt it at first, as networks became
faster, more reliable, and more accessible, use of VoIP increased dramatically.
Significant reasons for implementing VoIP include the following:
Lower
costs for voice calls—In the case of long-distance calling, using VoIP over a
WAN allows an organization to avoid paying long-distance telephone charges, a
benefit known as toll bypass. For example, an organization that already leases
T3s between its offices within a region can use the T3s to carry voice traffic
between colleagues.
Supply
new or enhanced features and applications—VoIP runs over
TCP/IP, an open protocol suite, whereas the PSTN runs over proprietary
protocols. This means developers with enough skill and interest can develop
their own VoIP applications, making the possibilities for new VoIP features and
services endless. It also means that off-the-shelf VoIP applications can be
modified to suit a particular organization’s needs.
Centralize
voice and data network management—When voice and data transmissions use the
same infrastructure, a network manager needs only to design, maintain, and
troubleshoot a single network. Furthermore, on that network, VoIP devices can
provide detailed information about voice transmissions, such as the date, time,
and duration of calls, in addition to their originating number and caller
names. 
Voice and data can be combined on a network in several different
configurations. VoIP callers can use either a traditional telephone, which
sends and receives analog signals, a telephone specially designed for TCP/IP
transmission, or a computer equipped with a microphone, speaker, and VoIP
client software. And on any VoIP network, a mix of these three types of clients
is possible. The following sections explain how analog and digital voice
networks are integrated and describe equipment necessary to accomplish such
integration.
Analog
Telephones
If a VoIP caller
uses a traditional telephone, signals issued by the telephone must be converted
to digital form before being transmitted on a TCP/IP-based network. In fact, even
if the entire VoIP connection is digital, voice signals still need to be
converted from their natural, analog form into bits. This conversion involves
first compressing and encoding analog signals, functions that occur at the
Presentation layer of the OSI model. Any method for accomplishing this
conversion is known as a codec (a word that derives from its function as a
coder/decoder). Detailing the wide variety of voice and video codecs is beyond
the scope of this book. However, to successfully implement converged networks,
you should understand what types of equipment are necessary to accomplish
analog-to-digital conversion. One possibility is to connect an analog telephone
to a VoIP adapter, sometimes called an ATA (analog telephone adapter). The ATA
might be a card within a computer workstation or an externally attached device
that allows for one or more telephone connections. The traditional telephone
line connects to an RJ-11 port on the adapter. The ATA, along with its device
drivers and software on the computer, converts analog voice signals to IP
packets and vice versa. Figure 12-1 shows an ATA that supports two telephone
connections. A second way to achieve this conversion is by connecting an analog
telephone line to a switch, router, or gateway capable of accepting analog
voice signals, converting them into packets, then issuing the packets to a data
network—and vice versa. Like the switches, routers, and gateways you learned
about earlier in this book, VoIP-enabled devices come with a variety of
features, including support for NAT, VPN protocols, encryption, and more.
Figure 12-2 shows a VoIP router that accepts up to four telephone lines. Next
to the bank of eight RJ-11 ports for incoming analog lines are two RJ-45 ports
to connect the router to an Ethernet network. A third example of an
analog-to-digital voice conversion device is a digital PBX or, more commonly,
an IP-PBX.(PBX stands for private branch exchange, which is the term used to
describe a telephone switch that connects and manages calls within a private
organization.) In general, an IP-PBX is a private switch that accepts and
interprets both analog and digital voice signals. Thus, it can connect with
both traditional PSTN lines and data networks. An IP-PBX transmits and receives
IP-based voice signals to and from other network connectivity devices, such as
routers or gateways. Most IP-PBX systems are packaged with sophisticated software
that allows network managers to configure and maintain an organization’s phone
system. For example, the system can be set up to ring a user’s desk phone and
cell phone simultaneously. And because an IP-PBX stores call information
electronically, source and destination numbers, call times, durations, and
voice-mail messages can be accessed via a Web interface. A special type of
IP-PBX is one that exists on the Internet. Instead of installing an IP-PBX on
its WAN, an organization might contract with a service provider for call
management services in a hosted PBX arrangement. (Hosted PBXs may also be
called virtual PBXs, although this term is a trademark of the VirtualPBX
Company.) Organizations that choose a hosted PBX don’t need to install or
maintain hardware or software for call completion and management. In a fourth
scenario, the traditional telephone connects to an analog PBX, which then
connects to a voice-data gateway. In this case, the gateway connects the
traditional telephone circuits with a TCP/IP network (such as the Internet or a
private WAN). The gateway digitizes incoming analog voice signals, compresses
the data, assembles the data into packets, and then issues the packets to the
packet-switched network. When transferring calls from a packet-switched network
to a circuit-switched network (for example, if you call your home telephone
number from your office’s IP telephone), the gateway performs the same
functions in the reverse order.
IP
Telephones
Most new VoIP
installations use IP telephones (or IP phones), which, unlike traditional
phones, transmit and receive only digital signals. When a caller uses an IP
telephone, her voice is immediately digitized and issued from the telephone to
the network in packet form. To communicate on the network, each IP telephone
must have a unique IP address, just as any client connected to the network has
a unique IP address. The IP telephone looks like a traditional touch-tone
phone, but connects to an RJ-45 wall jack, like a computer workstation. Its
connection may then pass through a connectivity device, such as a switch or
router, before reaching the IP-PBX. An IP-PBX may contain its own voice-data
gateway, or it may connect to a separate voice-data gateway, which is then
connected to the network backbone. Figure 12-5 illustrates different ways IP
telephones can connect with a data network. IP telephones act much like
traditional telephones. For example, they feature speed-dialing, call hold,
transfer, and forwarding buttons, conference calling, voice-mail access,
speakers and microphones, and an LCD screen that displays caller ID and call
hold information. IP telephones come in both mobile and wired styles. More
sophisticated IP telephones offer features not available with traditional
telephones. Because IP telephones are essentially network clients, like
workstations, the number and types of customized features that can be
programmed for use with these phones is limitless. For example, IP telephone
screens can act as Web browsers that allow users to complete a call by clicking
on a telephone number. Another benefit of IP telephones is their
mobility. Because IP telephones are addressable over a network, they can be
moved from one office to another office, connected to a wall jack, and be ready
to accept or make calls. Compare this with the traditional method of moving
telephone extensions, which requires reprogramming the extension’s location in
a PBX database. A user would have to wait for the network administrator to
perform this change before her telephone extension would work in a new
location. With IP telephones, however, the user is free to move to any point on
the network without missing a call. One issue that faces IP telephones is the
need for electric current. A conventional analog telephone obtains current from
the local loop. This is necessary for signaling—for example, to make your phone
ring and to provide a dial tone. However, IP telephones are not directly
connected to the local loop. Instead, most obtain electric current from a
separate power supply. This makes IP telephones susceptible to power outages in
a way that analog telephones are not. It also points to the need for assured
backup power sources in organizations that rely on IP telephones. In some VoIP
installations, IP telephones obtain current via their Ethernet connection using
PoE (Power over Ethernet). A typical IP phone is shown in Figure 12-6. Using IP
telephones is not the only way to benefit from a fully digital voice
connection. Instead, an off-the-shelf workstation can be programmed to act like
an IP telephone, as described in the next section.
Softphones
Rather than using traditional telephones or
IP telephones, a third option is to use a computer programmed to act like an IP
telephone, otherwise known as a softphone. Softphones and IP telephones provide
the same calling functions; they simply connect to the network and deliver
services differently. Before it can be used as a softphone, a computer must
meet minimum hardware requirements (which any new workstation purchased at an
electronics store would likely meet), be installed with an IP telephony client,
and communicate with a digital telephone switch. In addition, softphone
computers must have a sound card capable of full-duplex transmission, so that
both the caller and the called party can speak at the same time. Finally, a
softphone also requires a microphone and speakers or a headset. Skype, the
popular Internet telephony software, is one type of softphone. After a user
starts the softphone client software, he is typically presented with a
graphical representation of a telephone dial pad, as shown in Figure 12-7. 
The interface might also present a list of
telephone numbers in the caller’s address book, so that the caller can click on
the number he wants to call. And like IP telephones, the program features
buttons for call forwarding, speed dialing, conferencing, and so on—except that
on a softphone, these buttons are clickable icons. Unlike many traditional
phones, softphones allow the user to customize the graphical interface. For
example, an administrative assistant who spends most of his time calling
clients and vendors on behalf of his supervisor can position a list of
clickable, frequently called numbers in the foreground of his default
interface. One difference between IP telephones and softphones is that a
softphone’s versatile connectivity makes it an optimal VoIP solution for
traveling employees and telecommuters. For example, suppose you are a district
sales manager with a home office and you supervise 32 sales representatives
throughout the Pacific Northwest. Your company uses VoIP, with an IP-PBX
connected to the company headquarters’ LAN. At your home office, you have a
desktop workstation equipped with a sound card, headset, and softphone
software. You also lease a DSL connection to your local carrier, which allows
you to log on to your company’s LAN from home. After logging on to the LAN, you
initiate the softphone client and then log on to the company’s IP-PBX. By logging
on to the IP-PBX, you access your personal call profile and indicate to the
IP-PBX that your calls should be routed to your home computer. However, because
you are a district sales manager, you only spend half of the time working from
home. The other half of the time you travel to visit your sales representatives
across the region. During that time, you use a laptop that, like your home
workstation, is equipped with a sound card, headset, and the softphone client
software. While on the road, you use remote connectivity software to access
your company’s LAN, and then initiate your softphone client. Now your calls are
directed to your laptop computer, rather than your home workstation. No matter
where you are, you can establish a remote telephone extension, if the computer
has the appropriate software and hardware installed. Figure 12-8 depicts the
use of softphones on a converged network. Besides their extreme mobility,
another advantage to softphones is the capability for convenient, localized
call management. Like IP phones, softphone clients can easily track the date,
time, and duration of calls, in addition to their originating number and caller
names. A softphone user can also, for example, export call information to a
billing or accounting program on the same workstation. This feature simplifies
record keeping and billing for professionals—such as lawyers or consulting
engineers—who bill their customers by the hour. Now that you understand the
variety of ways VoIP services may be implemented, you are ready to learn about
the different types of video services that packet-switched networks may carry.
Video-over-IP Applications and
Interfaces
Cisco Systems, the largest supplier of networking hardware in the world,
estimates that by 2015, over two-thirds of the traffic carried by the Internet
will be video traffic, with online video viewers totaling one billion people.
This extraordinary growth is thanks to the large quantity of video content from
providers such as Netflix and Hulu, the increasing volume of devices accessing
the Internet, including smartphones and tablet PCs, plus the decreasing cost of
bandwidth and equipment. Given these networking trends, it’s important to
understand the types of video services that TCP/IP networks can carry and the
hardware and software they rely on. The following sections divide video-over-IP
services into three categories: streaming video, IPTV, and videoconferencing.
However, divisions between these services are not always clear, as you’ll
learn. 
Also bear in mind that no matter what the application or distribution
method, every video-over-IP transmission begins with digitizing the audio and
visual signals using one of several popular video codecs, such as MPEG-4.
Details of video codecs are beyond the scope of this book.
Streaming
Video
You have already
learned that streaming video is a service in which audiovisual signals are
compressed and delivered over the Internet in a continuous stream. If you have
watched a YouTube video on the Internet, you have used streaming video. Because
most networks are
TCP/IP-based, most
streaming video belongs to the category of video over IP. Among all
video-over-IP applications, streaming video is perhaps the simplest. A user
needs only to have a computer with sufficient processing and caching resources,
plus the appropriate audiovisual hardware and software to view encoded video.
On the transmission end, video can be delivered by any computer with sufficient
capabilities to store and send the video. This might be a streaming server dedicated
to the task or any computer that performs video streaming among other tasks.
Streaming video can traverse any type of TCP/IP network, though it often relies
on the Internet. One popular way of providing video streams is to make them
available as stored files (saved in any one of a number of popular video
formats) on a server. The viewer then chooses to watch the video at his
convenience, typically from a Web browser. Upon receiving a request, the server
delivers the video to the viewer. This type of service, in which the video file
remains on the server until it is specifically requested by the user, is known
as video-on-demand. When you choose to watch a news report from your local TV
channel’s Web page, for example, you are making use of video-on-demand. In
another form of streaming video, the video is issued live—that is, directly
from the source to the user as the camera captures it. For example, suppose you
wanted to watch a political debate that’s being broadcast by a TV network. You
could access the network’s Web site and watch the debate as it happens using
live streaming video. One drawback to live streaming is that content cannot be
edited before it’s distributed. Another potential drawback is that viewers must
connect with the stream as it occurs, whereas they can use video-on-demand at
their convenience. In addition, video-on-demand allows viewers to control their
viewing experience, for example, by pausing, rewinding, or fast-forwarding.
Figure 12-9 illustrates video-on-demand and live streaming video services. The
distinction between on-demand video and live streaming video is just the
beginning. You also need to consider the number of clients receiving each
service. For example, an IT manager in one office might use his laptop and its
built-in camera to capture and issue a video of himself explaining a technical
topic to one of his employees in another office. This is an example of
point-to-point video over IP. Or he might issue the video stream to a whole
group of employees in a point-to-multipoint manner. You might recall the terms
unicast and multicast from Chapter 4 and assume that point-to-multipoint
streaming video means multicast transmission. That’s not necessarily the case.
In IP multicasting, a source issues data to a defined group of IP addresses. In
fact, many streaming video services—and nearly all of those issued over a
public network, such as the Internet—are examples of unicast transmissions. In
a unicast transmission, a single node issues a stream of data to one other
node. In a VoIP call, for example, one IP phone addresses another in unicast
fashion. If many Internet users watch CSPAN’s streaming video on the Web
simultaneously, the CSPAN source would issue encoded audiovisual signals to
each viewer via separate unicast transmissions. In the example of an IT manager
sharing a video discussion with his employees in another office, the
transmission might be unicast or multicast, depending on how he configured it.
Finally, streaming video services may also be classified according to the type
of network they use, private or public. Watching YouTube videos or TV episodes
on Hulu are obviously cases of streaming video issued over a public network,
the Internet. Examples of streaming video on private networks include
educational videos delivered over the private networks of schools, businesses,
or other organizations. For example, a guest speaker’s presentation at a
college’s main campus auditorium could be filmed and transmitted via live
streaming to classrooms in the colleges’ satellite campuses, all without ever
leaving the college’s private network. Most, though not all, examples of
streaming video take place over public networks. The following section
describes a video-over-IP service that typically makes use of private networks.
IPTV
(IP Television)
In Chapter 7, you
learned about the networks that telecommunications carriers and cable companies
have established to deliver high-bandwidth Internet connections to their
customers. These networks are now being used to deliver digital television
signals using IPTV. In fact, because of digital video’s value as an added
service, your local telephone company is likely investing significant sums into
the hardware and software that make IPTV possible. Because telecommunications
carriers are leading the way with IPTV installation, this section concentrates
on their network architecture and components but bear in mind that cable
companies are investing in this technology, too. Several elements come together
to deliver digital video to consumers, as shown in Figure 12-10. Each element
and its role are described next. To begin, a telco accepts video content at a
head-end. The content may include signals captured from satellite video feeds,
national or regional broadcasts, or local content, such as live feeds of city
council meetings. Typically, this content arrives in analog format, and at the
head-end, an encoder converts it to digital format. At the telco’s CO (central
office), one or more servers manage customer subscription information, encrypt
video to comply with digital rights regulations, publish channel listing
information, and associate each video input with its own channel, among other
things. Also at the CO, each video channel is assigned to a multicast group. Multicasting
makes sense for delivering IPTV. First, it’s a simple way of managing content
delivery. For example, 2000 of the telco’s customers might choose to watch a
Monday night football game. Rather than supply the video via 2000 separate
unicast transmissions, the carrier can issue one multicast transmission to the
entire group of 2000 subscribers. The second advantage to using multicasting
has to do with local loop capacity. As you know, fiber to the home is still
rare in the United States, and most local loops rely on copper cabling.
Therefore, throughput is limited. In an environment where customers may choose
from hundreds of channels, supplying all the choices to every customer at all
times would overwhelm the local loop’s capacity. Even supplying more than a few
channels would be too much. Instead, the telco transmits only the content a
subscriber has chosen. When an IPTV user changes the channel, therefore, she is
merely opting out of one IP multicast group and opting into another. Recall
from Chapter 4 that multicasting is managed by IGMP (Internet Group Management
Protocol). Therefore, IGMP underlies all IPTV implementations at the Network
layer of the OSI model. However, IGMP can only identify group members. To
ensure efficient content delivery to a multicast group, routers communicate
using a multicast routing protocol. Several multicast routing protocols exist.
Which protocol the network uses is less important than the fact that all Layer
3 devices communicate using the same multicast routing protocol. A compressed,
digital video signal travels over the telco’s network just as a data signal
would. For example, if a subscriber obtains DSL service from the carrier, the
signal would go from the telco’s router to a DSLAM (DSL access multiplexer),
either at the CO or at a remote switching facility, and then to the
subscriber’s DSL modem. If the subscriber obtains service wirelessly—for
example, via WiMAX—the signal would be issued from the carrier’s router to an
antenna on a tower, and then to an antenna and WiMAX connectivity device at the
subscriber’s home. After passing through the DSL modem or home WiMAX device,
the video signal is decoded and issued to a television by a set-top box.  Besides decoding the video signal, set-top
boxes communicate with content servers to manage video delivery. For example,
the set-top box delivers TV program schedules from the content server to
subscribers and sends a subscriber’s channel request to the content server. In
cases where IPTV providers allow pay-per-view or video-on-demand programming,
set-top boxes manage requesting and delivering those services. Set-top boxes
may also allow a user to browse the Internet from his TV. Figure 12-11 shows
one type of set-top box. A significant advantage of delivering video services
over a telecommunications carrier’s or cable company’s network is that those
firms control the connection end to end. This means they can better monitor and
adjust its QoS (quality of service). 
Later in this
chapter, you’ll learn some techniques for controlling the QoS of voice and
video transmissions. The next section describes a third popular video-over-IP
service, videoconferencing.
Videoconferencing
So far in this
chapter, you have learned about unidirectional video-over-IP services—that is,
video delivered to a user who only watches the content, but does not respond
with her own. In most examples of videoconferencing, connections are
full-duplex, and participants may send and receive audiovisual signals. This
allows two or more people in different locations to see and hear each other in
real time. As you can imagine, the cost savings and convenience of such a
service make it especially attractive to organizations with offices, clients,
or consultants scattered across the nation or the globe. Besides replacing
face-to-face business meetings and allowing collaboration, uses for
videoconferencing include the following:
Telemedicine, or
the provision of medical services from a distance—For example, a physician can view and listen
to a patient in another location. Often, the patient is accompanied by a nurse
or physician’s assistant, who might administer tests and supply information
about the patient’s condition. For patients who live far from major medical
facilities, this saves the cost, time, and potential health risks of having to
travel long distances. NASA is developing telemedicine capabilities for
diagnosing patients in space.
Tele-education, or
the exchange of information between one or more participants for the purposes
of training and education—A significant benefit of tele-education is the capability
for one or a few experts to share their knowledge with many students.
Judicial
proceedings, in which judges, lawyers, and defendants can conduct arraignments,
hearings, or even trials while in different locations—This not only saves costs, but may minimize
potential security risks of transporting prisoners.
Surveillance, or
remotely monitoring events happening at one or more distant locations—Unlike previously mentioned videoconferencing
applications, surveillance is typically unidirectional. In other words,
security personnel watch (and perhaps also listen) to live video feeds from
multiple locations around a building or campus, but do not send audiovisual
signals to those locations.
Hardware and
software requirements for videoconferences include, at minimum, a means
for each
participant to generate, send, and receive audiovisual signals. This may be
accomplished by workstations that have sufficient processing resources, plus
cameras, microphones, and videoconferencing software to capture, encode, and
transmit audiovisual signals. 
Instead of a
workstation, viewers may use a video terminal or a video phone, a type of phone
that includes a screen, such as the one shown in Figure 12-12. These devices
can decode compressed video and interpret transport and signaling protocols
necessary for conducting videoconference sessions. When more than two people
participate in a videoconference, for example, in a point-to-multipoint or
multipoint-to-multipoint scenario, a video bridge is required. A video bridge
manages multiple audiovisual sessions so that participants can see and hear
each other. Video bridges may exist as a piece of hardware or as software, in
the form of a conference server. For an organization that only occasionally
uses videoconferencing, Internet-accessible video bridging services can be
leased for a predetermined period. Organizations such as universities that
frequently rely on videoconferencing might maintain their own conference
servers or supply each auditorium, for example, with its own video bridge. 
To establish and
manage videoconferencing sessions, video bridges depend on signaling protocols,
which are described in the following section.
Signaling Protocols
In VoIP and
video-over-IP transmission, signaling is the exchange of information between
the components of a network or system for the purposes of establishing,
monitoring, or releasing connections as well as controlling system operations.
Simply put, signaling protocols set up and manage sessions between clients.
Some functions performed by signaling protocols include the following:
·        
Requesting
a call or videoconference setup
·        
Locating
clients on the network and determining the best routes for calls or video
transmissions to follow
·        
Acknowledging
a request for a call or videoconference setup and setting up the connection
·        
Managing
ringing, dial tone, call waiting, and in some cases, caller ID and other
telephony features
·        
Detecting
and reestablishing dropped calls or video transmissions
·        
Properly
terminating a call or videoconference
On
the circuit-switched portions of the PSTN, a set of standards established by
the ITU known as SS7 (Signaling System 7) typically handles call signaling. You
should be familiar with this term, as it might appear in discussions of
interconnecting the PSTN with networks running VoIP.
In the early days
of VoIP, vendors developed their own, proprietary signaling protocols, which
meant that if you wanted to use the Internet to call your neighbor, you and
your neighbor had to use hardware or software from the same manufacturer. Now,
however, most VoIP and video-over-IP clients and gateways use standardized
signaling protocols. The following sections describe the most common of these.
H.323
On the
circuit-switched portions of the PSTN, a set of standards established by the
ITU known as SS7 (Signaling System 7) typically handles call signaling. You
should be familiar with this term, as it might appear in discussions of
interconnecting the PSTN with networks running VoIP. H.323 is an ITU standard
that describes an architecture and a group of protocols for establishing and
managing multimedia sessions on a packet-switched network. H.323 protocols may
support voice or video-over-IP services. 
Before learning
about H.323 protocols, it’s helpful to understand the set of terms unique to
H.323 that ITU has designated. Elements of VoIP and video-over-IP networks have
special names in H.323 parlance. Following are five key elements identified by
H.323:
H.323 terminal—Any node that provides audio, visual, or data
information to another node. An IP phone, video phone, or a server issuing
streaming video could be considered an H.323 terminal.
H.323 gateway—A device that provides translation between
network devices running H.323 signaling protocols and devices running other
types of signaling protocols (for example, SS7 on the PSTN).
H.323 gatekeeper—The nerve center for networks that adhere to
H.323. Gatekeepers authorize and authenticate terminals and gateways, manage
bandwidth, and oversee call routing, accounting, and billing. Gatekeepers are
optional on H.323 networks.
MCU (multipoint
control unit)—A
computer that provides support for multiple H.323 terminals (for example,
several workstations participating in a videoconference) and manages
communication between them. In videoconferencing, a video bridge serves as an
MCU.
H.323 zone—A collection of H.323 terminals, H.323
gateways, and MCUs that are managed by a single H.323 gatekeeper. Figure 12-13
illustrates an H.323 zone comprising four terminals, one gateway, and one MCU.
Now that you
understand the elements that belong to an H.323 network, you are ready to learn
about the H.225 and H.245 signaling protocols, which are specified in the H.323
standard. Both protocols operate at the Session layer of the OSI model.
However, each performs a different function. H.225 is the H.323 protocol that
handles call or videoconference signaling. For instance, when an IP telephone
user wants to make a call, the IP telephone requests call setup (from the H.323
gateway) via H.225. The same IP telephone would use H.225 protocol to announce
its presence on the network, to request the allocation of additional bandwidth,
and to indicate when it wants to terminate a call. Another H.323 Session layer
protocol, H.245, ensures that the type of information—whethvoice or
video—issued to an H.323 terminal is formatted in a way that the H.323 terminal
can interpret. To perform this task, H.245 first sets up logical channels between
the sending and receiving nodes. On a VoIP or video-over-IP network, these
logical channels are identified as port numbers at each IP address. One logical
channel is assigned to each transmission direction. Thus, for a call between
two IP telephones, H.245 would use two separate control channels. Note that
these channels are distinct from the channels used for H.225 call signing. They
are also different from channels used to exchange the actual voice or video
sign(for example, the words you speak during a conversation or the pictures
transmitted in videoconference). In addition to the H.225 and H.245 signaling
protocols, the H.323 standard also special interoperability with certain
protocols at the Presentation layer, such as those responsible coding and
decoding signals, and at the Transport layer. Later in this chapter, you’ll
learn about the Transport layer protocols used with voice and video services.
ITU codified H.323 as an open protocol for multiservice signaling in 1996.
Early versions the H.323 protocol suffered from slow call setup, due to the
volume of messages exchanged between nodes. Since that time, ITU has revised
and improved H.323 standards several times and H.323 remains a popular
signaling protocol on large voice and video networks. AftH.323 was released,
however, another protocol for VoIP call signaling, SIP, emerged and attracted
the attention of network administrators.
SIP
(Session Initiation Protocol)
SIP (Session
Initiation Protocol) is a protocol that performs functions similar to those
performed by H.323. SIP is an Application layer signaling and control protocol
for multi-service-packet-based networks. The protocol’s developers modeled it
on HTTP. For example, text-based messages that clients exchange to initiate a
VoIP call are formatted like an HTTP request and rely on URL-style addresses.
Developers also aimed to reuse as many existing TCP/IP protocols as possible
for managing sessions and providing enhanced services. Furthermore, they wanted
SIP to be modular and specific. SIP’s capabilities are limited to the following:
·        
Determining
the location of an endpoint, which in SIP terminology refers to any client,
server, or gateway communicating on a network; this means SIP translates the
endpoint’s name into its current network address.
·        
Determining
the availability of an endpoint; if SIP discovers that a client is not
available, it returns a message indicating whether the client was already
connected a call or simply didn’t respond.
·        
Establishing
a session between two endpoints and managing calls by adding (inviting),
dropping, or transferring participants.
·        
Negotiating
features of a call or videoconference when it’s established; for example,
agreeing on the type of encoding both endpoints will employ.
·        
Changing
features of a call or videoconference while it’s connected
SIP’s functions are more limited than those performed by the protocols
in the H.323 group. For example, SIP does not supply some enhanced features,
such as caller ID, that H.323 does. Instead, it depends on other protocols and
services to supply them. As with H.323, a SIP network uses terms and follows a
specific architecture mapped out in the standard. Components of a SIP network
include the following:
User
agent—This is any node that initiates or responds to SIP requests.
User
agent client—These are end-user devices, which may include
workstations, tablet computers, smartphones, or IP telephones. A user agent
client initiates a SIP connection.
User
agent server—This type of server responds to user agent clients’ requests
for session initiation and termination. Practically speaking, a device such as
an IP telephone can act as a user agent client and server, thus allowing it to
directly contact and establish sessions with other clients in a peer-to-peer
fashion. As you have learned, however, peer-to-peer arrangements are
undesirable because they become difficult to manage when more than a few users
participate. User agent clients and user agent servers are considered user
agents.
Registrar
server—This type of server maintains a database containing information about
the locations (network addresses) of each user agent in its domain. When a user
agent joins a SIP network, it transmits its location information to the SIP
registrar server.
Proxy
server—This type of server accepts requests for location information from user
agents, then queries the nearest registrar server on behalf of those user
agents. If the recipient user agent is in the SIP proxy server’s domain, then
that server will also act as a go-between for calls established and terminated
between the requesting user agent and the recipient user agent. If the
recipient user agent is not in the SIP proxy server’s domain, the proxy server
will pass on session information to a SIP redirect server. Proxy servers are optional
on a SIP network.
Redirect
server—This type of server accepts and responds to requests from user agents
and SIP proxy servers for location information on recipients that belong to
external domains. A redirect server does not get involved in establishing or
maintaining sessions. Redirect servers are optional on SIP networks.
Figure 12-14 shows how the elements of a SIP system may be arranged on a
network. In this example, user agents connect to proxy servers, which accept
and forward addressing requests and also make use of redirect servers to learn
about user agents on other domains. For purposes of illustration, the registrar
server, proxy server, and redirect server are shown as separate computers in
Figure 12-14. However, on a SIP network all might be installed on a single
computer. Some VoIP vendors prefer SIP because of its simplicity, which makes
SIP easier to maintain than H.323. And because it requires fewer instructions
to control a call, SIP consumes fewer processing resources than H.323. In some
cases, SIP is more flexible than H.323. For example, it is designed to work
with many types of Transport layer protocols, not just one. One popular system
based on SIP is Asterisk, an open source IP-PBX software package. Companies
that provide telephone equipment, such as 3Com, Avaya, Cisco, and Nortel, also
supply SIP software with their hardware. SIP and H.323 regulate call signaling
and control for VoIP or video-over-IP clients and servers. However, they do not
account for communication between media gateways. This type of communication is
governed by one of two protocols, MGCP or MEGACO, which are discussed in the
following sections.
MGCP
(Media Gateway Control Protocol) and MEGACO (H.248)
You have learned
about gateways in the context of WANs and VPNs. Gateways are also integral to
converged networks. A media gateway accepts PSTN lines, converts the analog
signals into VoIP format, and translates between SS7, the PSTN signaling
protocol suite, and VoIP signaling protocols, such as H.323 or SIP. You have
also learned that information (or “payload,” such as the speech carried by a
VoIP network) uses different channels from and may take different logical or
physical paths than control signals. In fact, to expedite information handling,
the use of separate physical paths is often preferable. The reason for this is
that if media gateways are freed from having to process control signals, they
can dedicate their resources (for example, ports and processors) to encoding,
decoding, and translating data. As a result, they process information faster.
And as you have learned, faster data processing on a converged network is
particularly important, given quality and reliability concerns. However,
gateways still need to exchange and translate signaling and control information
with each other so that voice and video packets are properly routed through the
network. To do so, gateways rely on an intermediate device known as an MGC
(media gateway controller). As its name implies, an MGC is a computer that
manages multiple media gateways. This means that it facilitates the exchange of
call signaling information between these gateways. It also manages and
disseminates information about the paths that voice or video signals take
between gateways. Because it is software that performs call switching
functions, an MGC is sometimes called a Softswitch. For example, suppose a
network has multiple media gateways, all of which accept thousands of
connections from both the PSTN and from private TCP/IP WAN and LAN links. When
a media gateway receives a call, rather than attempting to determine how to
handle the call, the gateway simply contacts the media gateway controller with
a message that essentially says, “I received a signal. You figure out what to
do with it next.” The media gateway controller then determines which of the
network’s media gateways should translate the information carried by the
signal. It also figures out which physical media the call should be routed over,
according to what signaling protocols the call must be managed, and to what
devices the call should be directed. 
After the media
gateway controller has processed this information, it instructs the appropriate
media gateways how to handle the call. The media gateways simply follow orders
from the media gateway controller. MGCs are especially advantageous on large
VoIP networks—for example, at a telecommunications carrier’s CO. In such an
environment, they make a group of media gateways appear to the outside world as
one large gateway. This centralizes call control functions, which can simplify
network management. Figure 12-15 illustrates this model. (Note that in this
figure, as on most large networks, the media gateways supply access services.)
MGCs communicate with media gateways according to one of several protocols. The
older protocol is MGCP (Media Gateway Control Protocol). MGCP is commonly used
on multiservice networks that support a number of media gateways. It can
operate in conjunction with H.323 or SIP call signaling and control protocols.
A newer gateway control protocol is MEGACO. MEGACO performs the same functions
as MGCP, but using different commands and processes. Like MGCP, MEGACO can
operate with H.323 or SIP. Many network engineers consider MEGACO superior to
MGCP because it supports a broader range of network technologies, including
ATM. MEGACO was developed by cooperative efforts of the ITU and IETF, and the
ITU has codified the MEGACO protocol in its H.248 standard.
Bear in mind that this
chapter describes only some of the signaling protocols used on converged
networks. In fact, some softphones, VoIP servers, and videoconferencing
software packages (for example, Skype) use proprietary protocols, which means
that these devices or applications will only work with other devices or
applications that use the same proprietary protocols.
Now that you are
familiar with the most popular session control protocols used on converged
networks, you are ready to learn about the transport protocols that work in
tandem with those session control protocols.
Transport Protocols
The protocols you
just learned about only communicate information about a voice or video session.
At the Transport layer, a different set of protocols is used to actually
deliver the voice or video payload—for example, the bits of encoded voice that
together make up words spoken into an IP telephone. Recall that on a TCP/IP
network, the UDP and TCP protocols operate at the Transport layer of the OSI
model. TCP is connection oriented and, therefore, provides some measure of
delivery guarantees. UDP, on the other hand, is connectionless, and does not
pay attention to the order in which packets arrive or how quickly they arrive.
Despite this lack of accountability, UDP is preferred over TCP for real-time
applications such as telephone conversations and videoconferences because it
requires less overhead and, as a result, can transport packets more quickly. In
transporting voice and video signals, TCP’s slower delivery of packets is intolerable.
However UDP’s occasional loss of packets is tolerable—that is, as long as
additional protocols are used in conjunction with UDP to make up for its
faults. 
RTP
(Real-time Transport Protocol)
One protocol that
helps voice and video networks overcome UDP’s shortcomings is the RTP (Real-time
Transport Protocol). RTP operates at the Application layer of the OSI model (despite
its name) and relies on UDP at the Transport layer. It applies sequence numbers
to indicate the order in which packets should be assembled at their
destination. Sequence numbers also help to indicate whether packets were lost
during transmission. In addition, RTP assigns each packet a time stamp that
corresponds to when the data in the packet were sampled from the voice or video
stream. 
This time stamp
helps the receiving node to compensate for network delay and to synchronize the
signals it receives. RTP alone does not, however, provide any mechanisms to
detect whether or not it’s successful. For that, it relies on a companion
protocol, RTCP.
RTCP
(Real-time Transport Control Protocol)
RTCP (Real-time
Transport Control Protocol or RTP Control Protocol) provides feedback on the
quality of a call or videoconference to its participants. RTCP packets are
transmitted periodically to all session endpoints. RTCP allows for several
types of messages. For example, each sender issues information about its
transmissions’ NTP (Network Time Protocol) time stamps, RTP time stamps, number
of packets, and number of bytes. Recipients of RTP data use RTCP to issue
information about the number and percentage of packets lost and delay suffered
between the sender and receiver. RTCP also maintains identifying information
for RTP sources. The value of RTCP lies in what clients and their applications
do with the information that RTCP supplies. For example, if a call
participant’s software uses RTCP to report that an excessive number of packets
are being delayed during transmission, the sender’s software can adjust the rate
at which it issues RTP packets. RTCP is not mandatory on networks that use RTP.
In fact, on large networks running high bandwidth services, such as IPTV, RTCP
might not be able to supply useful feedback in a timely manner. Some network
administrators prefer not to use it. It’s important to realize that although
RTP and RTCP can provide information about packet order, loss, and delay, they
cannot do anything to correct transmission flaws. Attempts to correct these
flaws, and thus improve the quality of a voice or video signal, are handled by
QoS protocols, which are discussed next.
QoS (Quality of Service) Assurance
Despite all the
advantages to using VoIP and video over IP, it is more difficult to transmit
these types of signals over a packet-switched network than it is to transmit
data signals.
First, more so than
data transmissions, voice and video can easily be distorted by a connection’s
inconsistent QoS. When you talk with your friend, you need to hear his
syllables in the order in which he uttered them, and preferably, without delay.
When you watch a movie over the Web, you want to see the scenes sequentially
and without interruption. In general, to prevent delays, disorder, and
distortion, a voice or video connection requires more dedicated bandwidth than
a data connection. In addition, it requires the use of techniques that ensure
high QoS. QoS is a measure of how well a network service matches its expected
performance. From the point of view of a person using VoIP or video over IP,
high QoS translates into an uninterrupted, accurate, and faithful reproduction
of audio or visual input. Low, or poor, QoS is often cited as a key
disadvantage to using VoIP or video over IP.  But although early attempts at converged
services sounded and looked dreadful, thanks to technology improvements, these
services now achieve quality comparable to the PSTN (in the case of VoIP) and
cable television (in the case of video over IP). Network engineers have
developed several techniques to overcome the QoS challenges inherent in
delivering voice and video over IP. The following sections describe three of
these techniques, all of which are standardized by IETF.
RSVP
(Resource Reservation Protocol)
RSVP (Resource
Reservation Protocol), specified in RFC 2205, is a Transport layer protocol
that attempts to reserve a specific amount of network resources for a
transmission before the transmission occurs. In other words, assuming it is
successful; RSVP will create a path between the sender and receiver that
provides sufficient bandwidth for the signal to arrive without suffering delay.
You can think of RSVP as a technique that addresses the QoS problem by
emulating a circuit-switched connection. 
To establish the
path, the sending node issues a PATH statement via RSVP to the receiving node.
This PATH message indicates the amount of bandwidth the sending node requires
for its transmission, as well as the level of service it expects. RSVP allows
for two service types: guaranteed service and controlled-load
service. Guaranteed service assures that the transmission will not
suffer packet losses and that it will experience minimal delay. Controlled-load
service provides the type of QoS a transmission would experience if the network
carried little traffic. Each router that the PATH message traverses marks the
transmission’s path by noting which router the PATH message came from. This
process continues until the PATH message reaches its destination. But the
reservation is not yet complete. After the destination node receives the PATH
message, it responds with a Reservation Request (RESV) message. The RESV
message follows the same path taken by the PATH message, but in reverse. It
reiterates information about bandwidth requirements that the sending node
transmitted in its PATH message. It also includes information about the type of
service the sending node requested. Upon receiving the RESV message, each
router between the destination node and the sender allocates the requested
bandwidth to the message’s path. This assumes that each router is capable of
interpreting RSVP messages and also has sufficient bandwidth to allocate to the
transmission. If routers do not have sufficient bandwidth to allocate, they
reject the reservation request. After each router in the established path has
agreed to allocate the specified amount of bandwidth to the transmission, the
sending node transmits its data. It’s important to note that RSVP messaging is
separate from the data transmission. In other words, RSVP does not modify the
packets that carry voice or video signals. Another characteristic about RSVP is
that it can only specify and manage unidirectional transmission. Therefore, for
two users to participate in a VoIP call or a videoconference, the resource
reservation process must take place in both directions. Because it emulates a
circuit-switched path, RSVP provides excellent QoS. However, one drawback to
RSVP is its high overhead. It requires a series of message exchanges before
data transmission can occur. Thus, RSVP consumes more network resources than
some other QoS techniques. Although RSVP might be acceptable on small networks,
it is less popular on large, heavily trafficked networks. Instead, these
networks use more streamlined QoS techniques, such as DiffServ.
DiffServ
(Differentiated Service)
DiffServ
(Differentiated Service) is a simple technique that addresses QoS issues by
prioritizing traffic. It differs significantly from RSVP in that it modifies
the actual IP datagrams that contain payload data. Also, it takes into account
all types of network traffic, not just the time-sensitive services such as
voice and video. That way, it can assign voice streams a high priority and at
the same time assign unessential data streams (for example, an employee surfing
the Internet on his lunch hour) a low priority. This technique offers more
protection for the time sensitive voice and video services. To prioritize
traffic, DiffServ places information in the DiffServ field in an IPv4 datagram.
(For a review of the fields in an IP datagram, refer to Chapter 4.) In IPv6
datagrams, DiffServ uses a similar field known as the Traffic Class field. This
information indicates to the network routers how the data stream should be
forwarded. DiffServ defines two types of forwarding: EF (Expedited Forwarding) or
AF (Assured Forwarding). In EF, a data stream is assigned a minimum departure
rate from a given node. This technique circumvents delays that slow normal data
from reaching its destination on time and in sequence. In AF, different levels
of router resources can be assigned to data streams. AF prioritizes data
handling, but provides no guarantee that on a busy network, packets will arrive
on time and in sequence. This description of DiffServ’s prioritization
mechanisms is oversimplified, but a deeper discussion is beyond the scope of
this book. Because of its simplicity and relatively low overhead, DiffServ is
better suited to large, heavily trafficked networks than RSVP.
MPLS
(Multiprotocol Label Switching)
Another QoS
technique that modifies data streams at the Network layer is MPLS
(multiprotocol label switching). As described in Chapter 5, to indicate where
data should be forwarded, MPLS replaces the IP datagram header with a label at
the first router a data stream encounters. The MPLS label contains information
about where the router should forward the packet next. Each router in the data
stream’s path revises the label to indicate the data’s next hop. In this
manner, routers on a network can take into consideration network congestion,
QoS indicators assigned to the packets, plus other criteria. MPLS forwarding is
also fast. This is because, in MPLS, a router knows precisely where to forward
a packet. On a typical packet-switched network, routers compare the destination
IP address to their routing tables, and forward data to the node with the
closest matching address. With MPLS, data streams are more likely to arrive
without delay. On a network supplying clients with voice and video services,
fast transmission is desirable. A network’s connectivity devices and clients
must support the same set of protocols to achieve their QoS benefits. However,
networks can—and often do—combine multiple QoS techniques.
Chapter Summary
■ The use of a
network (either public or private) to carry voice signals using the TCP/IP
protocol is commonly known as VoIP (Voice over IP). VoIP services can operate
over any type of transmission medium and access method that support TCP/IP.
■ When VoIP relies
on the Internet, it is often called Internet telephony. But not all
VoIP calls are
carried over the Internet. In fact, VoIP over private lines is an effective and
economical method of completing calls between two locations within an
organization.
■ An organization
might use VoIP to save money on telephone calls, centralize management of voice
and data services, or take advantage of customizable call features.
■ Many types of
clients and network designs are available with VoIP networks. Clients can be
traditional analog telephones, IP telephones, or softphones (a computer running
telephony software and connected to a microphone and headphones). In each case,
analog voice signals are first converted to digital signals by a voice codec
(coder/ decoder).
■ Analog VoIP
clients may connect to IP networks in one of four ways: using an internal or
external ATA (analog telephone adapter); connecting directly to a router or to
a voice-data gateway that digitizes call information; connecting directly to an
IP-PBX capable of handling both analog and digital voice connections; or connecting
to an analog PBX, which then connects to a voice-data gateway.
■ Digital VoIP
clients typically connect to a digital PBX or other connectivity device with
VoIP capabilities.
■ A special type of
digital PBX is one that is hosted by a service provider. Organizations that
choose a hosted PBX don’t need to install or maintain hardware or software for call
completion and management. Hosted PBXs are also known as virtual PBXs.
■ Rather than
analog telephones, many VoIP installations use IP telephones, which transmit
and receive only digital signals. To communicate on the network, each IP
telephone must have a unique IP address. The IP telephone connects to an RJ-45 wall
jack, like a computer workstation.
■ One significant
benefit to using IP telephones is their mobility. Because IP telephones are
addressable over a network, they can be moved from one office to another
office, be connected to a wall jack, and be ready to accept or make calls.
■ Rather than using
traditional telephones or IP telephones, a third option is to use a computer
programmed to act like an IP telephone, otherwise known as a softphone. Softphones
and IP telephones provide the same calling functions; they simply connect to
the network and deliver services differently. A softphone’s versatile
connectivity makes it an optimal VoIP solution for traveling employees and
telecommuters.
■ Streaming video
refers to video signals that are compressed and delivered in a continuous
stream. It may be made available as files stored on a streaming server for video-on-demand
or as real-time feeds in live streaming. Streaming video may be delivered in a
point-to-point or point-to-multipoint fashion, though both types typically use
unicast transmission.
■ In IPTV (IP
television), television signals from broadcast or cable networks travel over packet-switched
connections. Telecommunications carriers and cable companies supply IPTV over
their existing networks. They accept video content from satellite feeds,
national or regional broadcasters, and local sources at a head-end. IPTV channels
are delivered in multicast fashion to consumers, where they are decoded and
issued to televisions by set-top boxes.
■ Videoconferencing
allows multiple participants to communicate and collaborate at once through
audiovisual means. To view a videoconference, each participant must have at
least a video terminal or video phone that can decode compressed video and interpret
signaling protocols. To send and receive audiovisual signals, participants must
also have a device equipped with a camera, microphone, and video-encoding capabilities.
■ Videoconferences
that are point to multipoint or multipoint to multipoint rely on video bridges
to manage communication among participants. Video bridges may be hardware
devices dedicated to this task or software running on conference servers.
■ In VoIP and
video-over-IP transmission, signaling is the exchange of information
between the
components of a network or system for the purposes of establishing,
monitoring, or
releasing connections as well as controlling system operations.
■ Voice and
video-over-IP services depend on signaling protocols to request a call or
videoconference
setup, locate clients on the network and determine the best routes
for calls to
follow, and acknowledge a request for a call or videoconference setup.
Voice and
video-over-IP services also depend on signaling protocols to set up the
connection; manage
ringing, dial tone, and other telephony features; detect and
reestablish dropped
calls or video transmissions; and properly terminate a call or
videoconference.
■ H.323 is an ITU
standard that describes an architecture and a group of protocols for
establishing and
managing multimedia sessions on a packet-switched network.
■ H.225 is the
protocol specified by the H.323 standard that handles call or
videoconference
signaling. For instance, when an IP telephone user wants to make a
call, the IP
telephone requests a call setup (from the H.323 gateway) via H.225.
■ Another H.323
Session layer protocol, H.245, ensures that the type of information—
whether voice or
video—issued to an H.323 terminal is formatted in a way that the
H.323 terminal can
interpret.
■ SIP (Session
Initiation Protocol) is an Application layer signaling and control protocol
for multiservice,
packet-based networks. SIP’s developers modeled it on the HTTP
protocol and aimed
to reuse as many existing TCP/IP protocols as possible for
managing sessions
and providing enhanced services.
■ SIP does not
attempt to perform and control as many functions as the H.323
protocols. Its
capabilities are limited to determining the location of an endpoint;
determining the
availability of an endpoint; establishing a session between two
endpoints; managing
calls by adding (inviting), dropping, or transferring participants;
negotiating
features of a call or videoconference when it’s established; and changing
features of a call
or videoconference while it’s connected.
■ Some VoIP vendors
prefer SIP because of its simplicity, which makes SIP easier to
maintain than
H.323. And because it requires fewer instructions to control a call,
SIP consumes fewer
processing resources than H.323.
■ Media gateways
rely on an intermediate device known as an MGC (media gateway
controller) to
exchange and translate signaling and control information with each
other. An MGC
facilitates the exchange of call signaling information between these
gateways and
manages and disseminates information about the paths that voice or
video signals take
between gateways.
■ MGCs communicate
with media gateways according to one of several protocols.
The older protocol
is MGCP (Media Gateway Control Protocol). MEGACO performs
the same functions
as MGCP, but uses different commands and processes. Many
network engineers
consider MEGACO superior to MGCP because it supports a
broader range of
network technologies, including ATM. The ITU has codified the
MEGACO protocol in
its H.248 standard.
■ RTP (Real-time
Transport Protocol) operates at the Application layer of the OSI model
and relies on UDP
at the Transport layer. It applies sequence numbers to indicate the
order in which
packets should be assembled at their destination and assigns each packet
a time stamp that
corresponds to when the data in the packet were sampled from the
voice or video
stream. This time stamp helps the receiving node to compensate for
network delay and
to synchronize the signals it receives.
■ RTCP (Real-time
Transport Control Protocol) provides feedback on the quality of a
call or
videoconference, such as the extent of delay or packet loss in a transmission.
■ Network engineers
have developed several techniques to overcome the QoS challenges
inherent in
delivering voice and video over IP. One, RSVP (Resource Reservation
Protocol), is a
Transport layer protocol that attempts to reserve a specific amount of
network resources
for a transmission before the transmission occurs.
■ DiffServ
(Differentiated Service) is a simple technique that addresses QoS issues by
prioritizing
traffic. DiffServ places information in the DiffServ field in an IPv4
datagram. In IPv6
datagrams, DiffServ uses a similar field known as the Traffic Class
field. This
information indicates to the network routers how the data stream should
be forwarded.
■ Another QoS
technique that modifies data streams at the Network layer is MPLS
(multiprotocol
label switching). To indicate where data should be forwarded, MPLS
replaces the IP
datagram header with a label at the first router a data stream
encounters. The
MPLS label contains information about where the router should
forward the packet
next. Each router in the data stream’s path revises the label to
indicate the data’s
next hop. In this manner, routers on a network can take into
consideration
network congestion, QoS indicators assigned to the packets, plus other
criteria.
Review Questions
1.   You have decided
to establish a VoIP system
in
your home. Which of the following devices is necessary to connect your analog telephone to your
VoIP server?
a.   Codec
b.   IP-PBX
c.   Softphone
d.  ATA
2.   Skype, the popular Internet
telephony service, provides
a user with what type of
interface?
a.   IP phone
b.   Analog telephone
c.   Softphone
d.   IP-PBX
3.   A company’s
use of VoIP on its WAN to avoid long distance telephone charges
is known as:
a.   Toll bypass
b.   WAN redirect
c.   Fee gauging
d.   Circuit redirect
4.   Which
of the following is
the most popular signaling protocol
used on traditional, circuit-switched
PSTN
connections?
a.   SIP
b.  SS7
c.   H.323
d.   MEGACO
5.   Watching
a YouTube video on the Web is an example of which
of the following types of video-over-IP services?
a.   Videoconferencing
b.  Streaming
video
c.   IP multicasting
d.   IPTV
6.   In an IPTV system,
which of the following functions does
a set
top box perform?
a.   Decodes
video
signals and issues them
to a television
b.   Determines
the appropriate
amount of bandwidth necessary to deliver a requested video and adjusts the connection accordingly
c.   Interprets multicast
routing protocols to determine the most efficient means of distributing video
signals
d.   Generates
video content based
on a subscriber’s channel selection
7.   What type of video-over-IP service relies on full-duplex communication?
a.   Webcasting
b.   Streaming video
c.   Videoconferencing
d.   IPTV
8.   What protocol
manages
addressing for
multicast groups?
a.   IGMP
b.  MGCP
c.   MEGACO
d.  H.245
9.   Which
of the following protocols would be used by a video bridge to
invite a video
phone to join a videoconference?
a.   MGCP 
b.  H.225
c.   IGMP
d.  RSVP
10. Suppose your organization’s
PSTN
and VoIP systems are integrated,
and that your
VoIP system adheres to architecture specified
in H.323.
Which of the following performs translation between the PSTN’s signaling protocols and H.323
on your network?
a.   H.323 terminal
b.  H.323 gatekeeper
c.   H.323 gateway
d.  H.323 zone
11. You are using Skype to
initiate a video
call with a friend in another state. Which of the following protocols is
generating segments at
the
Transport layer of this transmission?
a.   ICMP
b.   TCP 
c.   FTP
d.  UDP
12.
What
function does the H.225 protocol
provide, as part of
the H.323 VoIP
specification?
a.
Handles call setup, call routing, and call termination
b. 
Controls communication between media gateways and media gateway
controllers 
c.   Ensures that signals issued to an H.323
terminal are in a format that the terminal can interpret
d. Indicates priority of each IP datagram
13. In SIP,
which of the following network
elements maintains a database with network address information
for every SIP client?
a.  
Redirect server
b. 
Registrar server
c.  
Domain server 
d.  Proxy
server
14.
Which of the following are reasons
for choosing SIP over H.323? (Choose
two.) 
a.   SIP is an older,
more
reliable standard.
b.   SIP has
limited
functionality, which makes
it more flexible.
c.   SIP
messages
use
fewer processing
resources.
d.   SIP includes
QoS mechanisms
that make it more dependable.
e.   SIP supports a wider range of voice and video codecs.
15.
Which of the following devices enable
multiple media gateways
to communicate?
a.   VoIP router 
b.  IP-PBX
c.   MGC
d.  IP phone
16.
At what layer
of the OSI model does
RTP operate?
a.   Transport
b.  Presentation 
c.   Session
d.  Application
17.
What
can RTCP do that RTP cannot?
a.   Issue timestamps for
every transmission
b.  Assign
sequence numbers to each
packet in
a transmission
c.   Report
on the degree of packet
loss and delay in a
connection
d.  Modify each IP datagram to
assign a priority level
18.
How does RSVP help improve QoS?
a.   It assigns a label to
each IP datagram that will
be read
and
modified by every
router in the data’s path.
b.  It
continually assesses
the
status of likely routes in the transmission’s
path and dynamically modifies IP datagrams
as they’re issued
with instructions for following the best
path.
c.   It modifies the
Priority field
in each IP datagram
so that high-bandwidth applications are
given precedence over low-bandwidth
applications.
d.  It establishes
a path between the sender and receiver that is guaranteed to supply sufficient
bandwidth for the transmission.
19. The Traffic
Class
field in an IPv6
datagram
serves the same function
as which of the following fields
in an IPv4
datagram?
a.   TTL
b.  DiffServ
c.   RSVP
d.   Padding
20. On a VoIP network
that uses the DiffServ QoS
technique,
which of the following makes
certain that a router
forwards packets
within
a given
time period?
a.   Assured Forwarding 
b.   Superior Forwarding
c.   Expedited Forwarding
d.   Best-effort Forwarding
Practice Test
1. VoIP can run over any packet-switched network.
a.      
True
b.     
False
2. NASA is developing telemedicine capabilities for diagnosing
patients in space.
a.      
True
b.     
False
3. A(n) ____ emulate and interpret conventional fax signaling
protocols when communicating with a conventional fax machine.
fax gateway
4. In the case of long-distance calling, using VoIP over a WAN
allows an organization to avoid paying long-distance telephone charges, a
benefit known as ____.
toll bypass
5.  ____ is the protocol
specified by the H.323 standard that handles call or videoconference signaling.
a.       H.225
b.      H.245
c.       H.248
d.      H.252
6. Because of its simplicity and relatively low overhead, RSVP is
better suited to large, heavily trafficked networks than DiffServ.
a.       True
b.      False
7.  A(n) ____ is a private
switch that accepts and interprets both analog and digital voice signals.
a.       H.323
terminal
b.      H.323
gateway
c.       IP-PBX
d.      registrar
server
8. To prioritize traffic, DiffServ places information in the
DiffServ field in a(n) ____.
a.       toll
bypass
b.      IPv4 datagram
c.       set
top box
d.      redirect
server
9. In ____ Forwarding, a data stream is assigned a minimum
departure rate from a given node.
expedited
10. When VoIP relies on the Internet, it is often called ____.
Internet telephony
11. FoIP (Fax over IP) uses ____ networks to transmit faxes from
one node on the network to another.
a.       bridged
b.      multipoint
c.       circuit
switched
d.      packet-switched
12. Because IP telephones are addressable over a network, they can
be moved from one office to another office, connected to a wall jack, and be
ready to accept or make calls.
a.       True
b.      False
13. SIP does not attempt to perform and control as many functions
as the H.323 protocols.
a.       True
b.      False
14. IPTV, videoconferencing, streaming video, and IP multicasting
belong to the range of services known as ____.
a.       Fax
over IP
b.      voice
over DSL
c.       video over IP
d.      Webcasts
15.    ____ is the exchange
of information between one or more participants for the purposes of training
and education.
Tele-education
16. Mobility is a benefit of IP telephones.
a.       True
b.      False
17.   ____ is a simple
technique that addresses QoS issues by prioritizing traffic.
a.       RTCP
(Real-time Transport Control Protocol)
b.      RSVP
(Resource Reservation Protocol)
c.       MPLS
(multiprotocol label switching)
d.      DiffServ (Differentiated Service)
18. In videoconferencing, a video bridge serves as a(n) ____.
a.       gateway
b.      terminal
c.       H.323 zone
d.      MCU
19. Videoconferencing is a VoIP application.
a.       True
b.      False
20. The term telephony refers to video signals that are compressed
and delivered in a continuous stream.
a.       True
b.      False
21. Gatekeepers are optional on H.323 networks.
a.       True
b.      False
22. A(n) ____ manages multiple audiovisual sessions so that
participants can see and hear each other.
a.       proxy
server
b.      set
top box
c.       video-bridge
d.      softswitch
Chapter Test
1. Using
VoIP over a WAN allows an organization to avoid paying long-distance telephone
charges, a benefit known as ____.
a.       charge bypass
b.      easypass
c.       toll bypass
d.      distance bypass
2. SIP
and H.323 account for communication between media gateways.
a.       True
b.      False
3.  ____ signaling functions are more limited than
those performed by the protocols in the H.323 group.
a.       MEGACO
b.      MGC
c.       SIP
d.      RTCP
4. ____
is the use of one network to simultaneously carry voice, video, and data
communications.
a.       Convergence
b.      Divergence
c.       Multicasting
d.      Unicasting
5. SIP
and H.323 regulate ____ for VoIP or video-over-IP clients and servers.
a.       control only
b.      call signaling only
c.       call signaling and
control
d.      communication between media gateways
6.  When streaming videos are supplied via the
Web, they are often called ____________________.
Webcasts
7.  ____ describes the use of any network to carry
voice signals using the TCP/IP protocol.
a.       Internet telephony
b.      Voice telephony
c.       Telephony
d.      IP telephony
8. When
more than two people participate in a videoconference, for example, in a
point-to-multipoint or multipoint-to-multipoint scenario, a video ____ is
required.
a.       switch
b.      gateway
c.       bridge
d.      router
 9. In general, a(n) ____ is a private switch
that accepts and interprets both analog and digital voice signals.
a.       IT-PBX
b.      Data PBX
c.       IP-PBX
d.      analog PBX
10. It
is more difficult to transmit VoIP and video over IP signals over a
packet-switched network than it is to transmit data signals.
a.       True
b.      False
 11. The popular Internet telephony software,
Skype, is a type of ____.
            a. 
IP telephone
            b.  teleapplication
            c.  compu-phone
            d. 
softphone
12. When
a caller uses an IP telephone, his or her voice is immediately digitized and
issued from the telephone to the network in ____ form.
            a.   
frame
            b.   
segment
            c.   
packet
            d.   circuit
13. An
off-the-shelf workstation can be programmed to act like an IP telephone.
a.       True
b.      False
14. IPTV,
videoconferencing, streaming video, and IP multicasting belong to the range of
services known as ____.
            a.    voice over IP
            b.    data over IP
            c.    video over IP
            d.    Web
over IP
15. Many
streaming video services - and nearly all of those issued over a public
network, such as the Internet - are examples of ____ transmissions.
            a.     broadcast
            b.     telecast
            c.     unicast
            d.     multicast
16. A
computer programmed to act like an IP telephone is known as a(n) ____.
            a.   
video phone
            b.   
softphone
            c.   
compu-phone
            d.   
streaming server
17.    ____________________ is a simple technique
that addresses QoS issues by prioritizing traffic.
DiffServ
 18.   ____________________ performs the same
functions as MGCP, but using different commands and processes.
MEGACO
 19. Many network engineers consider ____ to be
superior to MGCP.
            a.     MGC
            b.     RTCP
            c.     SIP
            d.     MEGACO
20.  When using an analog telephone, a VoIP adapter
that performs analog-to-digital conversion is known as a(n) ____.
            a.     DTA (digital telephone adapter)
            b.     ATA (analog telephone adapter)
            c.     DTA (data telephone adapter)
            d.     VTA (voice telephone adapter)
21.  ____ is a measure of how well a network
service matches its expected performance.
            a.     RSVP
            b.     DiffServ
            c.     QoS
            d.     MPLS
22.  ____________________ is a QoS technique that
replaces the IP datagram header with a label at the first router a data stream
encounters.
MPLS
 23. One drawback to ____ video is that content
may not be edited before it’s distributed.
            a.     streaming server
            b.     live streaming
            c.     on demand
            d.     VoIP
24.  IP telephones are directly connected to the
local loop.
a.       True
b.      False
25. When
VoIP relies on the Internet, it is often called ____.
            a.     telephony
            b.     IP telephony
            c.     voice telephony
            d.     Internet telephony
