The
Digital Signal
Processing
Handbook
SECOND EDITION
Video, Speech, and
Audio Signal Processing
and Associated Standards
EDITOR-IN-CHIEF
Vijay K. Madisetti
Boca Raton London New York
CRC Press is an imprint of the
Taylor & Francis Group, an informa business
The Electrical Engineering Handbook Series
Series Editor
Richard C. Dorf
University of California, Davis
Titles Included in the Series
The Handbook of Ad Hoc Wireless Networks, Mohammad Ilyas
The Avionics Handbook, Second Edition, Cary R. Spitzer
The Biomedical Engineering Handbook, Third Edition, Joseph D. Bronzino
The Circuits and Filters Handbook, Second Edition, Wai-Kai Chen
The Communications Handbook, Second Edition, Jerry Gibson
The Computer Engineering Handbook, Vojin G. Oklobdzija
The Control Handbook, William S. Levine
The CRC Handbook of Engineering Tables, Richard C. Dorf
The Digital Avionics Handbook, Second Edition Cary R. Spitzer
The Digital Signal Processing Handbook, Second Edition, Vijay K. Madisetti
The Electrical Engineering Handbook, Second Edition, Richard C. Dorf
The Electric Power Engineering Handbook, Second Edition, Leonard L. Grigsby
The Electronics Handbook, Second Edition, Jerry C. Whitaker
The Engineering Handbook, Third Edition, Richard C. Dorf
The Handbook of Formulas and Tables for Signal Processing, Alexander D. Poularikas
The Handbook of Nanoscience, Engineering, and Technology, Second Edition
William A. Goddard, III, Donald W. Brenner, Sergey E. Lyshevski, and Gerald J. Iafrate
The Handbook of Optical Communication Networks, Mohammad Ilyas and
Hussein T. Mouftah
The Industrial Electronics Handbook, J. David Irwin
The Measurement, Instrumentation, and Sensors Handbook, John G. Webster
The Mechanical Systems Design Handbook, Osita D.I. Nwokah and Yidirim Hurmuzlu
The Mechatronics Handbook, Second Edition, Robert H. Bishop
The Mobile Communications Handbook, Second Edition, Jerry D. Gibson
The Ocean Engineering Handbook, Ferial El-Hawary
The RF and Microwave Handbook, Second Edition, Mike Golio
The Technology Management Handbook, Richard C. Dorf
The Transforms and Applications Handbook, Second Edition, Alexander D. Poularikas
The VLSI Handbook, Second Edition, Wai-Kai Chen
The Digital Signal Processing Handbook, Second Edition
Digital Signal Processing Fundamentals
Video, Speech, and Audio Signal Processing and Associated Standards
Wireless, Networking, Radar, Sensor Array Processing, and Nonlinear Signal Processing
MATLAB® is a trademark of The MathWorks, Inc. and is used with permission. The MathWorks does not warrant the accuracy
of the text or exercises in this book. This book’s use or discussion of MATLAB® software or related products does not consti-
tute endorsement or sponsorship by The MathWorks of a particular pedagogical approach or particular use of the MATLAB®
software.
CRC Press
Taylor & Francis Group
6000 Broken Sound Parkway NW, Suite 300
Boca Raton, FL 33487-2742
© 2010 by Taylor and Francis Group, LLC
CRC Press is an imprint of Taylor & Francis Group, an Informa business
No claim to original U.S. Government works
Printed in the United States of America on acid-free paper
10 9 8 7 6 5 4 3 2 1
International Standard Book Number: 978-1-4200-4608-3 (Hardback)
This book contains information obtained from authentic and highly regarded sources. Reasonable efforts have been made to
publish reliable data and information, but the author and publisher cannot assume responsibility for the validity of all materials
or the consequences of their use. The authors and publishers have attempted to trace the copyright holders of all material repro-
duced in this publication and apologize to copyright holders if permission to publish in this form has not been obtained. If any
copyright material has not been acknowledged please write and let us know so we may rectify in any future reprint.
Except as permitted under U.S. Copyright Law, no part of this book may be reprinted, reproduced, transmitted, or utilized in any
form by any electronic, mechanical, or other means, now known or hereafter invented, including photocopying, microfilming,
and recording, or in any information storage or retrieval system, without written permission from the publishers.
For permission to photocopy or use material electronically from this work, please access www.copyright.com (http://www.copy-
right.com/) or contact the Copyright Clearance Center, Inc. (CCC), 222 Rosewood Drive, Danvers, MA 01923, 978-750-8400.
CCC is a not-for-profit organization that provides licenses and registration for a variety of users. For organizations that have been
granted a photocopy license by the CCC, a separate system of payment has been arranged.
Trademark Notice: Product or corporate names may be trademarks or registered trademarks, and are used only for identifica-
tion and explanation without intent to infringe.
Library of Congress Cataloging-in-Publication Data
Video, speech, and audio signal processing and associated standards / Vijay K. Madisetti.
p. cm.
“Second edition of the DSP Handbook has been divided into three parts.”
Includes bibliographical references and index.
ISBN 978-1-4200-4608-3 (alk. paper)
1. Signal processing--Digital techniques--Standards. 2. Digital video--Standards. 3. Image
processing--Digital techniques--Standards. 4. Speech processing systems--Standards 5. Sound--Recording
and reproducing--Digital techniques--Standards I. Madisetti, V. (Vijay) II. Digital signal processing
handbook. III. Title.
TK5102.9.V493 2009
621.382’2--dc22
Visit the Taylor & Francis Web site at
http://www.taylorandfrancis.com
and the CRC Press Web site at
http://www.crcpress.com
2009022594
Contents
Preface .................................................................................................................................................. vii
Editor ..................................................................................................................................................... ix
Contributors ........................................................................................................................................ xi
PART I Digital Audio Communications
Nikil Jayant
1 Auditory Psychophysics for Coding Applications ......................................................... 1-1
Joseph L. Hall
2 MPEG Digital Audio Coding Standards .......................................................................... 2-1
Schuyler R. Quackenbush and Peter Noll
3 Dolby Digital Audio Coding Standards ........................................................................... 3-1
Robert L. Andersen and Grant A. Davidson
4 The Perceptual Audio Coder .............................................................................................. 4-1
Deepen Sinha, James D. Johnston, Sean Dorward, and Schuyler R. Quackenbush
5 Sony Systems .......................................................................................................................... 5-1
Kenzo Akagiri, Masayuki Katakura, H. Yamauchi, E. Saito, M. Kohut,
Masayuki Nishiguchi, Kyoya Tsutsui, and Keisuke Toyama
PART II Speech Processing
Richard V. Cox and Lawrence R. Rabiner
6 Speech Production Models and Their Digital Implementations ................................ 6-1
M. Mohan Sondhi and Juergen Schroeter
7 Speech Coding ........................................................................................................................ 7-1
Richard V. Cox
8 Text-to-Speech Synthesis ..................................................................................................... 8-1
Richard Sproat and Joseph Olive
9 Speech Recognition by Machine ........................................................................................ 9-1
Lawrence R. Rabiner and Biing-Hwang Juang
v
vi
Contents
10 Speaker Verification ............................................................................................................ 10-1
Sadaoki Furui and Aaron E. Rosenberg
11 DSP Implementations of Speech Processing ................................................................. 11-1
Kurt Baudendistel
12 Software Tools for Speech Research and Development ............................................. 12-1
John Shore
PART III
Image and Video Processing
Jan Biemond and Russell M. Mersereau
13 Fundamentals of Image Processing ................................................................................. 13-1
Ian T. Young, Jan J. Gerbrands, and Lucas J. van Vliet
14 Still Image Compression .................................................................................................... 14-1
Tor A. Ramstad
15 Image and Video Restoration ........................................................................................... 15-1
A. Murat Tekalp
16 Video Scanning Format Conversion and Motion Estimation .................................. 16-1
Gerard de Haan and Ralph Braspenning
17 Document Modeling and Source Representation in Content-Based
Image Retrieval .................................................................................................................... 17-1
Soo Hyun Bae and Biing-Hwang Juang
18 Technologies for Context-Based Video Search over the World Wide Web ......... 18-1
Arshdeep Bahga and Vijay K. Madisetti
19 Image Interpolation ............................................................................................................ 19-1
Yucel Altunbasak
20 Video Sequence Compression .......................................................................................... 20-1
Osama Al-Shaykh, Ralph Neff, David Taubman, and Avideh Zakhor
21 Digital Television ................................................................................................................. 21-1
Kou-Hu Tzou
22 Stereoscopic Image Processing ......................................................................................... 22-1
Reginald L. Lagendijk, Ruggero E. H. Franich, and Emile A. Hendriks
23 A Survey of Image Processing Software and Image Databases ................................ 23-1
Stanley J. Reeves
24 VLSI Architectures for Image Communications.......................................................... 24-1
P. Pirsch and W. Gehrke
Index ................................................................................................................................................... I-1
Preface
Digital signal processing (DSP) is concerned with the theoretical and practical aspects of representing
information-bearing signals in a digital form and with using computers, special-purpose hardware and
software, or similar platforms to extract information, process it, or transform it in useful ways. Areas
where DSP has made a significant impact include telecommunications, wireless and mobile communi-
cations, multimedia applications, user interfaces, medical technology, digital entertainment, radar and
sonar, seismic signal processing, and remote sensing, to name just a few.
Given the widespread use of DSP, a need developed for an authoritative reference, written by the top
experts in the world, that would provide information on both theoretical and practical aspects in a
manner that was suitable for a broad audience—ranging from professionals in electrical engineering,
computer science, and related engineering and scientific professions to managers involved in technical
marketing, and to graduate students and scholars in the field. Given the abundance of basic and
introductory texts on DSP, it was important to focus on topics that were useful to engineers and scholars
without overemphasizing those topics that were already widely accessible. In short, the DSP handbook
was created to be relevant to the needs of the engineering community.
A task of this magnitude could only be possible through the cooperation of some of the foremost DSP
researchers and practitioners. That collaboration, over 10 years ago, produced the first edition of the
successful DSP handbook that contained a comprehensive range of DSP topics presented with a clarity of
vision and a depth of coverage to inform, educate, and guide the reader. Indeed, many of the chapters,
written by leaders in their field, have guided readers through a unique vision and perception garnered by
the authors through years of experience.
The second edition of the DSP handbook consists of Digital Signal Processing Fundamentals; Video,
Speech, and Audio Signal Processing and Associated Standards; and Wireless, Networking, Radar, Sensor
Array Processing, and Nonlinear Signal Processing to ensure that each part is dealt with in adequate detail
and that each part is then able to develop its own individual identity and role in terms of its educational
mission and audience. I expect each part to be frequently updated with chapters that reflect the changes
and new developments in the technology and in the field. The distribution model for the DSP handbook
also reflects the increasing need by professionals to access content in electronic form anywhere and
at anytime.
Video, Speech, and Audio Signal Processing and Associated Standards, as the name implies, provides a
comprehensive coverage of the basic foundations of speech, audio, image, and video processing and
associated applications to broadcast, storage, search and retrieval, and communications.
This book needs to be continuously updated to include newer aspects of these technologies, and I look
forward to suggestions on how this handbook can be improved to serve you better.
vii