Fundamentals of Multimedia Second Edition.pdf

Published: 2022-06-13 | Uploaded by: admin | Category: Manuals | File size: 22.17M | Format: pdf | Pages: 741

Preface

Contents

Part I Introduction and Multimedia Data Representations

1 Introduction to Multimedia

1.1 What is Multimedia?

1.1.1 Components of Multimedia

1.2 Multimedia: Past and Present

1.2.1 Early History of Multimedia

1.2.2 Hypermedia, WWW, and Internet

1.2.3 Multimedia in the New Millennium

1.3 Multimedia Software Tools: A Quick Scan

1.3.1 Music Sequencing and Notation

1.3.2 Digital Audio

1.3.3 Graphics and Image Editing

1.3.4 Video Editing

1.3.5 Animation

1.3.6 Multimedia Authoring

1.4 Multimedia in the Future

1.5 Exercises

2 A Taste of Multimedia

2.1 Multimedia Tasks and Concerns

2.2 Multimedia Presentation

2.3 Data Compression

2.4 Multimedia Production

2.5 Multimedia Sharing and Distribution

2.6 Some Useful Editing and Authoring Tools

2.6.1 Adobe Premiere

2.6.2 Adobe Director

2.6.3 Adobe Flash

2.7 Exercises

3 Graphics and Image Data Representations

3.1 Graphics/Image Data Types

3.1.1 1-Bit Images

3.1.2 8-Bit Gray-Level Images

3.1.3 Image Data Types

3.1.4 24-Bit Color Images

3.1.5 Higher Bit-Depth Images

3.1.6 8-Bit Color Images

3.1.7 Color Lookup Tables

3.2 Popular File Formats

3.2.1 GIF

3.2.2 JPEG

3.2.3 PNG

3.2.4 TIFF

3.2.5 Windows BMP

3.2.6 Windows WMF

3.2.7 Netpbm Format

3.2.8 EXIF

3.2.9 PS and PDF

3.2.10 PTM

3.3 Exercises

4 Color in Image and Video

4.1 Color Science

4.1.1 Light and Spectra

4.1.2 Human Vision

4.1.3 Spectral Sensitivity of the Eye

4.1.4 Image Formation

4.1.5 Camera Systems

4.1.6 Gamma Correction

4.1.7 Color-Matching Functions

4.1.8 CIE Chromaticity Diagram

4.1.9 Color Monitor Specifications

4.1.10 Out-of-Gamut Colors

4.1.11 White Point Correction

4.1.12 XYZ to RGB Transform

4.1.13 Transform with Gamma Correction

4.1.14 L*a*b* (CIELAB) Color Model

4.1.15 More Color Coordinate Schemes

4.1.16 Munsell Color Naming System

4.2 Color Models in Images

4.2.1 RGB Color Model for Displays

4.2.2 Multisensor Cameras

4.2.3 Camera-Dependent Color

4.2.4 Subtractive Color: CMY Color Model

4.2.5 Transformation from RGB to CMY

4.2.6 Undercolor Removal: CMYK System

4.2.7 Printer Gamuts

4.2.8 Multi-ink Printers

4.3 Color Models in Video

4.3.1 Video Color Transforms

4.3.2 YUV Color Model

4.3.3 YIQ Color Model

4.3.4 YCbCr Color Model

4.4 Exercises

5 Fundamental Concepts in Video

5.1 Analog Video

5.1.1 NTSC Video

5.1.2 PAL Video

5.1.3 SECAM Video

5.2 Digital Video

5.2.1 Chroma Subsampling

5.2.2 CCIR and ITU-R Standards for Digital Video

5.2.3 High-Definition TV

5.2.4 Ultra High Definition TV (UHDTV)

5.3 Video Display Interfaces

5.3.1 Analog Display Interfaces

5.3.2 Digital Display Interfaces

5.4 3D Video and TV

5.4.1 Cues for 3D Percept

5.4.2 3D Camera Models

5.4.3 3D Movie and TV Based on Stereo Vision

5.4.4 The Vergence-Accommodation Conflict

5.4.5 Autostereoscopic (Glasses-Free) Display Devices

5.4.6 Disparity Manipulation in 3D Content Creation

5.5 Exercises

6 Basics of Digital Audio

6.1 Digitization of Sound

6.1.1 What is Sound?

6.1.2 Digitization

6.1.3 Nyquist Theorem

6.1.4 Signal-to-Noise Ratio (SNR)

6.1.5 Signal-to-Quantization-Noise Ratio (SQNR)

6.1.6 Linear and Nonlinear Quantization

6.1.7 Audio Filtering

6.1.8 Audio Quality Versus Data Rate

6.1.9 Synthetic Sounds

6.2 MIDI: Musical Instrument Digital Interface

6.2.1 MIDI Overview

6.2.2 Hardware Aspects of MIDI

6.2.3 Structure of MIDI Messages

6.2.4 General MIDI

6.2.5 MIDI-to-WAV Conversion

6.3 Quantization and Transmission of Audio

6.3.1 Coding of Audio

6.3.2 Pulse Code Modulation

6.3.3 Differential Coding of Audio

6.3.4 Lossless Predictive Coding

6.3.5 DPCM

6.3.6 DM

6.3.7 ADPCM

6.4 Exercises

Part II Multimedia Data Compression

7 Lossless Compression Algorithms

7.1 Introduction

7.2 Basics of Information Theory

7.3 Run-Length Coding

7.4 Variable-Length Coding

7.4.1 Shannon–Fano Algorithm

7.4.2 Huffman Coding

7.4.3 Adaptive Huffman Coding

7.5 Dictionary-Based Coding

7.6 Arithmetic Coding

7.6.1 Basic Arithmetic Coding Algorithm

7.6.2 Scaling and Incremental Coding

7.6.3 Integer Implementation

7.6.4 Binary Arithmetic Coding

7.6.5 Adaptive Arithmetic Coding

7.7 Lossless Image Compression

7.7.1 Differential Coding of Images

7.7.2 Lossless JPEG

7.8 Exercises

8 Lossy Compression Algorithms

8.1 Introduction

8.2 Distortion Measures

8.3 The Rate-Distortion Theory

8.4 Quantization

8.4.1 Uniform Scalar Quantization

8.4.2 Nonuniform Scalar Quantization

8.4.3 Vector Quantization

8.5 Transform Coding

8.5.1 Discrete Cosine Transform (DCT)

8.5.2 Karhunen–Loève Transform*

8.6 Wavelet-Based Coding

8.6.1 Introduction

8.6.2 Continuous Wavelet Transform*

8.6.3 Discrete Wavelet Transform*

8.7 Wavelet Packets

8.8 Embedded Zerotree of Wavelet Coefficients

8.8.1 The Zerotree Data Structure

8.8.2 Successive Approximation Quantization

8.8.3 EZW Example

8.9 Set Partitioning in Hierarchical Trees (SPIHT)

8.10 Exercises

9 Image Compression Standards

9.1 The JPEG Standard

9.1.1 Main Steps in JPEG Image Compression

9.1.2 JPEG Modes

9.1.3 A Glance at the JPEG Bitstream

9.2 The JPEG2000 Standard

9.2.1 Main Steps of JPEG2000 Image Compression*

9.2.2 Adapting EBCOT to JPEG2000

9.2.3 Region-of-Interest Coding

9.2.4 Comparison of JPEG and JPEG2000 Performance

9.3 The JPEG-LS Standard

9.3.1 Prediction

9.3.2 Context Determination

9.3.3 Residual Coding

9.3.4 Near-Lossless Mode

9.4 Bi-level Image Compression Standards

9.4.1 The JBIG Standard

9.4.2 The JBIG2 Standard

9.5 Exercises

10 Basic Video Compression Techniques

10.1 Introduction to Video Compression

10.2 Video Compression Based on Motion Compensation

10.3 Search for Motion Vectors

10.3.1 Sequential Search

10.3.2 2D Logarithmic Search

10.3.3 Hierarchical Search

10.4 H.261

10.4.1 Intra-Frame (I-Frame) Coding

10.4.2 Inter-Frame (P-Frame) Predictive Coding

10.4.3 Quantization in H.261

10.4.4 H.261 Encoder and Decoder

10.4.5 A Glance at the H.261 Video Bitstream Syntax

10.5 H.263

10.5.1 Motion Compensation in H.263

10.5.2 Optional H.263 Coding Modes

10.5.3 H.263+ and H.263++

10.6 Exercises

11 MPEG Video Coding: MPEG-1, 2, 4, and 7

11.1 Overview

11.2 MPEG-1

11.2.1 Motion Compensation in MPEG-1

11.2.2 Other Major Differences from H.261

11.2.3 MPEG-1 Video Bitstream

11.3 MPEG-2

11.3.1 Supporting Interlaced Video

11.3.2 MPEG-2 Scalabilities

11.3.3 Other Major Differences from MPEG-1

11.4 MPEG-4

11.4.1 Overview of MPEG-4

11.4.2 Video Object-Based Coding in MPEG-4

11.4.3 Synthetic Object Coding in MPEG-4

11.4.4 MPEG-4 Parts, Profiles and Levels

11.5 MPEG-7

11.5.1 Descriptor (D)

11.5.2 Description Scheme (DS)

11.5.3 Description Definition Language (DDL)

11.6 Exercises

12 New Video Coding Standards: H.264 and H.265

12.1 H.264

12.1.1 Motion Compensation

12.1.2 Integer Transform

12.1.3 Quantization and Scaling

12.1.4 Examples of H.264 Integer Transform and Quantization

12.1.5 Intra Coding

12.1.6 In-Loop Deblocking Filtering

12.1.7 Entropy Coding

12.1.8 Context-Adaptive Variable Length Coding (CAVLC)

12.1.9 Context-Adaptive Binary Arithmetic Coding (CABAC)

12.1.10 H.264 Profiles

12.1.11 H.264 Scalable Video Coding

12.1.12 H.264 Multiview Video Coding

12.2 H.265

12.2.1 Motion Compensation

12.2.2 Integer Transform

12.2.3 Quantization and Scaling

12.2.4 Intra Coding

12.2.5 Discrete Sine Transform

12.2.6 In-Loop Filtering

12.2.7 Entropy Coding

12.2.8 Special Coding Modes

12.2.9 H.265 Profiles

12.3 Comparisons of Video Coding Efficiency

12.3.1 Objective Assessment

12.3.2 Subjective Assessment

12.4 Exercises

13 Basic Audio Compression Techniques

13.1 ADPCM in Speech Coding

13.1.1 ADPCM

13.2 G.726 ADPCM, G.727-9

13.3 Vocoders

13.3.1 Phase Insensitivity

13.3.2 Channel Vocoder

13.3.3 Formant Vocoder

13.3.4 Linear Predictive Coding (LPC)

13.3.5 Code Excited Linear Prediction (CELP)

13.3.6 Hybrid Excitation Vocoders*

13.4 Exercises

14 MPEG Audio Compression

14.1 Psychoacoustics

14.1.1 Equal-Loudness Relations

14.1.2 Frequency Masking

14.1.3 Temporal Masking

14.2 MPEG Audio

14.2.1 MPEG Layers

14.2.2 MPEG Audio Strategy

14.2.3 MPEG Audio Compression Algorithm

14.2.4 MPEG-2 AAC (Advanced Audio Coding)

14.2.5 MPEG-4 Audio

14.3 Other Audio Codecs

14.3.1 Ogg Vorbis

14.4 MPEG-7 Audio and Beyond

14.5 Further Exploration

14.6 Exercises

Part III Multimedia Communications and Networking

15 Network Services and Protocols for Multimedia Communications

15.1 Protocol Layers of Computer Communication Networks

15.2 Local Area Network and Access Networks

15.2.1 LAN Standards

15.2.2 Ethernet Technology

15.2.3 Access Network Technologies

15.3 Internet Technologies and Protocols

15.3.1 Network Layer: IP

15.3.2 Transport Layer: TCP and UDP

15.3.3 Network Address Translation and Firewall

15.4 Multicast Extension

15.4.1 Router-Based Architectures: IP Multicast

15.4.2 Non Router-Based Multicast Architectures

15.5 Quality-of-Service for Multimedia Communications

15.5.1 Quality of Service

15.5.2 Internet QoS

15.5.3 Rate Control and Buffer Management

15.6 Protocols for Multimedia Transmission and Interaction

15.6.1 HyperText Transfer Protocol

15.6.2 Real-Time Transport Protocol

15.6.3 RTP Control Protocol

15.6.4 Real-Time Streaming Protocol

15.7 Case Study: Internet Telephony

15.7.1 Signaling Protocols: H.323 and Session Initiation Protocol

15.8 Further Exploration

15.9 Exercises

16 Internet Multimedia Content Distribution

16.1 Proxy Caching

16.1.1 Sliding-Interval Caching

16.1.2 Prefix Caching and Segment Caching

16.1.3 Rate-Split Caching and Work-Ahead Smoothing

16.1.4 Summary and Comparison

16.2 Content Distribution Networks (CDNs)

16.2.1 Representative: Akamai Streaming CDN

16.3 Broadcast/Multicast Video-on-Demand

16.3.1 Smart TV and Set-Top Box (STB)

16.3.2 Scalable Multicast/Broadcast VoD

16.4 Broadcast/Multicast for Heterogeneous Users

16.4.1 Stream Replication

16.4.2 Layered Multicast

16.5 Application-Layer Multicast

16.5.1 Representative: End-System Multicast (ESM)

16.5.2 Multi-tree Structure

16.6 Peer-to-Peer Video Streaming with Mesh Overlays

16.6.1 Representative: CoolStreaming

16.6.2 Hybrid Tree and Mesh Overlay

16.7 HTTP-Based Media Streaming

16.7.1 HTTP for Streaming

16.7.2 Dynamic Adaptive Streaming Over HTTP (DASH)

16.8 Exercises

17 Multimedia Over Wireless and Mobile Networks

17.1 Characteristics of Wireless Channels

17.1.1 Path Loss

17.1.2 Multipath Fading

17.2 Wireless Networking Technologies

17.2.1 1G Cellular Analog Wireless Networks

17.2.2 2G Cellular Networks: GSM and Narrowband CDMA

17.2.3 3G Cellular Networks: Wideband CDMA

17.2.4 4G Cellular Networks and Beyond

17.2.5 Wireless Local Area Networks

17.2.6 Bluetooth and Short-Range Technologies

17.3 Multimedia Over Wireless Channels

17.3.1 Error Detection

17.3.2 Error Correction

17.3.3 Error-Resilient Coding

17.3.4 Error Concealment

17.4 Mobility Management

17.4.1 Network Layer Mobile IP

17.4.2 Link-Layer Handoff Management

17.5 Further Exploration

17.6 Exercises

Part IV Multimedia Information Sharing and Retrieval

18 Social Media Sharing

18.1 Representative Social Media Services

18.1.1 User-Generated Content Sharing

18.1.2 Online Social Networking

18.2 User-Generated Media Content Sharing

18.2.1 YouTube Video Format and Meta-data

18.2.2 Characteristics of YouTube Video

18.2.3 Small-World in YouTube Videos

18.2.4 YouTube from a Partner's View

18.2.5 Enhancing UGC Video Sharing

18.3 Media Propagation in Online Social Networks

18.3.1 Sharing Patterns of Individual Users

18.3.2 Video Propagation Structure and Model

18.3.3 Video Watching and Sharing Behaviors

18.3.4 Coordinating Live Streaming and Online Storage

18.4 Further Exploration

18.5 Exercises

19 Cloud Computing for Multimedia Services

19.1 Cloud Computing Overview

19.1.1 Representative Storage Service: Amazon S3

19.1.2 Representative Computation Service: Amazon EC2

19.2 Multimedia Cloud Computing

19.3 Cloud-Assisted Media Sharing

19.3.1 Impact of Globalization

19.3.2 Case Study: Netflix

19.4 Computation Offloading for Multimedia Services

19.4.1 Requirements for Computation Offloading

19.4.2 Service Partitioning for Video Coding

19.4.3 Case Study: Cloud-Assisted Motion Estimation

19.5 Interactive Cloud Gaming

19.5.1 Issues and Challenges of Cloud Gaming

19.5.2 Real-World Implementation

19.6 Further Exploration

19.7 Exercises

20 Content-Based Retrieval in Digital Libraries

20.1 How Should We Retrieve Images?

20.2 Synopsis of Early CBIR Systems

20.3 C-BIRD: A Case Study

20.3.1 Color Histogram

20.3.2 Color Density and Color Layout

20.3.3 Texture Layout

20.3.4 Texture Analysis Details

20.3.5 Search by Illumination Invariance

20.3.6 Search by Object Model

20.4 Quantifying Search Results

20.5 Key Technologies in Current CBIR Systems

20.5.1 Robust Image Features and Their Representation

20.5.2 Relevance Feedback

20.5.3 Other Post-processing Techniques

20.5.4 Visual Concept Search

20.5.5 The Role of Users in Interactive CBIR Systems

20.6 Querying on Videos

20.7 Querying on Videos Based on Human Activity

20.7.1 Modeling Human Activity Structures

20.7.2 Experimental Results

20.8 Quality-Aware Mobile Visual Search

20.8.1 Related Work

20.8.2 Quality-Aware Method

20.8.3 Experimental Results

20.9 Exercises

Index

Texts in Computer Science
Ze-Nian Li, Mark S. Drew, Jiangchuan Liu
Fundamentals of Multimedia
Second Edition

Texts in Computer Science
Editors: David Gries, Fred B. Schneider
For further volumes: http://www.springer.com/series/3191

Ze-Nian Li, Simon Fraser University, Vancouver, BC, Canada
Mark S. Drew, Simon Fraser University, Vancouver, BC, Canada
Jiangchuan Liu, Simon Fraser University, Vancouver, BC, Canada

Series editors
David Gries, Department of Computer Science, Cornell University, Ithaca, NY, USA
Fred B. Schneider, Department of Computer Science, Cornell University, Ithaca, NY, USA

Texts in Computer Science
ISSN 1868-0941
ISSN 1868-095X (electronic)
ISBN 978-3-319-05289-2
ISBN 978-3-319-05290-8 (eBook)
DOI 10.1007/978-3-319-05290-8
Springer Cham Heidelberg New York Dordrecht London
Library of Congress Control Number: 2014933390
1st Edition: © Prentice-Hall, Inc. 2004
© Springer International Publishing Switzerland 2014

This work is subject to copyright. All rights are reserved by the Publisher, whether the whole or part of the material is concerned, specifically the rights of translation, reprinting, reuse of illustrations, recitation, broadcasting, reproduction on microfilms or in any other physical way, and transmission or information storage and retrieval, electronic adaptation, computer software, or by similar or dissimilar methodology now known or hereafter developed. Exempted from this legal reservation are brief excerpts in connection with reviews or scholarly analysis or material supplied specifically for the purpose of being entered and executed on a computer system, for exclusive use by the purchaser of the work. Duplication of this publication or parts thereof is permitted only under the provisions of the Copyright Law of the Publisher’s location, in its current version, and permission for use must always be obtained from Springer. Permissions for use may be obtained through RightsLink at the Copyright Clearance Center. Violations are liable to prosecution under the respective Copyright Law.

The use of general descriptive names, registered names, trademarks, service marks, etc. in this publication does not imply, even in the absence of a specific statement, that such names are exempt from the relevant protective laws and regulations and therefore free for general use.

While the advice and information in this book are believed to be true and accurate at the date of publication, neither the authors nor the editors nor the publisher can accept any legal responsibility for any errors or omissions that may be made. The publisher makes no warranty, express or implied, with respect to the material contained herein.

Printed on acid-free paper

Springer is part of Springer Science+Business Media (www.springer.com)

To my mom, and my wife Yansin
Ze-Nian

To Noah, Ira, Eva and, especially, to Jenna
Mark

To my wife Jill, and my children
Jiangchuan

Preface

A course in Multimedia is rapidly becoming a necessity in Computer Science and Engineering curricula, especially now that multimedia touches most aspects of these fields. Multimedia was originally seen as a vertical application area, i.e., a niche application with methods that belong only to itself. However, like pervasive computing, with many people’s day regularly involving the Internet, multimedia is now essentially a horizontal application area and forms an important component of the study of algorithms, computer graphics, computer networks, image processing, computer vision, databases, real-time systems, operating systems, information retrieval, and so on. Multimedia is a ubiquitous part of the technological environment in which we work and think. This book fills the need for a university-level text that examines a good deal of the core agenda that Computer Science sees as belonging to this subject area.

This edition constitutes a significant revision, and we include an introduction to such current topics as 3D TV, social networks, high efficiency video compression and conferencing, wireless and mobile networks, and their attendant technologies. The textbook has been updated throughout to include recent developments in the field, including considerable added depth to the networking aspect of the book. To this end, Dr. Jiangchuan Liu has been added to the team of authors. While the first edition was published by Prentice-Hall, for this update we have chosen Springer, a prestigious publisher that has a superb and rapidly expanding array of Computer Science textbooks, particularly the excellent, dedicated, and long-running/established textbook series: Texts in Computer Science, of which this textbook now forms a part.

Multimedia has become associated with a certain set of issues in Computer Science and Engineering, and we address those here. The book is not an introduction to simple design considerations and tools; it serves a more advanced audience than that. On the other hand, the book is not a reference work; it is more a traditional textbook. While we perforce may discuss multimedia tools, we would like to give a sense of the underlying issues at play in the tasks those tools carry out. Students who undertake and succeed in a course based on this text can be said to really understand fundamental matters in regard to this material, hence the title of the text.

In conjunction with this text, a full-fledged course should also allow students to make use of this knowledge to carry out interesting or even wonderful practical projects in multimedia, interactive projects that engage and sometimes amuse and, perhaps, even teach these same concepts.

Who Should Read this Book?

This text aims at introducing the basic ideas used in multimedia, for an audience that is comfortable with technical applications, e.g., Computer Science students and Engineering students. The book aims to cover an upper-level undergraduate multimedia course, but could also be used in more advanced courses. Indeed, a (quite long) list of courses making use of the first edition of this text includes many undergraduate courses as well as use as a pertinent point of departure for graduate students who may not have encountered these ideas before in a practical way. As well, the book would be a good reference for anyone, including those in industry, who are interested in current multimedia technologies.

The text mainly presents concepts, not applications. A multimedia course, on the other hand, teaches these concepts, and tests them, but also allows students to utilize skills they already know, in coding and presentation, to address problems in multimedia. The accompanying website materials for the text include some code for multimedia applications along with some projects students have developed in such a course, plus other useful materials best presented in electronic form. The ideas in the text drive the results shown in student projects. We assume that the reader knows how to program, and is also completely comfortable learning yet another tool. Instead of concentrating on tools, however, the text emphasizes what students do not already know. Using the methods and ideas collected here, students are also enabled to learn more themselves, sometimes in a job setting: it is not unusual for students who take the type of multimedia course this text aims at to go on to jobs in multimedia-related industry immediately after their senior year, and sometimes before.
The selection of material in the text addresses real issues that these learners will be facing as soon as they show up in the workplace. Some topics are simple, but new to the students; some are somewhat complex, but unavoidable in this emerging area.

Have the Authors Used this Material in a Real Class?

Since 1996, we have taught a third-year undergraduate course in Multimedia Systems based on the introductory materials set out in this book. A one-semester course very likely could not include all the material covered in this text, but we have usually managed to consider a good many of the topics addressed, with mention made of a selected number of issues in Parts 3 and 4, within that time frame.

As well, over the same time period and again as a one-semester course, we have also taught a graduate-level course using notes covering topics similar to the ground covered by this text, as an introduction to more advanced materials. A fourth-year or graduate-level course would do well to discuss material from the first three Parts of the book and then consider some material from the last Part, perhaps in conjunction with some of the original research references included here along with results presented at topical conferences.

We have attempted to fill both needs, concentrating on an undergraduate audience but including more advanced material as well. Sections that can safely be omitted on a first reading are marked with an asterisk in the Table of Contents.

What is Covered in this Text?

In Part 1, Introduction and Multimedia Data Representations, we introduce some of the notions included in the term Multimedia, and look at its present as well as its history. Practically speaking, we carry out multimedia projects using software tools, so in addition to an overview of multimedia software tools we get down to some of the nuts and bolts of multimedia authoring. The representation of data is critical in the study of multimedia, and we look at the most important data representations for use in multimedia applications. Specifically, graphics and image data, video data, and audio data are examined in detail. Since color is vitally important in multimedia programs, we see how this important area impacts multimedia issues.

In Part 2, Multimedia Data Compression, we consider how we can make all this data fly onto the screen and speakers. Multimedia data compression turns out to be a very important enabling technology that makes modern multimedia systems possible. Therefore we look at lossless and lossy compression methods, supplying the fundamental concepts necessary to fully understand these methods. For the latter category, lossy compression, arguably JPEG still-image compression standards, including JPEG2000, are the most important, so we consider these in detail. But since a picture is worth 1,000 words, and so video is worth more than a million words per minute, we examine the ideas behind the MPEG standards MPEG-1, MPEG-2, MPEG-4, MPEG-7, and beyond into new video coding standards H.264 and H.265. Audio compression is treated separately and we consider some basic audio and speech compression techniques and take a look at MPEG Audio, including MP3 and AAC.

In Part 3, Multimedia Communications and Networking, we consider the great demands multimedia communication and content sharing place on networks and systems. We go on to consider wired Internet and wireless mobile network technologies and protocols that make interactive multimedia possible. We consider current multimedia content distribution mechanisms, an introduction to the basics of wireless mobile networks, and problems and solutions for multimedia communication over such networks.
