MPEG-4


MPEG-4 is a method of defining compression of audio and visual digital data. It was introduced in late 1998 and designated a standard for a group of audio and video coding formats and related technology agreed upon by the ISO/IEC Moving Picture Experts Group under the formal standard ISO/IEC 14496 – Coding of audio-visual objects. Uses of MPEG-4 include compression of AV data for web and CD distribution, voice and broadcast television applications. The MPEG-4 standard was developed by a group led by Touradj Ebrahimi and Fernando Pereira.

Background

MPEG-4 absorbs many of the features of MPEG-1 and MPEG-2 and other related standards, adding new features such as VRML support for 3D rendering, object-oriented composite files, support for externally specified Digital Rights Management and various types of interactivity. AAC was standardized as an adjunct to MPEG-2 before MPEG-4 was issued.
MPEG-4 is still an evolving standard and is divided into a number of parts. Companies promoting MPEG-4 compatibility do not always clearly state which "part" level compatibility they are referring to. The key parts to be aware of are MPEG-4 Part 2 and MPEG-4 part 10.
Most of the features included in MPEG-4 are left to individual developers to decide whether or not to implement. This means that there are probably no complete implementations of the entire MPEG-4 set of standards. To deal with this, the standard includes the concept of "profiles" and "levels", allowing a specific set of capabilities to be defined in a manner appropriate for a subset of applications.
Initially, MPEG-4 was aimed primarily at low bit-rate video communications; however, its scope as a multimedia coding standard was later expanded. MPEG-4 is efficient across a variety of bit-rates ranging from a few kilobits per second to tens of megabits per second. MPEG-4 provides the following functions:
MPEG-4 provides a series of technologies for developers, for various service-providers and for end users:
The MPEG-4 format can perform various functions, among which might be the following:
MPEG-4 provides a large and rich set of tools for encoding.
Subsets of the MPEG-4 tool sets have been provided for use in specific applications.
These subsets, called 'Profiles', limit the size of the tool set a decoder is required to implement. In order to restrict computational complexity, one or more 'Levels' are set for each Profile. A Profile and Level combination allows:
MPEG-4 consists of several standards—termed "parts"—including the following :
PartNumberFirst public release date Latest public release date Latest amendmentTitleDescription
Part 1199920102014SystemsDescribes synchronization and multiplexing of video and audio. For example, the MPEG-4 file format version 1. The functionality of a transport protocol stack for transmitting and/or storing content complying with ISO/IEC 14496 is not within the scope of 14496-1 and only the interface to this layer is considered. Information about transport of MPEG-4 content is defined e.g. in MPEG-2 Transport Stream, RTP Audio Video Profiles and others.
Part 2199920042009VisualA compression format for visual data. One of the many "profiles" in Part 2 is the Advanced Simple Profile.
Part 3199920092017AudioA set of compression formats for perceptual coding of audio signals, including some variations of Advanced Audio Coding as well as other audio/speech coding formats and tools, Scalable Lossless Coding, Structured Audio, Text-To-Speech Interface
Part 4200020042016Conformance testingDescribes procedures for testing conformance to other parts of the standard.
Part 5200020012017Reference softwareProvides reference software for demonstrating and clarifying the other parts of the standard.
Part 619992000Delivery Multimedia Integration Framework
Part 720022004Optimized reference software for coding of audio-visual objectsProvides examples of how to make improved implementations.
Part 820042004Carriage of ISO/IEC 14496 contents over IP networksSpecifies a method to carry MPEG-4 content on IP networks. It also includes guidelines to design RTP payload formats, usage rules of SDP to transport ISO/IEC 14496-1-related information, MIME type definitions, analysis on RTP security and multicasting.
Part 920042009Reference hardware descriptionProvides hardware designs for demonstrating how to implement the other parts of the standard.
Part 10200320142016Advanced Video Coding A compression format for video signals which is technically identical to the ITU-T H.264 standard.
Part 1120052015Scene description and application engineCan be used for rich, interactive content with multiple profiles, including 2D and 3D versions. MPEG-4 Part 11 revised MPEG-4 Part 1 – ISO/IEC 14496-1:2001 and two amendments to MPEG-4 Part 1. It describes a system level description of an application engine and the Binary Format for Scene and the Extensible MPEG-4 Textual format – a textual representation of the MPEG-4 multimedia content using XML, etc.
Part 12200420152017ISO base media file formatA file format for storing time-based media content. It is a general format forming the basis for a number of other more specific file formats. It is technically identical to ISO/IEC 15444-12.
Part 1320042004Intellectual Property Management and Protection ExtensionsMPEG-4 Part 13 revised an amendment to MPEG-4 Part 1 – ISO/IEC 14496-1:2001/Amd 3:2004. It specifies common Intellectual Property Management and Protection processing, syntax and semantics for the carriage of IPMP tools in the bit stream, IPMP information carriage, mutual authentication for IPMP tools, a list of registration authorities required for the support of the amended specifications, etc. It was defined due to the lack of interoperability of different protection mechanisms for protecting and distributing copyrighted digital content such as music or video.
Part 14200320032010MP4 file formatIt is also known as "MPEG-4 file format version 2". The designated container file format for MPEG-4 content, which is based on Part 12. It revises and completely replaces Clause 13 of ISO/IEC 14496-1, in which the MPEG-4 file format was previously specified.
Part 1520042017Part 15: Carriage of network abstraction layer unit structured video in the ISO base media file formatFor storage of Part 10 video. File format is based on Part 12, but also allows storage in other file formats.
Part 16200420112016Animation Framework eXtension It specifies MPEG-4 Animation Framework eXtension model for representing 3D Graphics content. MPEG-4 is extended with higher-level synthetic objects for specifying geometry, texture, animation and dedicated compression algorithms.
Part 1720062006Streaming text formatTimed Text subtitle format
Part 18200420042014Font compression and streamingFor Open Font Format defined in Part 22.
Part 1920042004Synthesized texture streamSynthesized texture streams are used for creation of very low bitrate synthetic video clips.
Part 20200620082010Lightweight Application Scene Representation and Simple Aggregation Format LASeR requirements are fulfilled by building upon the existing the Scalable Vector Graphics format defined by the World Wide Web Consortium.
Part 2120062006MPEG-J Graphics Framework eXtensions Describes a lightweight programmatic environment for advanced interactive multimedia applications – a framework that marries a subset of the MPEG standard Java application environment with a Java API..
Part 22200720152017Open Font FormatOFFS is based on the OpenType version 1.4 font format specification, and is technically equivalent to that specification. Reached "CD" stage in July 2005, published as ISO standard in 2007
Part 2320082008Symbolic Music Representation Reached "FCD" stage in October 2006, published as ISO standard in 2008-01-28
Part 2420082008Audio and systems interactionDescribes the desired joint behavior of MPEG-4 File Format and MPEG-4 Audio.
Part 25200920113D Graphics Compression ModelDefines a model for connecting 3D Graphics Compression tools defined in MPEG-4 standards to graphics primitives defined in any other standard or specification.
Part 26201020102016Audio Conformance
Part 272009200920153D Graphics conformance3D Graphics Conformance summarizes the requirements, cross references them to characteristics, and defines how conformance with them can be tested. Guidelines are given on constructing tests to verify decoder conformance.
Part 2820122012Composite font representation
Part 2920142015Web video codingText of Part 29 is derived from Part 10 - ISO/IEC 14496-10. Web video coding is a technology that is compatible with the Constrained Baseline Profile of ISO/IEC 14496-10.
Part 3020142014Timed text and other visual overlays in ISO base media file formatIt describes the carriage of some forms of timed text and subtitle streams in files based on ISO/IEC 14496-12 - W3C Timed Text Markup Language 1.0, W3C WebVTT. The documentation of these forms does not preclude other definition of carriage of timed text or subtitles; see, for example, 3GPP Timed Text.
Part 31Under development Video Coding for BrowsersVideo Coding for Browsers - a video compression technology that is intended for use within World Wide Web browser
Part 32Under developmentConformance and reference software
Part 33Under developmentInternet video coding

Profiles are also defined within the individual "parts", so an implementation of a part is ordinarily not an implementation of an entire part.
MPEG-1, MPEG-2, MPEG-7 and MPEG-21 are other suites of MPEG standards.

MPEG-4 Levels

The low profile levels are part of the MPEG-4 video encoding/decoding constraints and are compatible with the older ITU H.261 standard, also compatible with former analog TV standards for broadcast and records. The ASP profile in its highest level is suitable for most usual DVD medias and players or for many online video sites, but not for Blu-ray records or online HD video contents.
More advanced profiles for HD media have been defined later in the AVC profile, which is functionally identical to the ITU H.264 standard but are now also integrated in MPEG-4 Part 10.

Licensing

MPEG-4 contains patented technologies, the use of which requires licensing in countries that acknowledge software algorithm patents. Over two dozen companies claim to have patents covering MPEG-4. MPEG LA licenses patents required for MPEG-4 Part 2 Visual from a wide range of companies and lists all of its licensors and licensees on the site. New licenses for MPEG-4 System patents are under development and no new licenses are being offered while holders of its old MPEG-4 Systems license are still covered under the terms of that license for the patents listed.
The majority of patents used for the MPEG-4 Visual format are held by three Japanese companies: Mitsubishi Electric, Hitachi, and Panasonic.