INTERNATIONAL ORGANISATION FOR STANDARDISATION
ORGANISATION INTERNATIONALE DE NORMALISATION
ISO/IEC JTC1/SC29/WG11
CODING OF MOVING PICTURES AND AUDIO
ISO/IEC JTC1/SC29/WG11
MPEG 98/N2464
October 1998, Atlantic City, USA
Title: |
MPEG-7 Proposal Package Description (PPD) |
Source: |
Requirements Group |
1. Introduction
More and more audiovisual information is available in digital form, in various places around the world. Along with the information, people appear that want to use it. However, before any information can be used, it will have to be located. At the same time, the increasing availability of potentially interesting material makes this search harder. Currently, solutions exist that allow searching for textual information. Many text-based search engines are available on the World Wide Web, and they are among the most visited sites - indicating a real demand. The same is, however, not possible for audiovisual content, as no generally recognized description of this material exists. In general, it is not possible to efficiently search the web for, say, a picture of ‘the Motorbike from Terminator II’, or to search a sequence where "King Lear congratulates his assistants on the night after the battle," or to search for "twenty minutes of video according to my preferences of today". In specific cases, solutions do exist. Multimedia databases on the market today allow searching for pictures using characteristics like color, texture and information about the shape of objects in the picture. One could envisage a similar example for audio, in which one can whistle a melody to find a song. While the capability to search for relevant information is a direct result of having standardised descriptions, several other applications can be envisaged. For example, one may wish to generate a paragraph of text describing an image or generate a different presentation of a video using its description.
MPEG started a new work item to provide a solution to the questions described above. The new member of the MPEG family, called "Multimedia Content Description Interface" (in short ‘MPEG-7’), will extend the limited capabilities of proprietary solutions in identifying content that exist today, notably by including more data types. In other words: MPEG-7 will specify a standard set of descriptors that can be used to describe various types of multimedia information. This description shall be associated with the content itself, to allow fast and efficient searching for material of a user’s interest. AV material that has MPEG-7 data associated with it, can be indexed and searched for. This ‘material’ may include: still pictures, graphics, 3D models, audio, speech, video, and information about how these elements are combined in a multimedia presentation (‘scenarios’, composition information). Special cases of these general data types may include facial expressions and personal characteristics.
The question of finding content is not restricted to database retrieval applications, which can be viewed as "pull" applications, but the question of finding content also exists in many other areas. For instance, there is an increasing amount of (digital) broadcast channels available, and this makes it harder to select the broadcast channel (radio or TV) that is potentially interesting. These applications of finding content within broadcast channels are referred to as "push" applications. MPEG-7 will address both "pull" and ‘push" applications. While searhing and filtering constitute a large class of applications, MPEG-7 does not limit itself to such applications alone. MPEG-7 descriptions will support other applications which can exploit a standardized description.
Providing novel solutions addressing text-only documents (i.e. standardising descriptions for text-only documents) is not among the goals of MPEG-7. The emphasis is on description of audiovisual content. The content may include text, or references to text, in addition to audiovisual information. MPEG-7 will consider existing solutions developed by other standardization groups for text only documents and support them as appropriate.
2. MPEG-7 Framework
MPEG-7 will standardize the description of Audio-Visual Content, but will not standardize the algorithms for extracting the descriptions or searching / filtering using these descriptions. For details on the background and goals of MPEG-7 please refer to MPEG-7: Context and Objectives document [2].
MPEG-7 will address applications that can be stored (on-line or off-line) or streamed (e.g. broadcast, push models on the Internet), and can operate in both real-time and non real-time environments. A ‘real-time environment’ means that information is associated with the content while it is being captured.
MPEG-7 will standardize:
For a definition of the terms please refer to the MPEG-7 Requirements document [1].
It may be pointed out that while MPEG-7 aims to standardize a "Multimedia Content Description Interface, the emphasis of MPEG is on audio-visual content. That is, MPEG-7 does not aim to create description schemes or descriptors for text medium. However, MPEG-7 will consider existing solutions for describing text documents and support them as appropriate with suitable, necessary interfaces between audio-visual-content descriptions and the textual-content descriptions. Specifically MPEG-7 will:
MPEG-7 will study existing schemes for multimedia content description and act as appropriate based on the requirements of MPEG-7. However, the need to comply or inter-operate with existing database systems describing multimedia content will not be a constraint on MPEG-7.
For more details regarding the MPEG-7 background, goals, areas of interest, and work plan please refer to MPEG-7: Context and Objectives document [ 2] . A detailed list of functionality and requirements of MPEG-7 may be found in the MPEG-7 Requirements document [1]. For a list of potential applications which may be enabled by MPEG-7 please refer to the MPEG-7 Applications document [3].
3. What is called for
MPEG calls for the following tools in order to develop the MPEG-7 standard:
A. For the normative part:
While descriptors, description schemes, DDL and coding schemes proposals will be evaluated in February 1999 following the procedures defined in the MPEG-7 Evaluation document [4], the proposals addressing MPEG-7 systems tools will not be part of the MPEG-7 Evaluation process in February 1999. These tools will be considered at the Seoul MPEG meeting. The proposers are kindly invited to present and show a demo regarding these tools at the Seoul MPEG meeting, in March 1999 [4].
B. For the development of the standard (to be used in the eXperimentation Model (XM) and play an important role in the standardization process):
It should be noted that it is not necessary for a proposal to address all the elements called above. It is possible for a proposal, for example, to only propose descriptors for some set of features, or description schemes or parts of description schemes. The major purpose of the MPEG-7 evaluation process is to gather relevant technology for the collaborative development of the MPEG-7 standard.
Selected aspects of different proposals will be incorporated into a common model during the collaborative phase of the standard with the goal of building the best possible model. The commom model called the experimentation model (XM) may be implemented in more than one platform/architecture and may be implemented in different parts to satisfy different requirements and/or application areas.
All the proposals will have to be pre-registered by the 1st December 1998 (see [4]). Those intending to answer the MPEG-7 Call for Proposals must follow the guidelines specified in the MPEG-7 Evaluation document [4].
The proposer should provide:
Note:Those intending to participate in the Ad Hoc Group Evaluation meeting should register by the 16th December with the host (see [5]).
MPEG-7 Content Set and its Distribution
The MPEG-7 content set has been defined at the MPEG October 1998 meeting. The list of the content set items is included in doc. N2467 and in annex H of the MPEG-7 Evaluation document [4]. The conditons of use of the MPEG-7 content set are include doc. N2466 and in annex I of the MPEG-7 Evaluation document. The distribution procedure regarding the MPEG-7 content set is defined in doc. 2468.
4. Method of Developing the Standard
The major processes in the development of the standard are as follows:
5. MPEG-7 Time Schedule
October 16, 1998 |
Call for Proposals Final version of Proposal Package Description Final version of the MPEG-7 Evaluation document |
December 1, 1998 |
Pre-registration of Proposals |
February 1, 1999 |
Proposals due |
February 15-19, 1999 |
Evaluation of proposals (in an Ad Hoc Group meeting) |
March 1999 |
First version of MPEG-7 eXperimentation Model |
December 1999 |
Working Draft |
October 2000 |
Committee Draft |
February 2001 |
Final Committee Draft |
July 2001 |
Final Draft International Standard |
September 2001 |
International Standard |
6. References