1 Definition 2 Functional Requirements 3 Syntax 4 Semantics

1 Definition

Audio-Visual Qualifier is a set of Data providing additional information on Audio-Visual Data for potential use by a machine.

The combination of Audio-Visual Data and Audio-Visual Qualifier is called Audio-Visual Object, specified by MPAI-OSD V1.3.

2 Functional Requirements

Asset Qualifier must allow the expression of the following Elements:

  1. Formats
    1. Content
      1. Speech Qualifiers with their Time information.
      2. Audio Qualifiers with their Time information.
      3. Visual Qualifiers with their Time information.
    2. Transport

Users needing additional entries in the Audio-Visual Qualifier or support of new Qualifiers should make a documented request to the MPAI Secretariat. Requests will be considered by the appropriate MPAI committee.

3 Syntax

https://schemas.mpai.community/TFA/V1.3/data/AudioVisualQualifier.json

4 Semantics

  1. Formats

    1. Content
      1. Speech Components
        1. Times
        2. Speech Qualifiers
      2. Audio Components
        1. Times
        2. Audio Qualifiers
      3. Visual Components
        1. Times
        2. Visual Qualifiers
    2. Transport
      1. Definition: the types  of data arrangement used to transport a Visual instance.
      2.  Methods
        1. File
          1. AVI
          2. EXIF
          3. MP4 (ISO/IEC 14496-12:2022)
        2. Stream
          1.  DASH (ISO/EC 23009-1:2022)
          2. HTTP Live Streaming
          3. WebRTC
          4. MPEG-2 TS (ISO/IEC 13818-1:2023)