1     Version

V1.0

2     Functions

Audio-Visual Alignment (OSD-AVA):

  1. Receives:
    1. Audio Scene Geometry
    2. Visual Scene Geometry
  2. Produces Identifiers of the Audio Objects and Visual Objects that share the same Spatial Attitude.
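
The matching step described above can be illustrated with a minimal sketch. It is not the normative OSD-AVA algorithm. It assumes that a Spatial Attitude can be reduced to a position and an orientation, and that "same" means "equal within a tolerance". The names SceneObject, same_spatial_attitude and align, the units, and both tolerance values are hypothetical and do not come from the specification.

```python
from dataclasses import dataclass
import math


@dataclass(frozen=True)
class SceneObject:
    """Hypothetical stand-in for one object entry of a Scene Geometry."""
    object_id: str
    position: tuple[float, float, float]     # assumed: Cartesian x, y, z in metres
    orientation: tuple[float, float, float]  # assumed: yaw, pitch, roll in degrees


def same_spatial_attitude(a: SceneObject, b: SceneObject,
                          pos_tol: float = 0.25, ang_tol: float = 10.0) -> bool:
    """Return True if a and b agree in position and orientation within tolerances.

    The tolerances are illustrative assumptions, not values from the specification.
    """
    close = math.dist(a.position, b.position) <= pos_tol
    # Wrap each angular difference into [-180, 180) so that 355 and 0 degrees are 5 apart.
    aligned = all(
        abs((x - y + 180.0) % 360.0 - 180.0) <= ang_tol
        for x, y in zip(a.orientation, b.orientation)
    )
    return close and aligned


def align(audio_objects: list[SceneObject],
          visual_objects: list[SceneObject]) -> list[tuple[str, str]]:
    """Return (audio_id, visual_id) pairs whose Spatial Attitudes match."""
    pairs = []
    for audio in audio_objects:
        for visual in visual_objects:
            if same_spatial_attitude(audio, visual):
                pairs.append((audio.object_id, visual.object_id))
    return pairs


audio_scene = [
    SceneObject("A1", (1.0, 0.0, 2.0), (0.0, 0.0, 0.0)),
    SceneObject("A2", (5.0, 5.0, 5.0), (90.0, 0.0, 0.0)),
]
visual_scene = [
    SceneObject("V1", (1.1, 0.0, 2.0), (355.0, 0.0, 0.0)),
    SceneObject("V2", (-3.0, 0.0, 1.0), (0.0, 0.0, 0.0)),
]
matched = align(audio_scene, visual_scene)
print(matched)
```

With this sample data only A1 and V1 lie within both tolerances, so they are the only pair reported. A real implementation would read the objects from the Audio and Visual Scene Geometry data and would likely need a policy for ambiguous cases, for example when several Visual Objects fall within tolerance of one Audio Object.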

3      Reference Architecture

Figure 1 depicts the Reference Architecture of the Audio-Visual Alignment AIM.

Figure 1 – Audio-Visual Alignment AIM

4      I/O Data

Table 1 specifies the Input and Output Data of the Audio-Visual Alignment AIM.

Table 1 – I/O Data of the Audio-Visual Alignment AIM

Input                         Description
Visual Scene Geometry         The digital representation of the spatial arrangement of the Visual Objects of the Scene.
Audio Scene Geometry          The digital representation of the spatial arrangement of the Audio Objects of the Scene.
Output                        Description
Audio-Visual Scene Geometry   The digital representation of the spatial arrangement of the Audio, Visual and Audio-Visual Objects of the Scene.

5      SubAIMs

No SubAIMs.

6     JSON Metadata

https://schemas.mpai.community/OSD/V1.0/AIMs/AudioVisualAlignment.json