1     Functions

Audio-Visual Alignment (OSD-AVA):

Receives Audio Scene Descriptors
Visual Scene Descriptors
Aligns The Audio and Visual Objects sharing the same Spatial Attitude
Produces Audio-Visual Scene Descriptors where Audio Objects and Visual Objects having the same Spatial Attitude have the same or compatible Identifiers.

2      Reference Architecture

Figure 1 depicts the Reference Architecture of the Audio-Visual Alignment AIM.

Figure 1 – Audio-Visual Alignment AIM

3      I/O Data

Table 1 specifies the Input and Output Data of the Audio-Visual Alignment AIM.

Table 1 – I/O Data of the Audio-Visual Alignment AIM

Input Description
Audio Scene Descriptors The digital representation of the spatial arrangement of the Visual Objects of the Scene.
Visual Scene Descriptors The digital representation of the Spatial arrangement of the Audio Objects of the Scene.
Output Description
Audio-Visual Scene Descriptors The digital representation of the Spatial arrangement of the Audio, Visual and Audio-Visual Objects of the Scene.

5      SubAIMs

No SubAIMs.

6     JSON Metadata

https://schemas.mpai.community/OSD/V1.1/AIMs/AudioVisualAlignment.json

6     Profiles

No profile