This AI Modules is being developed.

1     Functions

Audio-Visual Alignment (OSD-AVA):

  1. Receives
    1. Audio Scene Description
    2. Visual Scene Description.
  2. Produces Audio-Visual Scene Descriptors where Audio Objects and Visual Objects that share the same Spatial Attitude have the same or compatible Identifiers.

2      Reference Architecture

Figure 1 depicts the Reference Architecture of the Audio-Visual Alignment AIM.

Figure 1 – Audio-Visual Alignment AIM

3      I/O Data

Table 1 specifies the Input and Output Data of the Audio-Visual Alignment AIM.

Table 1 – I/O Data of the Audio-Visual Alignment AIM

Input Description
Audio Scene Descriptors The digital representation of the spatial arrangement of the Visual Objects of the Scene.
Visual Scene Descriptors The digital representation of the Spatial arrangement of the Audio Objects of the Scene.
Output Description
Audio-Visual Scene Descriptors The digital representation of the Spatial arrangement of the Audio, Visual and Audio-Visual Objects of the Scene.

5      SubAIMs

No SubAIMs.

6     JSON Metadata

https://schemas.mpai.community/OSD/V1.1/AIMs/AudioVisualAlignment.json

6     Profiles

No profile