This AI Modules is being developed.
1 Functions
Audio-Visual Alignment (OSD-AVA):
- Receives
- Audio Scene Description
- Visual Scene Description.
- Produces Audio-Visual Scene Descriptors where Audio Objects and Visual Objects that share the same Spatial Attitude have the same or compatible Identifiers.
2 Reference Architecture
Figure 1 depicts the Reference Architecture of the Audio-Visual Alignment AIM.
Figure 1 – Audio-Visual Alignment AIM
3 I/O Data
Table 1 specifies the Input and Output Data of the Audio-Visual Alignment AIM.
Table 1 – I/O Data of the Audio-Visual Alignment AIM
Input | Description |
Audio Scene Descriptors | The digital representation of the spatial arrangement of the Visual Objects of the Scene. |
Visual Scene Descriptors | The digital representation of the Spatial arrangement of the Audio Objects of the Scene. |
Output | Description |
Audio-Visual Scene Descriptors | The digital representation of the Spatial arrangement of the Audio, Visual and Audio-Visual Objects of the Scene. |
5 SubAIMs
No SubAIMs.
6 JSON Metadata
https://schemas.mpai.community/OSD/V1.1/AIMs/AudioVisualAlignment.json
6 Profiles
No profile