1     Function 2     Reference Model 3     Input/Output Data
4     SubAIMs 5     JSON Metadata 6     Profiles
7     Reference Software 8     Conformance Texting 9     Performance Assessment

1     Functions

Audio-Visual Basic Scene Description (OSD-AVB):

Receives Space-Time.
Audio Objects.
Speech Objects.
Visual Objects.
Audio-Visual Objects.
Processes All Objects.
Creates Audio, Speech, Visual, and Audio-Visual Scene Descriptors from the Objects, if possible.
Combines The Scene Descriptors of the all Objects.
Produces Audio-Visual Visual Scene Descriptors

2     Reference Model

Figure 1 specifies the Reference Model of the Audio-Visual Basic Scene Description (OSD-AVB) AIM.

Figure 1 – The Audio-Visual Basic Scene Description (OSD-AVB) AIM

3    Input/Output Data

Table 1 specifies the Input and Output Data of the Audio-Visual Basic Scene Description (OSD-AVB).

Table 1 – I/O Data of the Audio-Visual Basic Scene Description (OSD-AVB) AIM

Input Description
SpaceTime Space-Time information of Objects.
Audio Objects Input Audio Objects.
Speech Objects Input Speech Objects.
Visual Objects Input Visual Objects.
Output Description
Audio-Visual Scene Descriptors The Audio-Visual Descriptors of the Scene.

4     SubAIMs

Audio-Visual Basic Scene Description (OSD-AVB) is a Composite AIM whose reference Model is depicted in Figure 2.

Figure 2 – Reference Model of Audio-Visual Basic Scene Description Composite AIM

Table 2 provides the AI Modules composing the AIM.

Table 2 – AI Modules of the Audio-Visual Basic Scene Description (OSD-AVB) AIM

AIM Acron. AIMs JSON
OSD-AVS Audio-Visual Basic Scene Description X
CAE-ABS Audio Basic Scene Description X
MMC-SBS Speech Basic Scene Description X
OSD-VBS Visual Basic Scene Description X
OSD-AV Audio-Visual Alignment X

5     JSON Metadata

https://schemas.mpai.community/OSD/V1.1/AIMs/AudioVisualBasicSceneDescription.json

6     Profiles

No Profiles.

7     Reference Software

8     Conformance Testing

9     Performance Assessment