Go To MPAI-OSD AI Modules

1     Functions 2     Reference Model 3     Input/Output Data
4     SubAIMs 5     JSON Metadata 6     Profiles
7     Reference Software 8     Conformance Testing 9     Performance Assessment

1     Functions

Audio-Visual Scene Demultiplexing (OSD-SDX):

Receives       Audio-Visual Scene Descriptors
Demultiplexes  Audio-Visual Scene Descriptors
Produces       Speech Scene Geometry
               Audio Scene Geometry
               Visual Scene Geometry
               Speech Objects
               Audio Objects
               Visual Objects

2     Reference Model

Figure 1 depicts the Reference Model of the Audio-Visual Scene Demultiplexing (OSD-SDX) AIM.

Figure 1 – Audio-Visual Scene Demultiplexing (OSD-SDX) AIM Reference Model

3     Input/Output Data

Table 1 specifies the Input and Output Data of the Audio-Visual Scene Demultiplexing (OSD-SDX) AIM.

Table 1 – I/O Data of the Audio-Visual Scene Demultiplexing (OSD-SDX) AIM

Input                           Description
Audio-Visual Scene Descriptors  The Descriptors of the Audio-Visual Scene.
Output                          Description
Space-Time                      Space-Time information of the Audio-Visual Scene.
Speech Scene Geometry           The Descriptors of the Speech Scene.
Audio Scene Geometry            The Descriptors of the Audio Scene.
Visual Scene Geometry           The Descriptors of the Visual Scene.
Audio Object                    The Audio Objects in the Scene.
Speech Object                   The Speech Objects in the Scene.
Visual Object                   The Visual Objects in the Scene.
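
The demultiplexing described by Table 1 can be illustrated with a minimal Python sketch. It is not the reference implementation: the key names ("SpaceTime", "SpeechScene", "Geometry", "Objects" and so on) and the DemuxOutput container are assumptions made for illustration only; the actual layout is defined by the JSON schemas referenced in Section 5.

```python
from dataclasses import dataclass
from typing import Any, Dict, List


@dataclass
class DemuxOutput:
    """Hypothetical container for the seven outputs listed in Table 1."""
    space_time: Dict[str, Any]
    speech_scene_geometry: Dict[str, Any]
    audio_scene_geometry: Dict[str, Any]
    visual_scene_geometry: Dict[str, Any]
    speech_objects: List[Any]
    audio_objects: List[Any]
    visual_objects: List[Any]


def demultiplex(av_scene_descriptors: Dict[str, Any]) -> DemuxOutput:
    """Split one Audio-Visual Scene Descriptors record into its parts.

    All key names below are assumed, not taken from the MPAI schemas.
    Missing parts are returned as empty containers.
    """
    speech = av_scene_descriptors.get("SpeechScene", {})
    audio = av_scene_descriptors.get("AudioScene", {})
    visual = av_scene_descriptors.get("VisualScene", {})
    return DemuxOutput(
        space_time=av_scene_descriptors.get("SpaceTime", {}),
        speech_scene_geometry=speech.get("Geometry", {}),
        audio_scene_geometry=audio.get("Geometry", {}),
        visual_scene_geometry=visual.get("Geometry", {}),
        speech_objects=speech.get("Objects", []),
        audio_objects=audio.get("Objects", []),
        visual_objects=visual.get("Objects", []),
    )
```

The sketch only routes data that is already present in the input; it performs no analysis of the scene, which matches the role of a demultiplexer.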

4     SubAIMs

No SubAIMs.

5     JSON Metadata

https://schemas.mpai.community/OSD/V1.2/AIMs/AudioVisualSceneDemultiplexing.json

6     Profiles

No Profiles.

7     Reference Software

8     Conformance Testing

Table 2 provides the Conformance Testing Method for the OSD-SDX AIM as a Basic AIM.

If a schema contains references to other schemas, conformance of data for the primary schema implies that any data referencing a secondary schema shall also validate against the relevant schema, if present, and conform with the Qualifier, if present.
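
To apply the rule above, a test harness first needs to know which secondary schemas a primary schema references. The following Python sketch collects every "$ref" target (the standard JSON Schema referencing keyword) from a schema that has already been loaded as a dictionary. The function name collect_refs and the example URLs are hypothetical and not part of the MPAI specification.

```python
from typing import Any, Set


def collect_refs(schema: Any) -> Set[str]:
    """Return every "$ref" target found anywhere in a JSON Schema document.

    Simplification: any string-valued key named "$ref" is treated as a
    reference, including a data property that happens to have that name.
    """
    refs: Set[str] = set()
    if isinstance(schema, dict):
        for key, value in schema.items():
            if key == "$ref" and isinstance(value, str):
                refs.add(value)
            else:
                refs |= collect_refs(value)
    elif isinstance(schema, list):
        for item in schema:
            refs |= collect_refs(item)
    return refs


# Hypothetical primary schema with two secondary references.
primary = {
    "type": "object",
    "properties": {
        "SpeechObject": {"$ref": "https://example.org/SpeechObject.json"},
        "Items": {
            "type": "array",
            "items": {"$ref": "https://example.org/Qualifier.json"},
        },
    },
}
secondary = collect_refs(primary)
```

Each collected target would then be resolved and the corresponding data validated against it, repeating the process for references found in the secondary schemas.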

Table 2 – Conformance Testing Method for OSD-SDX AIM

Receives  Audio-Visual Scene Descriptors  Shall validate against AV Scene Descriptors schema.
Produces  Speech Scene Geometry           Shall validate against Speech Scene Geometry schema.
          Audio Scene Geometry            Shall validate against Audio Scene Geometry schema.
          Visual Scene Geometry           Shall validate against Visual Scene Geometry schema.
          Speech Objects                  Shall validate against Speech Objects schema.
                                          Speech Data shall conform with Qualifier.
          Audio Objects                   Shall validate against Audio Objects schema.
                                          Audio Data shall conform with Qualifier.
          Visual Objects                  Shall validate against Visual Objects schema.
                                          Visual Data shall conform with Qualifier.
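
The structure of Table 2 lends itself to a table-driven test harness: each data item carries a list of checks, and the item passes only if every check passes. The Python sketch below shows that structure under stated assumptions. The checks is_object and has_qualifier are simplified stand-ins; a real harness would replace them with JSON Schema validation against the schemas of Section 5 and with an actual Qualifier conformance test. All names are hypothetical.

```python
from typing import Any, Callable, Dict, List

Check = Callable[[Any], bool]


def run_conformance(data: Dict[str, Any],
                    checks: Dict[str, List[Check]]) -> Dict[str, bool]:
    """Apply every registered check to each data item.

    An item passes only if it is present and all of its checks pass.
    """
    report: Dict[str, bool] = {}
    for name, item_checks in checks.items():
        if name not in data:
            report[name] = False
            continue
        report[name] = all(check(data[name]) for check in item_checks)
    return report


# Stand-in checks (assumptions, not the MPAI validation rules).
def is_object(value: Any) -> bool:
    return isinstance(value, dict)


def has_qualifier(value: Any) -> bool:
    return isinstance(value, dict) and "Qualifier" in value


# Mirrors part of Table 2: geometry items need schema validation only,
# object items also need a Qualifier check.
checks = {
    "Speech Scene Geometry": [is_object],
    "Speech Objects": [is_object, has_qualifier],
    "Audio Objects": [is_object, has_qualifier],
}

produced = {
    "Speech Scene Geometry": {"n": 1},
    "Speech Objects": {"Qualifier": {}, "Data": "..."},
    "Audio Objects": {"Data": "..."},  # no Qualifier: fails the second check
}
report = run_conformance(produced, checks)
conforms = all(report.values())
```

Keeping the checks in a dictionary keyed by data item means the remaining rows of Table 2 can be added without changing the harness itself.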

9     Performance Assessment
