MPAI-OSD V1.1 AIM Audio-Visual Scene Demultiplexing

Audio-Visual Scene Demultiplexing (OSD-SDX):

Receives	Audio-Visual Scene Descriptors
Demultiplexes	Audio-Visual Scene Descriptors
Produces	Speech Scene Geometry
	Audio Scene Geometry
	Visual Scene Geometry
	Speech Objects
	Audio Objects
	Visual Objects

Figure 1 depicts the Reference Model of the Audio-Visual Scene Demultiplexing AIM.

Figure 1 – Audio-Visual Scene Demultiplexing

Table 1 specifies the Input and Output Data of the of the Audio-Visual Scene Demultiplexing AIM.

Table 1 – I/O Data of the Audio-Visual Scene Demultiplexing AIM

Input	Description
Audio-Visual Scene Descriptors	The Descriptors of the Audio-Visual Scene.
Output	Description
Space-Time	Space-Time information of the Audio-Visual Scene
Speech Scene Geometry	The Descriptors of the Speech Scene.
Audio Scene Geometry	The Descriptors of the Audio Scene.
Visual Scene Geometry	The Descriptors of the Visual Scene.
Audio Object	The Audio Objects in the Scene.
Speech Object	The Speech Objects in the Scene.
Visual Object	The Visual Objects in the Scene.

No SubAIMs.

Cookie	Duration	Description
cookielawinfo-checkbox-necessary	1 year	This cookie is set by GDPR Cookie Consent plugin. The cookies is used to store the user consent for the cookies in the category "Technical".
CookieLawInfoConsent	1 year	The cookie is set by the GDPR Cookie Consent plug-in and is used to store whether the user has consented to the use of cookies or not. It does not store any personal data.
viewed_cookie_policy	1 year	The cookie is set by the GDPR Cookie Consent plugin and is used to store whether or not user has consented to the use of cookies. It does not store any personal data.
_pk_id.6.08a8	13 months	Used to store a few details about the user such as the unique visitor ID
_pk_ses.6.08a8	30 minutes	Short lived cookies used to temporarily store data for the visit