MPAI-OSD V1.1 AIM Audio-Visual Alignment

This AI Module is being developed.

Audio-Visual Alignment (OSD-AVA):

Receives
1. Audio Scene Descriptors
2. Visual Scene Descriptors

Produces Audio-Visual Scene Descriptors in which Audio Objects and Visual Objects that share the same Spatial Attitude have the same or compatible Identifiers.
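
The alignment described above can be illustrated with a minimal Python sketch. It is not part of the specification: the `SceneObject` type, the `align` function, the `tolerance` parameter, and the reduction of Spatial Attitude to a 3D position are all assumptions made for illustration. The sketch gives each Audio Object the Identifier of the nearest Visual Object when the two lie within the tolerance, and leaves unmatched objects unchanged.

```python
from dataclasses import dataclass, replace
from math import dist


@dataclass(frozen=True)
class SceneObject:
    """Hypothetical stand-in for an Audio Object or a Visual Object."""
    identifier: str
    position: tuple  # (x, y, z); simplified stand-in for a Spatial Attitude


def align(audio_objects, visual_objects, tolerance=0.5):
    """Return a dict with aligned audio objects and the unchanged visual objects.

    An audio object takes the identifier of its nearest visual object when
    their positions differ by at most `tolerance` (an assumed threshold).
    """
    aligned_audio = []
    for audio in audio_objects:
        # Nearest visual object by Euclidean distance, or None if there are none.
        nearest = min(
            visual_objects,
            key=lambda visual: dist(audio.position, visual.position),
            default=None,
        )
        if nearest is not None and dist(audio.position, nearest.position) <= tolerance:
            aligned_audio.append(replace(audio, identifier=nearest.identifier))
        else:
            aligned_audio.append(audio)
    return {"audio": aligned_audio, "visual": list(visual_objects)}


audio = [SceneObject("A1", (0.0, 0.0, 1.0)), SceneObject("A2", (5.0, 0.0, 0.0))]
visual = [SceneObject("V1", (0.1, 0.0, 1.0)), SceneObject("V2", (-3.0, 2.0, 0.0))]
scene = align(audio, visual)
```

In this example the first Audio Object lies close to `V1` and takes its Identifier, while the second has no Visual Object within the tolerance and keeps its own. A real implementation would also have to compare orientation and handle several audio sources mapping to the same Visual Object.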

Figure 1 depicts the Reference Architecture of the Audio-Visual Alignment AIM.

Figure 1 – Audio-Visual Alignment AIM

Table 1 specifies the Input and Output Data of the Audio-Visual Alignment AIM.

Table 1 – I/O Data of the Audio-Visual Alignment AIM

Input	Description
Audio Scene Descriptors	The digital representation of the Spatial arrangement of the Audio Objects of the Scene.
Visual Scene Descriptors	The digital representation of the Spatial arrangement of the Visual Objects of the Scene.
Output	Description
Audio-Visual Scene Descriptors	The digital representation of the Spatial arrangement of the Audio, Visual and Audio-Visual Objects of the Scene.

No SubAIMs.

No Profiles.
