Videoconference Client Receiver

The Function of the Receiving Client is to:

Create the local Audio-Visual Scene using the Avatar Videoconference Server.
Place and animate the Avatar Models with their Spatial Attitudes.
Add Speech to Avatars’ mouths.
Render the Audio-Visual Scene as seen from the Participant-selected Point of View.

Figure 1 depicts the Reference Model of the Videoconference Client Receiver. Red text for data received at the start. This is the operation:

Figure 1 – Reference Model of Videoconference Client Receiver

Note: An implementation may decide to display text with a visual image for accessibility purposes.

Table 1 gives the input and output data of Videoconference Client Receiver.

Table 1 – Input and output data of Videoconference Client Receiver AIW

Input	Description
Point of View	Participant-selected point of view to see the Audio-Visual Scene.
Portable Avatars	Portable Avatars from Server.
Output	Description
Output Audio	Presented using loudspeaker (array)/earphones.
Output Visual	Presented using 2D or 3D display.

4 Functions of Videoconference Client Receiver’s AI Modules

Table 2 gives the AI Modules of Videoconference Client Receiver AIW.

Table 2 – Functions of Videoconference Client Receivers’ AI Modules

AIM	Input
Portable Avatar Demultiplexing	Extracts Avatar ID, Audio-Visual Scene Descriptors, Avatar Model, Spatial Attitude, Body Descriptors, Face Descriptors, and Input Speech from Portable Avatars.
Visual Scene Creation	Creates the Visual Scene and provides the Spatial Attitudes of the mouths of all Avatars.
Audio Scene Creation	Creates the Audio Scene.
Audio-Visual Scene Rendering	Provides a ready-to-rendered AV Scene.

Table 3 gives the AI Modules of Videoconference Receiving Client AIW.

Table 3 – I/O Data of Videoconference Client Receivers’ AI Modules

AIM	Input	Output
Portable Avatar Demultiplexing	Portable Avatar	Avatar ID Audio-Visual Scene Descriptors Avatar Model Spatial Attitude Body Descriptors Face Descriptors Input Speech
Visual Scene Creation	Avatar ID Audio-Visual Scene Descriptors Avatar Model Spatial Attitude Body Descriptors Face Descriptors	Visual Scene Descriptors Mouth Spatial Attitude
Audio Scene Creation	Avatar ID Input Speech Mouth Spatial Attitude	Audio Scene Descriptors
Audio-Visual Scene Rendering	Audio Scene Descriptors Visual Scene Descriptors Point of View	Output Audio Output Visual

Table 4 – AIMs and JSON Metadata

AIMs		Name	JSON
PAF-CRX		Videoconference Client Receiver	X
–	PAF-PDX	Portable Avatar Demultiplexing	X
–	PAF-VSC	Visual Scene Creation	X
–	PAF-ASC	Audio Scene Creation	X
–	PAF-AVR	Audio-Visual Scene Rendering	X

Cookie	Duration	Description
cookielawinfo-checkbox-necessary	1 year	This cookie is set by GDPR Cookie Consent plugin. The cookies is used to store the user consent for the cookies in the category "Technical".
CookieLawInfoConsent	1 year	The cookie is set by the GDPR Cookie Consent plug-in and is used to store whether the user has consented to the use of cookies or not. It does not store any personal data.
viewed_cookie_policy	1 year	The cookie is set by the GDPR Cookie Consent plugin and is used to store whether or not user has consented to the use of cookies. It does not store any personal data.
_pk_id.6.08a8	13 months	Used to store a few details about the user such as the unique visitor ID
_pk_ses.6.08a8	30 minutes	Short lived cookies used to temporarily store data for the visit