1 Version
V1.1
2 Functions
Videoconference Client Receiver (PAF-CRX):
- Receives Portable Avatars conveying:
  - Participants’ Avatar Models and Language Preference.
  - Visual Scene Descriptors.
  - Avatars’ Spatial Attitudes.
  - Avatars’ Face and Body Descriptors and Speech Objects.
- Receives the Participant’s Point of View.
- Performs the following:
  - Instantiates the Audio-Visual Scene.
  - Places and animates the Avatar Models with their Spatial Attitudes.
  - Adds Speech to the Avatars’ mouths.
  - Renders the Audio-Visual Scene as seen from the Participant-selected Point of View.
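
The functions listed above can be illustrated with a minimal sketch. It is not the normative behaviour of the AIW: the `PortableAvatar` and `AVScene` classes, their fields, and the `build_scene` and `render` functions are hypothetical names introduced only for illustration. The Spatial Attitude is reduced to a position, the Speech Object to a string, and "rendering" is reduced to expressing each Avatar's position relative to the Point of View.

```python
from dataclasses import dataclass, field
from typing import Dict, List, Tuple

Vec3 = Tuple[float, float, float]


@dataclass
class PortableAvatar:
    """Hypothetical, heavily simplified stand-in for a Portable Avatar."""
    avatar_id: str
    language: str   # Language Preference
    position: Vec3  # position component of the Spatial Attitude only
    speech: str     # placeholder for a Speech Object


@dataclass
class AVScene:
    """Hypothetical scene container keyed by avatar_id."""
    positions: Dict[str, Vec3] = field(default_factory=dict)
    speech: Dict[str, str] = field(default_factory=dict)


def build_scene(avatars: List[PortableAvatar]) -> AVScene:
    scene = AVScene()  # instantiate the Audio-Visual Scene
    for avatar in avatars:
        scene.positions[avatar.avatar_id] = avatar.position  # place the Avatar
        scene.speech[avatar.avatar_id] = avatar.speech       # attach its Speech
    return scene


def render(scene: AVScene, point_of_view: Vec3) -> Dict[str, Tuple[Vec3, str]]:
    """Express each Avatar's position relative to the Point of View."""
    view = {}
    for avatar_id, pos in scene.positions.items():
        relative = tuple(p - v for p, v in zip(pos, point_of_view))
        view[avatar_id] = (relative, scene.speech[avatar_id])
    return view


avatars = [PortableAvatar("alice", "en", (1.0, 0.0, 2.0), "hello")]
view = render(build_scene(avatars), point_of_view=(0.0, 0.0, 1.0))
```

A real receiver would animate the Avatar Models from the Face and Body Descriptors and produce audio and visual output; the sketch only shows the order of the steps.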
3 Reference Architecture
The Reference Architecture is depicted in Figure 1.

Figure 1 – The Videoconference Client Receiver AIW
4 I/O Data
Table 1 specifies the Input and Output Data of the Videoconference Client Receiver AIW.
Table 1 – I/O Data of the Videoconference Client Receiver AIW
| Input | Description |
| Point of View | Point of view selected by the Participant for viewing the Audio-Visual Scene. |
| Portable Avatars | Portable Avatars received from the Server. |
| Output | Description |
| Output Audio | Audio presented through a loudspeaker (array) or earphones. |
| Output Visual | Visual output presented on a 2D or 3D display. |
5 SubAIMs
| Portable Avatar Demultiplexing |
| Visual Scene Creation |
| Audio Scene Creation |
| Audio-Visual Scene Rendering |
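
One way to reason about how these SubAIMs might be sequenced is a small dependency graph. The data flow below is an assumption inferred from the SubAIM names, not taken from the Reference Architecture: demultiplexing is assumed to feed both scene-creation SubAIMs, which in turn feed rendering. `DEPENDENCIES` and `execution_order` are hypothetical names.

```python
from graphlib import TopologicalSorter  # Python 3.9+

# Assumed data flow: each SubAIM maps to the SubAIMs whose output it needs.
DEPENDENCIES = {
    "Portable Avatar Demultiplexing": set(),
    "Visual Scene Creation": {"Portable Avatar Demultiplexing"},
    "Audio Scene Creation": {"Portable Avatar Demultiplexing"},
    "Audio-Visual Scene Rendering": {
        "Visual Scene Creation",
        "Audio Scene Creation",
    },
}


def execution_order(dependencies):
    """Return one valid order in which the SubAIMs could be executed."""
    return list(TopologicalSorter(dependencies).static_order())


order = execution_order(DEPENDENCIES)
```

Under this assumed graph, any valid order starts with Portable Avatar Demultiplexing and ends with Audio-Visual Scene Rendering; the two scene-creation SubAIMs may run in either order or in parallel.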
6 JSON Metadata
https://schemas.mpai.community/PAF/V1.1/AIWs/VideoconferenceClientReceiver.json