1 Version
V1.1
2 Functions
The Server:
- At the start:
  - Receives Speech Object of each Participant.
  - Authenticates Participants.
  - Receives Portable Avatars each containing Language Preference and Avatar Model.
  - Selects an Audio-Visual Scene.
  - Selects the Spatial Attitudes of the Avatar Models in the Audio-Visual Scene.
  - Selects the common meeting language.
  - Distributes all Portable Avatars each containing: Audio-Visual Scene, Language Preference, Avatar Model, and Spatial Attitude.
- During the videoconference:
  - Receives Participants’ and Virtual Meeting Secretary’s Speech Avatar Descriptors.
  - Translates Participants’ Speech according to their Language Preferences.
  - Sends Portable Avatars containing Avatar ID, Text, Speech translated to the common meeting language, Face Descriptors and Gesture Descriptors to the Virtual Meeting Secretary.
  - Receives the Virtual Meeting Secretary’s Portable Avatar containing Avatar ID, Text, Speech in the common meeting language, Face Descriptors and Gesture Descriptors.
  - Translates the Virtual Meeting Secretary’s Speech according to each Participant’s Language Preference.
  - Sends Participants’ and Virtual Meeting Secretary’s Portable Avatars containing Avatar ID, Text, Translated Speech, Face Descriptors and Gesture Descriptors to Client Receivers.
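The translation routing described in the list above can be sketched as follows. This is only an illustrative sketch under stated assumptions, not the normative behaviour: `PortableAvatar` is a hypothetical, much-simplified stand-in for the real data type, `translate` is a placeholder that merely tags the text, and choosing the most frequent Language Preference as the common meeting language is an assumed rule, since the list above does not say how the Server makes that selection.

```python
from collections import Counter
from dataclasses import dataclass, replace
from typing import Optional


@dataclass(frozen=True)
class PortableAvatar:
    # Hypothetical, simplified stand-in for the Portable Avatar data type.
    avatar_id: str
    language_preference: str
    text: str = ""
    speech_language: Optional[str] = None


def select_common_language(avatars):
    # Assumption: pick the most frequent Language Preference; ties go to
    # the preference encountered first. The selection rule is not given
    # in the text above.
    counts = Counter(a.language_preference for a in avatars)
    return counts.most_common(1)[0][0]


def translate(text, source, target):
    # Placeholder for Text and Speech Translation: it only tags the text
    # so that the routing logic can be observed.
    return text if source == target else f"[{source}->{target}] {text}"


def to_secretary(avatar, common_language):
    # A Participant's contribution is sent to the Virtual Meeting
    # Secretary in the common meeting language.
    return replace(
        avatar,
        text=translate(avatar.text, avatar.language_preference, common_language),
        speech_language=common_language,
    )


def to_receivers(speaker, source_language, receivers):
    # Each Client Receiver gets the speaker's avatar translated to that
    # receiver's own Language Preference; the speaker is skipped.
    return {
        r.avatar_id: replace(
            speaker,
            text=translate(speaker.text, source_language, r.language_preference),
            speech_language=r.language_preference,
        )
        for r in receivers
        if r.avatar_id != speaker.avatar_id
    }
```

The sketch separates the two translation directions on purpose: content bound for the Virtual Meeting Secretary is normalised to one language, while content bound for Client Receivers is fanned out per receiver.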
3 Reference Model
The Reference Model is depicted in Figure 1.
Figure 1 – The Avatar Videoconference Server AIW
4 I/O Data
Table 1 specifies the Input and Output Data of the Avatar Videoconference Server AIW.
Table 1 – I/O Data of the Avatar Videoconference Server AIW
| Inputs | Description |
| --- | --- |
| Summary | From Virtual Meeting Secretary |
| Audio-Visual Scene Descriptors | Set by Server |
| Spatial Attitude | Set by Server |
| Input+Virtual Secretary Portable Avatar | From Transmitting Clients and Virtual Meeting Secretary |
| Speech Objects | Participants’ Speech Objects for Authentication |
| Face Objects | Participants’ Face Objects for Authentication |

| Outputs | Description |
| --- | --- |
| Summary | As above |
| Portable Avatar | As re-multiplexed by Server |
5 JSON Metadata
https://schemas.mpai.community/PAF/V1.1/AIWs/AvatarVideoconferenceServer.json
6 SubAIMs
| Portable Avatar Demultiplexing |
| Text and Speech Translation |
| Service Participant Authentication |
| Portable Avatar Multiplexing |
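
One way to picture how the four SubAIMs listed above might fit together is the sketch below. It is a rough illustration only: the dictionary layout, the field names in `FIELDS`, the exact-match check in `authenticate`, and the `translator` callback are all assumptions introduced here, not part of the AIW definition. The actual interfaces are those given by the JSON Metadata.

```python
from typing import Callable, Dict, Tuple

# Hypothetical component names, taken from the Functions section.
FIELDS = ("avatar_id", "text", "speech", "face_descriptors", "gesture_descriptors")


def demultiplex(portable_avatar: Dict[str, object]) -> Dict[str, object]:
    # Portable Avatar Demultiplexing: expose the individual components;
    # components that are absent become None.
    return {name: portable_avatar.get(name) for name in FIELDS}


def authenticate(avatar_id: str, speech_object: bytes, face_object: bytes,
                 enrolled: Dict[str, Tuple[bytes, bytes]]) -> bool:
    # Service Participant Authentication (stub): exact match against
    # enrolled reference objects. A real service would compare biometric
    # features rather than raw bytes.
    return enrolled.get(avatar_id) == (speech_object, face_object)


def translate(components: Dict[str, object], target_language: str,
              translator: Callable[[str, str], str]) -> Dict[str, object]:
    # Text and Speech Translation: only the text is handled here; speech
    # would follow the same pattern with a speech translator.
    out = dict(components)
    if out["text"] is not None:
        out["text"] = translator(out["text"], target_language)
    out["language"] = target_language
    return out


def multiplex(components: Dict[str, object]) -> Dict[str, object]:
    # Portable Avatar Multiplexing: reassemble, dropping empty components.
    return {k: v for k, v in components.items() if v is not None}


def process(portable_avatar, target_language, translator):
    # Demultiplex, translate, then re-multiplex one Portable Avatar.
    return multiplex(translate(demultiplex(portable_avatar), target_language, translator))
```

Authentication is kept outside `process` because, per the Functions section, it happens once at the start of the session rather than for every Portable Avatar exchanged during the videoconference.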