1 Version

V1.1

2 Functions

Videoconference Client Transmitter (PAF-CTX)

  1. Receives Visual Scene.
  2. Produces Visual Scene Descriptors.

3 Reference Architecture

The Reference Architecture is depicted in Figure 1.

Figure 1 – The Videoconference Client Transmitter AIW

4 I/O Data

Table 1 specifies the Input and Output Data of the Videoconference Client Transmitter AIW.

Table 1 – I/O Data of the Videoconference Client Transmitter AIW

Input                 | Description
Input Text            | Chat text used by a human to communicate with the Virtual Meeting Secretary or other participants.
Language Preference   | The language the participant wishes to speak and hear.
Input Audio           | Audio of the speech of participants in a meeting room.
Input Visual          | Video of participants in a meeting room.
Avatar Model          | The avatar model selected by the participant.

Output                | Description
Speech Object         | For authentication by the Server.
Input Portable Avatar | Portable Avatar produced by the Transmitting Client.
Face Object           | For authentication by the Server.
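
The I/O Data listed in Table 1 can be pictured as two simple records. The following is a minimal sketch only: the class names, the snake_case field names, and the Python types (opaque `bytes` payloads, a language tag string) are assumptions made for illustration and are not defined by the specification.

```python
from dataclasses import dataclass, fields
from typing import Optional


@dataclass
class ClientTransmitterInputs:
    """Inputs named in Table 1; all types here are illustrative assumptions."""
    input_text: Optional[str]   # chat text; assumed optional since a participant may not type
    language_preference: str    # assumed to be a language tag such as "en"
    input_audio: bytes          # audio of participants' speech (opaque payload)
    input_visual: bytes         # video of participants (opaque payload)
    avatar_model: bytes         # avatar model selected by the participant


@dataclass
class ClientTransmitterOutputs:
    """Outputs named in Table 1; all types here are illustrative assumptions."""
    speech_object: bytes          # for authentication by the Server
    input_portable_avatar: bytes  # Portable Avatar produced by the Transmitting Client
    face_object: bytes            # for authentication by the Server


# Field names in declaration order, mirroring the rows of Table 1.
INPUT_NAMES = [f.name for f in fields(ClientTransmitterInputs)]
OUTPUT_NAMES = [f.name for f in fields(ClientTransmitterOutputs)]

# Hypothetical example instance with empty media payloads.
example_inputs = ClientTransmitterInputs(
    input_text="Hello",
    language_preference="en",
    input_audio=b"",
    input_visual=b"",
    avatar_model=b"",
)
```

The actual data formats are defined by the JSON metadata and the referenced data type specifications, not by this sketch.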

5 JSON Metadata

https://schemas.mpai.community/PAF/V1.1/AIWs/VideoconferenceClientTransmitter.json
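
The metadata URL above appears to be composed of a standard acronym, a version, a category, and the AIW name. The sketch below rebuilds it from those parts. The helper `metadata_url` and its parameter names are hypothetical, and the assumption that other MPAI schemas follow the same path pattern is not confirmed by this document.

```python
def metadata_url(standard: str, version: str, category: str, name: str) -> str:
    """Compose a schema URL from its parts (path pattern assumed from the single URL above)."""
    return f"https://schemas.mpai.community/{standard}/{version}/{category}/{name}.json"


# Reconstructs the URL given in this section from its apparent components.
url = metadata_url("PAF", "V1.1", "AIWs", "VideoconferenceClientTransmitter")
```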

6 SubAIMs

Audio-Visual Scene Description
Visual Scene Description
Audio-Visual Alignment
Automatic Speech Recognition
Natural Language Understanding
Personal Status Extraction
Personal Status Multiplexing