1 Version
V1.1
2 Functions
Videoconference Client Transmitter (PAF-CTX)
- Receives Visual Scene.
- Produces Visual Scene Descriptors.
3 Reference Architecture
The Reference Architecture is depicted in Figure 1.

Figure 1 – The Videoconference Client Transmitter AIW
4 I/O Data
Table 1 specifies the Input and Output Data of the Visual Scene Description AIM.
Table 1 – I/O Data of the Videoconference Client Transmitter AIW
| Input | Description |
| Input Text | Chat text used by a human to communicate with Virtual Meeting Secretary or other participants |
| Language Preference | The language participant wishes to speak and hear. |
| Input Audio | Audio of Speech of participants in a meeting room. |
| Input Visual | Video of participants in a meeting room. |
| Avatar Model | The avatar model selected by the participant. |
| Output | Description |
| Speech Object | For authentication by Server. |
| Input Portable Avatar | Portable Avatar produced by Transmitting Client. |
| Face Object | For authentication by Server. |
5 JSON Metadata
https://schemas.mpai.community/PAF/V1.1/AIWs/VideoconferenceClientTransmitter.json
6 SubAIMs
| Audio-Visual Scene Description |
| Visual Scene Description |
| Audio-Visual Alignment |
| Automatic Speech Recognition |
| Natural Language Understanding |
| Personal Status Extraction |
| Personal Status Multiplexing |