1 Version
V1.1
2 Functions
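The Videoconference Client Transmitter (PAF-CTX):
- Receives Input Text, Language Preference, Input Audio, Input Visual, and Avatar Model.
- Produces the Speech Object and Face Object that the Server uses for authentication, and a Portable Avatar.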
3 Reference Architecture
The Reference Architecture is depicted in Figure 1.
Figure 1 – The Videoconference Client Transmitter AIW
4 I/O Data
Table 1 specifies the Input and Output Data of the Videoconference Client Transmitter AIW.
Table 1 – I/O Data of the Videoconference Client Transmitter AIW
Input | Description |
Input Text | Chat text used by a human to communicate with the Virtual Meeting Secretary or other participants. |
Language Preference | The language the participant wishes to speak and hear. |
Input Audio | Audio of the speech of participants in a meeting room. |
Input Visual | Video of participants in a meeting room. |
Avatar Model | The avatar model selected by the participant. |
Output | Description |
Speech Object | For authentication by the Server. |
Input Portable Avatar | Portable Avatar produced by the Transmitting Client. |
Face Object | For authentication by the Server. |
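The I/O Data of Table 1 can be pictured as two records, one for what the AIW receives and one for what it produces. The following is only a minimal sketch: the class names, field names, and Python types are hypothetical placeholders chosen for illustration and are not the data formats defined by MPAI.

```python
from dataclasses import dataclass, fields
from typing import List, Optional


@dataclass
class TransmitterInputs:
    """Inputs listed in Table 1 (types are illustrative placeholders)."""
    input_text: Optional[str]   # chat text; assumed optional for this sketch
    language_preference: str    # language the participant speaks and hears
    input_audio: bytes          # audio of the speech of participants
    input_visual: bytes         # video of participants
    avatar_model: bytes         # avatar model selected by the participant


@dataclass
class TransmitterOutputs:
    """Outputs listed in Table 1 (types are illustrative placeholders)."""
    speech_object: bytes        # sent to the Server for authentication
    portable_avatar: bytes      # Portable Avatar produced by the client
    face_object: bytes          # sent to the Server for authentication


def io_names(record_type) -> List[str]:
    """Return the field names of one of the record types above, in order."""
    return [f.name for f in fields(record_type)]
```

Calling `io_names(TransmitterInputs)` lists the five inputs in the order of Table 1, which can serve as a quick check that a hypothetical payload covers every row.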
5 JSON Metadata
https://schemas.mpai.community/PAF/V1.1/AIWs/VideoconferenceClientTransmitter.json
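The JSON Metadata at the URL above can be retrieved and parsed with the Python standard library. This is a hedged sketch: the helper names `parse_metadata` and `fetch_metadata` are hypothetical, and the only assumption made about the file is that its top level is a JSON object; nothing about its internal keys is assumed.

```python
import json
from urllib.request import urlopen

# URL taken from this section of the specification.
METADATA_URL = (
    "https://schemas.mpai.community/PAF/V1.1/AIWs/"
    "VideoconferenceClientTransmitter.json"
)


def parse_metadata(raw: bytes) -> dict:
    """Decode raw bytes as UTF-8 JSON and require a top-level object.

    The top-level-object requirement is an assumption of this sketch.
    """
    document = json.loads(raw.decode("utf-8"))
    if not isinstance(document, dict):
        raise ValueError("expected a JSON object at the top level")
    return document


def fetch_metadata(url: str = METADATA_URL, timeout: float = 10.0) -> dict:
    """Download the metadata file and return it as a dictionary."""
    with urlopen(url, timeout=timeout) as response:
        return parse_metadata(response.read())
```

A caller could run `sorted(fetch_metadata().keys())` to inspect the top-level keys before relying on any particular field.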
6 SubAIMs
- Audio-Visual Scene Description
- Visual Scene Description
- Audio-Visual Alignment
- Automatic Speech Recognition
- Natural Language Understanding
- Personal Status Extraction
- Personal Status Multiplexing