| 1 Function | 2 Reference Model | 3 Input/Output Data | 
| 4 SubAIMs | 5 JSON Metadata | 6 Profiles | 
| 7 Reference Software | 8 Conformance Texting | 9 Performance Assessment | 
1 Functions
The Audio-Visual Scene Rendering (PAF-AVR) AIM
- Receives an input Point of View and all or some of the following input data: an input Portable Avatar, input Audio-Visual Scene Descriptors, and an input Spatial Attitude.
- Produces the Audio, Speech, and Visual components resulting from the rendering from the input Point of View of one of:
- The input Audio-Visual Scene Descriptors – if no input Portable Avatar is present.
- A speaking avatar constructed according to the data of the input Portable Avatar embedded in the input Potable Avatar’s Audio-Visual Scene Descriptors with the input Spatial Attitude – if the input Audio-Visual Scene Descriptors are not present.
- The input Audio-Visual Scene Descriptors that include an avatar constructed according to the data of the input Portable Avatar and embedded in the input Audio-Visual Scene Descriptors with the input Spatial Attitude – if both input Portable Avatar and input Audio-Visual Scene Descriptors are present.
 
| Receives | Portable Avatar | Jointly with or alternatively with AV Scene Descriptors. | 
| Audio-Visual Scene Descriptors | Alternative to or superseding that of the Portable Avatar. | |
| Spatial Attitude | Spatial Attitude of the Avatar in the Audio-Visual Scene. | |
| Point of View | To be used in rendering the scene and its objects. | |
| Transforms | Portable Avatar | Into generic Audio-Visual Scene Descriptors if input Portable Avatar is present. | 
| Produces | Portable Avatar’s Output Speech | Always integrated in the Audio-Visual Scene. Output Speech results from the rendering of Audio Scene Descriptors from human-selected Point of View. | 
| Output Audio | Resulting from the rendering of Audio Scene Descriptors from human-selected Point of View. | |
| Output Visual | Resulting from the rendering of Audio Scene Descriptors from human-selected Point of View. View Selector tells the OSD-AVR AIM where the visual components of the Portable Avatar should also be integrated. | 
2 Reference Model
Figure 1 specifies the Reference Model of the Audio-Visual Scene Rendering (PAF-AVR) AIM.

Figure 1 – The Audio-Visual Scene Rendering (PAF-AVR) AIM
3 Input/Output Data
Table 1 specifies the Input and Output Data of the Audio-Visual Scene Rendering (PAF-AVR) AIM.
Table 1 – I/O Data of the Audio-Visual Scene Rendering (PAF-AVR) AIM
| Input | Description | 
| Portable Avatar | Data produced, e.g., by Personal Status Display. | 
| AV Scene Descriptors | Audio-Visual Scene Descriptors. | 
| Point of View | Point from where an Entity perceives the Audio-Visual Scene | 
| Spatial Attitude | of the Avatar in the Audio-Visual Scene. | 
| Output | Description | 
| Output Speech Object | The Speech components of the Audio-Visual Scene. | 
| Output Audio Object | The Audio components of the Audio-Visual Scene. | 
| Output Visual Object | The Visual components of the Audio-Visual Scene. | 
4 SubAIMs
No SubAIMs.
5 JSON Metadata
https://schemas.mpai.community/PAF/V1.5/AIMs/AudioVisualSceneRendering.json
6 Profiles
The Profiles of Audio-Visual Scene Rendering are specified.
7 Reference Software
8 Conformance Testing
Table 2 provides the Conformance Testing Method for PAF-AVR AIM.
If a schema contains references to other schemas, conformance of data for the primary schema implies that any data referencing a secondary schema shall also validate against the relevant schema, if present and conform with the Qualifier, if present.
Table 2 – Conformance Testing Method for PAF-AVR AIM
| Receives | Portable Avatar | Shall validate against Point of View Schema. | 
| AV Scene Descriptors | Shall validate against AV Scene Descriptors Schema. | |
| Point of View | Shall validate against Portable Avatar Schema. Portable Avatar Data shall conform with respective Qualifiers. | |
| Spatial Attitude | Shall validate against Spatial Attitude Schema. | |
| Produces | Output Speech Object | Shall validate against Speech Object Schema. Speech Data shall conform with Speech Qualifier. | 
| Output Audio Object | Shall validate against Audio Object Schema. Audio Data shall conform with Audio Qualifier. | |
| Output Visual Object | Shall validate against Visual Object or 3D Model Schema. Visual Data shall conform with Visual Object. | 
