| Acron. | Definition | Acron. | Definition | ||
| # | 6DoF | Six Degree of Freedom | H | HCI | Motion Actuation Subsystem |
| A | AI | Artificial Intelligence | HD | High Definition | |
| AIF | AI Framework | HEVC | High-Efficiency Video Coding | ||
| AIM | AI Module | HMM | Hidden Markov Models | ||
| AIW | AI Workflow | HOA | Higher-Order Ambisonics | ||
| AMS | Autonomous Motion Subsystem | HPC | High Performance Computers | ||
| ANN | Artificial Neural Network | HRTF | Head-Related Transfer Function | ||
| API | Application Programming Interface | Hz | Hertz | ||
| AR | Augmented Reality | I | IP | Internet Protocol | |
| ARP | Audio Recording Preservation | IPR | Intellectual Property Right | ||
| ASR | Automatic Speech Recognition | ISDB | Integrated Services Digital Broadcasting | ||
| ATSC | Advanced Television Standard Committee | IT | Information Technology | ||
| AVC | Advanced Video Coding | L | LAT | Lexical Answer Type | |
| AVS | Audio Video Coding Standard | LSTM | Long Short-Term Memory | ||
| B | BART | Bidirectional and Auto-Regressive Transformer | M | MAS | Motion Actuation Subsystem |
| BE | Behaviour Engine | MCS | Mixed-reality Collaborative Spaces | ||
| BERT | Bidirectional Encoder Representations from Transformers | MCU | MicroController Unit | ||
| BWR | Basic World Representation | ML | Machine Learning | ||
| C | CAE | Context-based Audio Enhancement | MMC | MultiModal Conversation | |
| CAV | Connected Autonomous Vehicle | MQA | Multimodal Question Answering | ||
| CDVA | Compact Descriptors for Video Analysis | MSE | Mean Square Error | ||
| CDVS | Compact Descriptors for Visual Search | NE | Named Entity | ||
| CNN | Convolutional Neural Network | N | NLP | Natural Language Processing | |
| CPP | Company Performance Prediction | NN | Neural Network | ||
| CPU | Central Processing Unit | O | OBA | Object-Based Audio | |
| CT | Conformance Testing | OTT | Over-The-Top | ||
| CU | Coding Unit | P | PCC | Point Cloud Compression | |
| CUI | Compression and Understanding of Industrial Data | PA | Performance Assessment | ||
| CWE | Conversation With Emotion | PE | Physics Engine | ||
| D | dB | decibel | POS | Part Of Speech | |
| DB | Data Base | PSR | Perceptual Sound field Reconstruction | ||
| DC | Development Committee | PCA | Principal Component Analysis | ||
| DDSP | Differentiable Digital Signal Processing | Q | QA | Question Answering | |
| DNA | Deoxyribo-Nucleic Acid | R | RAM | Random Access Memory | |
| DNN | Deep Neural Networks | RE | Rules Engine | ||
| DP | Data Processing | RNN | Recurrent Neural Network | ||
| DSP | Digital Signal Processing | RS | Reference Software | ||
| DVB | Digital Video Broadcasting | S | SAT | Semantic Answer Type | |
| E | E2E | End-to-End | SD | Standard Definition (SD) | |
| EAE | Enhanced Audioconference Experience | SDO | Standards Developing Organisation | ||
| EES | Emotion-Enhanced Speech | SEP | Standard Essential Patent | ||
| EEV | End-to-End Video Coding | SFT | Spherical Fourier Transform | ||
| ESS | Environment Sensing Subsystem | SPG | Server-based Predictive multiplayer Gaming | ||
| EVC | Essential Video Coding | SRL | Semantic Role Labelling | ||
| F | FFT | Fast Fourier Transform | SRS | Speech Restoration System | |
| FFNN | Feed-Forward Neural Network | T | TS | Technical Specification | |
| FRAND | Fair, Reasonable and Non-Discriminatory | TTS | Text-To-Speech | ||
| FWL | Framework Licence | U | UHD | Ultra High Definition | |
| FWR | Full World Representation | UHF | Ultra High Frequency | ||
| G | GA | General Assembly | UST | Unidirectional Speech Translation | |
| GAN | Generative Adversarial Networks | V | VR | Virtual Reality | |
| GNSS | Global Navigation Satellite System | VVC | Versatile Video Coding | ||
| GPT | Generative Pre-trained Transformer | ||||
| GSE | Game State Engine |