| 1 Functions | 2 Reference Model | 3 Input/Output Data |
| 4 SubAIMs | 5 JSON Metadata | 6 Profiles |
| 7 Reference Software | 8 Conformance Testing | 9 Performance Assessment |
1 Functions
- Receives the Spherical Harmonics Decomposition Coefficients of the sound field.
- Detects the directions of active sound sources (each either a speech or a non-speech signal) and separates them.
- Produces the Transformed Speech and Audio Scene Geometry.
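The direction-detection step can be illustrated with a minimal sketch. It is not the method of this AIM: it assumes real-valued, time-domain, first-order components (W, X, Y, Z) in place of the full Spherical Harmonics Decomposition Coefficients, a single plane-wave source, and a simple energy threshold for activity. It estimates the direction from the time-averaged pseudo-intensity vector. The function names, the encoding convention, and the threshold value are all hypothetical, and source separation is not shown.

```python
import math


def encode_first_order(signal, azimuth, elevation):
    """Encode a mono signal as a plane wave in first-order components (W, X, Y, Z).

    Hypothetical helper used only to generate test input; angles are in radians.
    """
    cx = math.cos(azimuth) * math.cos(elevation)
    cy = math.sin(azimuth) * math.cos(elevation)
    cz = math.sin(elevation)
    w = list(signal)
    x = [s * cx for s in signal]
    y = [s * cy for s in signal]
    z = [s * cz for s in signal]
    return w, x, y, z


def estimate_direction(w, x, y, z, energy_threshold=1e-6):
    """Return (azimuth, elevation) in radians, or None if the frame is inactive.

    The threshold is an assumed value, not taken from the specification.
    """
    n = len(w)
    if n == 0:
        return None
    # Activity check: mean energy of the omnidirectional component.
    energy = sum(s * s for s in w) / n
    if energy < energy_threshold:
        return None
    # Pseudo-intensity vector: correlation of W with each dipole component.
    ix = sum(a * b for a, b in zip(w, x))
    iy = sum(a * b for a, b in zip(w, y))
    iz = sum(a * b for a, b in zip(w, z))
    azimuth = math.atan2(iy, ix)
    elevation = math.atan2(iz, math.hypot(ix, iy))
    return azimuth, elevation


# Usage: a 440 Hz tone sampled at 16 kHz, placed at 60 degrees azimuth, 20 degrees elevation.
tone = [math.sin(2 * math.pi * 440 * t / 16000) for t in range(512)]
frame = encode_first_order(tone, math.radians(60), math.radians(20))
az, el = estimate_direction(*frame)
print(round(math.degrees(az), 1), round(math.degrees(el), 1))
```

With several simultaneous sources the averaged vector points to an energy-weighted mixture of directions, which is why a real implementation would work per time-frequency bin and use higher-order coefficients.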
2 Reference Model

3 Input/Output Data
| Input data | Semantics |
| Spherical Harmonics Decomposition Coefficients | Result of the transformation of Transform Multichannel Audio into the spherical frequency domain. |
| Source Model KB info | Discrete-time and discrete-valued simple acoustic source models used in source separation. |
| Output data | Semantics |
| Transform Audio | Audio in the Transform domain. |
| Audio Scene Geometry | Geometry of the Audio Scene. |
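As a rough illustration of how the tables above might be used in an implementation, the sketch below checks that a payload carries every named input or output. The port names are copied from the tables; the dictionary representation, the constant names, and the `missing_ports` helper are assumptions for illustration, not part of the specification.

```python
# Port names as listed in the Input/Output Data tables.
INPUT_PORTS = (
    "Spherical Harmonics Decomposition Coefficients",
    "Source Model KB info",
)
OUTPUT_PORTS = (
    "Transform Audio",
    "Audio Scene Geometry",
)


def missing_ports(payload, required):
    """Return the required port names absent from payload, in table order."""
    return [name for name in required if name not in payload]


# Usage: a payload that lacks the source model information.
inputs = {"Spherical Harmonics Decomposition Coefficients": [[0.0]]}
print(missing_ports(inputs, INPUT_PORTS))  # ['Source Model KB info']
```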
4 SubAIMs
No SubAIMs.
5 JSON Metadata
https://schemas.mpai.community/CAE1/V2.4/AIMs/SpeechDetectionAndSeparation.json