1 Function | 2 Reference Model | 3 Input/Output Data |
4 SubAIMs | 5 JSON Metadata | 6 Profiles |
7 Reference Software | 8 Conformance Texting | 9 Performance Assessment |
1 Functions
- Receives the Spherical Harmonic Decomposition coefficients of the sound field
- Detects and directions of active sound sources (ither be a speech or a non-speech signal) and to separate them.
- Produces the Transformed Speech and Audio Scene Geometry..
2 Reference Model
3 Input/Output Data
Input data | Semantics |
Spherical Harmonics Decomposition Coefficients | Result of the transformation of Transform Multichannel Audio into the spherical frequency domain. |
Source Model KB info | Discrete-time and discrete-valued simple acoustic source models used in source separation. |
Output data | Semantics |
Transform Audio | Audio in the Transform domain. |
Audio Scene Geometry | Geometry of the Audio Scene. |
4 SubAIMs
No SubAIMs.
5 JSON Metadata
https://schemas.mpai.community/CAE1/V2.3/AIMs/SpeechDetectionAndSeparation.json