Upload README.md with huggingface_hub
Browse files
README.md
CHANGED
|
@@ -1,9 +1,57 @@
|
|
| 1 |
---
|
|
|
|
|
|
|
|
|
|
| 2 |
tags:
|
| 3 |
-
-
|
| 4 |
-
-
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| 5 |
---
|
| 6 |
|
| 7 |
-
|
| 8 |
-
|
| 9 |
-
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| 1 |
---
|
| 2 |
+
language:
|
| 3 |
+
- en
|
| 4 |
+
license: apache-2.0
|
| 5 |
tags:
|
| 6 |
+
- hearing loss
|
| 7 |
+
- challenge
|
| 8 |
+
- signal processing
|
| 9 |
+
- source separation
|
| 10 |
+
- audio
|
| 11 |
+
- audio-to-audio
|
| 12 |
+
- NonCausal
|
| 13 |
---
|
| 14 |
|
| 15 |
+
# Cadenza Challenge: CAD2-Task1
|
| 16 |
+
|
| 17 |
+
A NonCausal Viola/Others separation model for the CAD2-Task2 baseline system.
|
| 18 |
+
|
| 19 |
+
* Architecture: ConvTasNet (Kaituo XU) with multichannel support (Alexandre Defossez).
|
| 20 |
+
* Parameters:
|
| 21 |
+
* B: 256
|
| 22 |
+
* C: 2
|
| 23 |
+
* H: 512
|
| 24 |
+
* L: 20
|
| 25 |
+
* N: 256
|
| 26 |
+
* P: 3
|
| 27 |
+
* R: 3
|
| 28 |
+
* X: 8
|
| 29 |
+
* audio_channels: 2
|
| 30 |
+
* causal: false
|
| 31 |
+
* mask_nonlinear: relu
|
| 32 |
+
* norm_type: gLN
|
| 33 |
+
* training:
|
| 34 |
+
* sample_rate: 44100
|
| 35 |
+
* samples_per_track: 64
|
| 36 |
+
* segment: 5.0
|
| 37 |
+
* aggregate: 2
|
| 38 |
+
* batch_size: 4
|
| 39 |
+
* early_stop: true
|
| 40 |
+
* epochs: 200
|
| 41 |
+
|
| 42 |
+
|
| 43 |
+
## Dataset
|
| 44 |
+
The model was trained using EnsembleSet and CadenzaWoodwind datasets.
|
| 45 |
+
|
| 46 |
+
## How to use
|
| 47 |
+
|
| 48 |
+
```
|
| 49 |
+
from tasnet import ConvTasNetStereo
|
| 50 |
+
|
| 51 |
+
model = ConvTasNetStereo.from_pretrained(
|
| 52 |
+
"cadenzachallenge/ConvTasNet_Viola_NonCausal"
|
| 53 |
+
).cpu()
|
| 54 |
+
|
| 55 |
+
```
|
| 56 |
+
|
| 57 |
+
|