popcornell
/

FasNetTAC-paper

Model card Files Files and versions Community

popcornell commited on Feb 23, 2021

Commit

2adcca6

•

1 Parent(s): 653c878

Initial commit

Files changed (2) hide show

README.md +63 -0
pytorch_model.bin +3 -0

README.md ADDED Viewed

	@@ -0,0 +1,63 @@

+---
+tags:
+- asteroid
+- audio
+- FasNet-TAC
+- audio-source-separation
+- multichannel
+- beamforming
+datasets:
+- TACDataset
+- sep_noisy
+license: cc-by-sa-3.0
+inference: false
+---
+## Asteroid model `Samuele Cornell/FasNetTAC_TACDataset_separatenoisy`
+Imported from [Zenodo](https://zenodo.org/record/4557489)
+### Description:
+This model was trained by popcornell using the TAC/TAC recipe in Asteroid. It was trained on the separate_noisy task of the TACDataset dataset.
+### Training config:
+```yaml
+data:
+    dev_json: ./data/validation.json
+    sample_rate: 16000
+    segment: None
+    test_json: ./data/test.json
+    train_json: ./data/train.json
+net:
+    chunk_size: 50
+    context_ms: 16
+    enc_dim: 64
+    feature_dim: 64
+    hidden_dim: 128
+    hop_size: 25
+    n_layers: 4
+    n_src: 2
+    window_ms: 4
+optim:
+    lr: 0.001
+    weight_decay: 1e-06
+training:
+    accumulate_batches: 1
+    batch_size: 8
+    early_stop: True
+    epochs: 200
+    gradient_clipping: 5
+    half_lr: True
+    num_workers: 8
+    patience: 30
+    save_top_k: 10
+```
+### Results:
+```yaml
+si_sdr: 10.871864315894744
+si_sdr_imp: 11.322284052560262
+```
+### License notice:
+This work "FasNetTAC_TACDataset_separatenoisy" is a derivative of LibriSpeech ASR corpus by Vassil Panayotov, used under CC BY 4.0; of End-to-end Microphone Permutation and Number Invariant Multi-channel Speech Separation by Yi Luo, Zhuo Chen, Nima Mesgarani, Takuya Yoshioka, used under CC BY 4.0. "FasNetTAC_TACDataset_separatenoisy" is licensed under Attribution-ShareAlike 3.0 Unported by popcornell.

pytorch_model.bin ADDED Viewed

	@@ -0,0 +1,3 @@

+version https://git-lfs.github.com/spec/v1
+oid sha256:7efd07c16bba219c19cc499a2bce185bb0a1e6bab0c3855346303d3234185956
+size 11095294