Checkpoints and RIRs
Checkpoints
The release page has two model checkpoints. All checkpoints include “model_state_dict”, “optimizer_state_dict”, and some other meta information.
The first model checkpoint is the original model checkpoint at the 58th epoch. The performance is shown in this table:
With Reverb |
No Reverb |
|||||||
---|---|---|---|---|---|---|---|---|
Method |
WB-PESQ |
NB-PESQ |
SI-SDR |
STOI |
WB-PESQ |
NB-PESQ |
SI-SDR |
STOI |
FullSubNet |
2.987 |
3.496 |
15.756 |
0.926 |
2.889 |
3.385 |
17.635 |
0.964 |
In addition, some people are interested in the performance when using cumulative normalization. The below one is a pre-trained FullSubNet using cumulative normalization:
With Reverb |
No Reverb |
|||||||
---|---|---|---|---|---|---|---|---|
Method |
WB-PESQ |
NB-PESQ |
SI-SDR |
STOI |
WB-PESQ |
NB-PESQ |
SI-SDR |
STOI |
FullSubNet (Cumulative Norm) |
2.978 |
3.503 |
15.820 |
0.928 |
2.863 |
3.376 |
17.913 |
0.964 |
If you want to inference or fine-tune based on these checkpoints, please check the usage in the documents.
Room Impulse Responses
As mentioned in the paper, the room impulse responses (RIRs) come from the Multichannel Impulse Response Database and the Reverb Challenge dataset. Please download the zip package “RIR (Multichannel Impulse Response Database + The REVERB challenge).zip” from the release page if you would like to retrain the FullSubNet.
Note that the zip package includes a folder “rir” and a file “rir.txt.” The folder “rir” contains all separated single-channel RIRs extracted from the above two datasets. The suffix (e.g., “m_
For some cases, if you would like to extract channel by yourself, you can download these RIRs from pages:
The REVERB challenge data
https://reverb2014.dereverberation.com/tools/reverb_tools_for_Generate_mcTrainData.tgz
https://reverb2014.dereverberation.com/tools/reverb_tools_for_Generate_SimData.tgz