@ -72,4 +72,4 @@ rhy_e2e_pretrain
├── energy_stats.npy
├── pitch_stats.npy
└── speech_stats.npy # statistics used to normalize spectrogram when training fastspeech2
```
@ -74,4 +74,4 @@ fastspeech2_nosil_baker_ckpt_0.4
├── durations.txt # preprocess.sh的中间过程
└── speech_stats.npy # 训练 fastspeech2 时用于规范化频谱图的统计数据