From 7f7557f8aaf44f3e02272d2fc0a34c29fa5006ed Mon Sep 17 00:00:00 2001
From: enkilee <jeffrey0122@163.com>
Date: Thu, 5 Dec 2024 09:25:12 +0800
Subject: [PATCH] fix md

---
 examples/csmsc/voc1/README.md | 20 +++++++++++---------
 1 file changed, 11 insertions(+), 9 deletions(-)

diff --git a/examples/csmsc/voc1/README.md b/examples/csmsc/voc1/README.md
index a1f63fc46..c6c70172b 100644
--- a/examples/csmsc/voc1/README.md
+++ b/examples/csmsc/voc1/README.md
@@ -3,15 +3,17 @@ This example contains code used to train a [parallel wavegan](http://arxiv.org/a
 ## Dataset
 ### Download and Extract
 Download CSMSC from it's [official website](https://test.data-baker.com/data/index/TNtts/) and extract it to `~/datasets`. Then the dataset is in the directory `~/datasets/BZNSYP`.
-datasets/BZNSYP should have three folders:
-
-└─ Wave
-  └─ .wav files (audio speech)
-  └─ PhoneLabeling
-      └─ .interval files (alignment between phoneme and duration)
-  └─ ProsodyLabeling
-    └─ 000001-010000.txt (text with prosodic by pinyin)
-Still we only use .wav files in training.
+
+After processing the data, the ``BZNSYP`` directory will look like this:
+```text
+BZNSYP
+├── Wave
+│    └─ *.wav files (audio speech)
+├── PhoneLabeling
+│    └─ *.interval files (alignment between phoneme and duration)
+└── ProsodyLabeling
+     └─ 000001-010000.txt (text with prosodic by pinyin)
+```
 
 ### Get MFA Result and Extract
 We use [MFA](https://github.com/MontrealCorpusTools/Montreal-Forced-Aligner) results to cut silence at the edge of audio.