Merged
8 changes: 4 additions & 4 deletions legacy/model_zoo/ernie-1.0/README.md
@@ -116,7 +116,7 @@ For more detailed documentation on ERNIE Chinese pre-training, see [ERNIE Chinese pre-train
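<summary><b>CLUECorpusSmall data preparation</b></summary>

#### Data preparation
-To download the data, see the [preprocess](./preprocess) directory and follow the `CLUECorpusSmall dataset processing tutorial` in its documentation. After the download finishes:
+To download the data, see the [preprocess](../../../llm/tools/preprocess) directory and follow the `CLUECorpusSmall dataset processing tutorial` in its documentation. After the download finishes: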

Extract the files
```shell
@@ -308,9 +308,9 @@ PaddleNLP is committed to open-source pre-training work, using the open-source Chinese corpora CLUE, WuDao

#### Data preparation

-For data download and conversion, see the [data preprocessing documentation](./preprocess/README.md):
-- [CLUECorpus2020 data processing](./preprocess/docs/CLUECorpus2020.md)
-- [WuDaoCorpusBase data processing](./preprocess/docs/WuDaoCorpusBase.md)
+For data download and conversion, see the [data preprocessing documentation](../../../llm/tools/preprocess/README.md):
+- [CLUECorpus2020 data processing](../../../llm/tools/preprocess/docs/CLUECorpus2020.md)
+- [WuDaoCorpusBase data processing](../../../llm/tools/preprocess/docs/WuDaoCorpusBase.md)

If you need a customized vocabulary, see [Vocabulary creation](./vocab/README.md) for how to build one.

1 change: 1 addition & 0 deletions legacy/model_zoo/ernie-1.0/preprocess