Add Gradient Cache&Recompute into Neural Search #3697
Conversation
* `corpus_file`: the corpus file used to build the recall (candidate) library
* `use_recompute`: enable the Recompute strategy to save GPU memory; it trades time for space
* `use_gradient_cache`: enable the Gradient Cache strategy to save GPU memory; it also trades time for space
* `chunk_numbers`: Gradient Cache parameter specifying how many chunks a single batch is split into for execution (see the sketch below)
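A minimal sketch of how these flags might be exposed in the training script's argument parser. Only the flag names come from the documentation above; the parser wiring, types, and defaults here are assumptions.

```python
import argparse

parser = argparse.ArgumentParser()
# Path to the corpus used to build the recall (candidate) library.
parser.add_argument("--corpus_file", type=str, required=True)
# Recompute trades extra forward computation for lower GPU memory.
parser.add_argument("--use_recompute", action="store_true")
# Gradient Cache also trades time for memory by splitting each batch.
parser.add_argument("--use_gradient_cache", action="store_true")
# Number of chunks a single batch is split into under Gradient Cache
# (the default here is illustrative only).
parser.add_argument("--chunk_numbers", type=int, default=4)
args = parser.parse_args()
```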
I suggest consolidating the recompute and gradient_cache distributed capabilities into the Trainer; @ZHUI will add that later.
Already forwarded to @ZHUI.
    title_cls_embedding,
    transpose_y=True)

# substract margin from all positive samples cosine_sim()
Please keep the comments here consistent and capitalize the first letter.
Fixed.
# substract margin from all positive samples cosine_sim()
margin_diag = paddle.full(shape=[query_cls_embedding.shape[0]],
                          fill_value=self.margin,
                          dtype=paddle.get_default_dtype())
The logic here is a bit odd: why not take the dtype from an existing variable? You can use the dtype of cosine_sim directly.
I tried it and the result is the same; this has been changed to use the dtype of cosine_sim.
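For illustration, a minimal sketch of the revised margin subtraction, assuming the change simply reads the dtype from cosine_sim; the function name and the margin default are placeholders, not the actual code in this PR.

```python
import paddle

def subtract_margin(query_cls_embedding, title_cls_embedding, margin=0.3):
    # Pairwise in-batch similarities between query and title embeddings.
    cosine_sim = paddle.matmul(query_cls_embedding,
                               title_cls_embedding,
                               transpose_y=True)
    # Subtract the margin from the positive (diagonal) pairs only, taking the
    # dtype from cosine_sim instead of paddle.get_default_dtype().
    margin_diag = paddle.full(shape=[query_cls_embedding.shape[0]],
                              fill_value=margin,
                              dtype=cosine_sim.dtype)
    return cosine_sim - paddle.diag(margin_diag)
```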
                  all_grads):

    sub_query_input_ids, sub_query_token_type_ids, sub_title_input_ids, sub_title_token_type_ids = sub_batch
    paddle.framework.random.set_cuda_rng_state(CUDA_state)
Is the CUDA random state restored here in order to reproduce the dropout?
Since Gradient Cache requires two forward passes, the random state is set so that dropout and other random operations behave identically, keeping the model's intermediate states the same and making the two forward passes produce consistent results.
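A minimal sketch of this save/restore pattern, assuming paddle.framework.random.get_cuda_rng_state() is the getter paired with the set_cuda_rng_state() call shown in the diff; the function and variable names are placeholders, and the gradient computation between the two passes is elided.

```python
import paddle

def gradient_cache_forwards(model, sub_batch):
    # Save the CUDA RNG state before the first, no-grad representation pass.
    cuda_state = paddle.framework.random.get_cuda_rng_state()

    with paddle.no_grad():
        cached_embeddings = model(*sub_batch)  # first pass: cache representations only

    # ... compute the loss and the gradients w.r.t. the cached embeddings here ...

    # Restore the saved state so dropout masks are identical in the second pass,
    # which rebuilds the graph for backpropagation with the same intermediate states.
    paddle.framework.random.set_cuda_rng_state(cuda_state)
    embeddings = model(*sub_batch)
    return cached_embeddings, embeddings
```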
LGTM
PR types
PR changes
Description