Add BQCorpus Dataset #534

dyan-dy · 2021-06-09T10:57:37Z

PR types

New features

PR changes

Database

Description

Issue 447, 接入千言数据集，BQ Corpus 文本相似度。

…nto my-new-dataset

CLAassistant · 2021-06-09T10:57:41Z

All committers have signed the CLA.

paddlenlp/datasets/bq_corpus.py

smallv0221 · 2021-06-09T12:00:28Z

paddlenlp/datasets/bq_corpus.py

+
+class bq_corpus(DatasetBuilder):
+    """
+    bq_corpus


It would be terrific if you provide more information, such as task and auther, for your dataset.

smallv0221 · 2021-06-09T12:04:13Z

requirements.txt

 seqeval
-multiprocess
+multiprocess
+pre-commit


Why adding this?

Sorry, actually I haven't see these two lines in my code file till now. Maybe I added these when I was using git bash commands. I'm a greenhand in github so I made mistakes.

smallv0221 · 2021-06-09T12:08:24Z

paddlenlp/datasets/bq_corpus.py

+                if not head:
+                    head = data
+                else:
+                    texta, textb, label = data


Better to use "sentence1" and "sentence2" here.

ZeyuChen · 2021-06-14T08:00:14Z

Duplicated PR #562 , close this one.

dyan-dy added 2 commits June 9, 2021 18:39

git commit

a5d4065

Merge branch 'develop' of https://github.com/PaddlePaddle/PaddleNLP i…

5ea7130

…nto my-new-dataset

smallv0221 requested changes Jun 9, 2021

View reviewed changes

dyan-dy and others added 2 commits June 9, 2021 22:59

Merge branch 'develop' into my-new-dataset

2bd6e18

Merge branch 'develop' into my-new-dataset

512bdb5

ZeyuChen changed the title ~~My new dataset~~ Add BQCorpus Dataset Jun 10, 2021

ZeyuChen added the data Issues about data pipeline and dataset label Jun 10, 2021

ZeyuChen assigned smallv0221 Jun 10, 2021

dyan-dy added 3 commits June 10, 2021 16:58

Merge branch 'develop' into my-new-dataset

6b67497

Merge branch 'develop' into my-new-dataset

ac90af3

Merge branch 'develop' into my-new-dataset

3758c50

ZeyuChen closed this Jun 14, 2021

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Add BQCorpus Dataset #534

Add BQCorpus Dataset #534

Uh oh!

dyan-dy commented Jun 9, 2021 •

edited

Loading

Uh oh!

CLAassistant commented Jun 9, 2021 •

edited

Loading

Uh oh!

Uh oh!

smallv0221 Jun 9, 2021

Uh oh!

smallv0221 Jun 9, 2021

Uh oh!

dyan-dy Jun 9, 2021

Uh oh!

smallv0221 Jun 9, 2021

Uh oh!

ZeyuChen commented Jun 14, 2021

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

Add BQCorpus Dataset #534

Add BQCorpus Dataset #534

Uh oh!

Conversation

dyan-dy commented Jun 9, 2021 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

PR types

PR changes

Description

Uh oh!

CLAassistant commented Jun 9, 2021 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

Uh oh!

smallv0221 Jun 9, 2021

Choose a reason for hiding this comment

Uh oh!

smallv0221 Jun 9, 2021

Choose a reason for hiding this comment

Uh oh!

dyan-dy Jun 9, 2021

Choose a reason for hiding this comment

Uh oh!

smallv0221 Jun 9, 2021

Choose a reason for hiding this comment

Uh oh!

ZeyuChen commented Jun 14, 2021

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

dyan-dy commented Jun 9, 2021 •

edited

Loading

CLAassistant commented Jun 9, 2021 •

edited

Loading