Skip to content

Conversation

dyan-dy
Copy link
Contributor

@dyan-dy dyan-dy commented Jun 9, 2021

PR types

New features

PR changes

Database

Description

Issue 447, 接入千言数据集,BQ Corpus 文本相似度。

@CLAassistant
Copy link

CLAassistant commented Jun 9, 2021

CLA assistant check
All committers have signed the CLA.


class bq_corpus(DatasetBuilder):
"""
bq_corpus
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

It would be terrific if you provide more information, such as task and auther, for your dataset.

seqeval
multiprocess
multiprocess
pre-commit
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Why adding this?

Copy link
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Sorry, actually I haven't see these two lines in my code file till now. Maybe I added these when I was using git bash commands. I'm a greenhand in github so I made mistakes.

if not head:
head = data
else:
texta, textb, label = data
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Better to use "sentence1" and "sentence2" here.

@ZeyuChen ZeyuChen changed the title My new dataset Add BQCorpus Dataset Jun 10, 2021
@ZeyuChen ZeyuChen added the data Issues about data pipeline and dataset label Jun 10, 2021
@ZeyuChen
Copy link
Member

Duplicated PR #562 , close this one.

@ZeyuChen ZeyuChen closed this Jun 14, 2021
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

data Issues about data pipeline and dataset

Projects

None yet

Development

Successfully merging this pull request may close these issues.

4 participants