SNLI语料库(1.0版)是一个57万个人工书写的英语句子对的集合,这些句子对经过手动标注以实现平衡分类,并带有enume,矛盾和中性标签,支持自然语言推理(NLI)的任务。
SNLI语料库(1.0版)是一个57万个人工书写的英语句子对的集合,这些句子对经过手动标注以实现平衡分类,并带有enume,矛盾和中性标签,支持自然语言推理(NLI)的任务,也称为文本内容识别(RTE)。我们旨在将其用作评估文本表示系统(尤其是包括由表示学习方法所诱导的系统)的基准,以及开发任何形式的NLP模型的资源。
语料示例:
Text | Judgments | Hypothesis |
---|---|---|
A man inspects the uniform of a figure in some East Asian country. | contradiction C C C C C | The man is sleeping |
An older and younger man smiling. | neutral N N E N N | Two men are smiling and laughing at the cats playing on the floor. |
A black race car starts up in front of a crowd of people. | contradiction C C C C C | A man is driving down a lonely road. |
A soccer game with multiple males playing. | entailment E E E E E | Some men are playing a sport. |
A smiling costumed woman is holding an umbrella. | neutral N N E C N | A happy woman in a fairy costume holds an umbrella. |
数据引用:
Samuel R. Bowman, Gabor Angeli, Christopher Potts, and Christopher D. Manning. 2015. A large annotated corpus for learning natural language inference. In Proceedings of the 2015 Conference on Empirical Methods in Natural Language Processing (EMNLP). [pdf] [bib]