← all datasets

SPICE

Canonical
0papers using it
40HF downloads
3HF likes

Dataset Card for SPICED Dataset Summary The Scientific Paraphrase and Information ChangE Dataset (SPICED) is a dataset of paired scientific findings from scientific papers, news media, and Twitter. The types of pairs are between <paper, news> and <paper, tweet>. Each pair is labeled for the degree of information similarity in the findings described by each sentence, on a scale from 1-5. This is called the Information Matching Score (IMS). The data was curated from S2ORC… See the full description on the dataset page: https://huggingface.co/datasets/copenlu/spiced.