SPICE

Canonical

9 papers using it

First seen: 2022

Dataset Card for SPICED. Dataset Summary: The Scientific Paraphrase and Information ChangE Dataset (SPICED) is a dataset of paired scientific findings from scientific papers, news media, and Twitter. The pairs are of two types: <paper, news> and <paper, tweet>. Each pair is labeled for the degree of information simila

Papers using SPICE (9)

A universal augmentation framework for long-range electrostatics in machine learning interatomic potentials · 2025 · 24 cites

How Accurate Are DFT Forces? Unexpectedly Large Uncertainties in Molecular Datasets · 2025 · 1 cite

Cutting Through the Noise: On-the-fly Outlier Detection for Robust Training of Machine Learning Interatomic Potentials · 2026

Molecular electrostatic potentials from machine learning models for dipole and quadrupole predictions · 2026

Power law attention biases for molecular transformers · 2025

A Scalable and Quantum-Accurate Foundation Model for Biomolecular Force Field via Linearly Tensorized Quadrangle Attention · 2025

Nutmeg and SPICE: Models and Data for Biomolecular Machine Learning · 2024 · 35 cites

From Molecules to Materials: Pre-training Large Generalizable Models for Atomic Property Prediction · 2023 · 22 cites

SPICE, A Dataset of Drug-like Molecules and Peptides for Training Machine Learning Potentials · 2022 · 13 cites