Reddit

Emerging

4papers using it

2025first seen

The 'Reddit' dataset/benchmark contains data from the social media platform Reddit and is used to evaluate the performance of large language model agents in real-world, personalized applications.

🔎 Find this dataset

Papers using Reddit (4)

MCP-Persona: Benchmarking LLM Agents on Real-World Personal Applications via Environment Simulation2026

Polarization by Default: Auditing Recommendation Bias in LLM-Based Content Curation2026

Navigating through the hidden embedding space: steering LLMs to improve mental health assessment2025

Incongruent Positivity: When Miscalibrated Positivity Undermines Online Supportive Conversations2025