PRM800K is a process supervision dataset accompanying the paper Let’s Verify Step by Step, providing 800,000 step-level correctness labels on model-generated solutions to problems from the MATH dataset. The repository releases the raw labels and the labeler instructions used in two project phases, enabling researchers to study how human raters graded intermediate reasoning. Data are stored as newline-delimited JSONL files tracked with Git LFS, where each line is a full solution sample that can contain many step-level labels and rich metadata such as labeler UUIDs, timestamps, generation identifiers, and quality-control flags. Each labeled step can include multiple candidate completions with ratings of -1, 0, or +1, optional human-written corrections (phase 1), and a chosen completion index, along with a final finish reason such as found_error, solution, bad_problem, or give_up.

Features

  • 800,000 step-level correctness labels for MATH problems via JSONL
  • Detailed schema with labeler IDs, timestamps, generations, QC flags, and finish reasons
  • Multi-candidate step ratings of -1, 0, +1 with optional human-completion entries
  • Labeler instruction docs for both phase 1 and phase 2
  • Python grading logic using math normalization and sympy equivalence checks
  • Nonstandard MATH train/test split and large-scale scored samples with PRM/ORM eval scripts

Project Samples

Project Activity

See All Activity >

Categories

AI Models

License

MIT License

Follow PRM800K

PRM800K Web Site

You Might Also Like
Gen AI apps are built with MongoDB Atlas Icon
Gen AI apps are built with MongoDB Atlas

Build gen AI apps with an all-in-one modern database: MongoDB Atlas

MongoDB Atlas provides built-in vector search and a flexible document model so developers can build, scale, and run gen AI apps without stitching together multiple databases. From LLM integration to semantic search, Atlas simplifies your AI architecture—and it’s free to get started.
Start Free
Rate This Project
Login To Rate This Project

User Reviews

Be the first to post a review of PRM800K!

Additional Project Details

Programming Language

Python

Related Categories

Python AI Models

Registered

2025-10-04