Access benchmark-quality RL, multimodal, vision, and STEM datasets to accelerate your post-training research. Choose from pre-defined packs or create custom datasets tailored to your experiments.
Choose from our curated data collections, each optimized for post-training research and ready to request.
Samples are delivered by email, typically within 48 hours of your request, so you can begin integration and evaluation without delay.
Yes, you can select any combination of pre-defined packs or custom datasets in a single request form, and we’ll bundle them in one delivery.
We provide samples in machine-learning–ready formats (e.g., image folders, CSV/JSON for tabular and text data, WAV for audio). All modalities listed in the catalog are available, including RL, vision, audio, STEM, coding, gaming, and more.
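If you want a quick look at what arrived before wiring samples into a pipeline, something like the following sketch may help. It walks a local folder and groups files by modality based on file extension. The folder layout, the extension-to-modality mapping, and the function name `inventory_samples` are assumptions made for illustration, not a description of how packs are actually organized.

```python
from collections import defaultdict
from pathlib import Path

# Hypothetical mapping from file extension to modality.
# The real contents of a delivered pack may differ.
EXTENSION_TO_MODALITY = {
    ".csv": "tabular/text",
    ".json": "tabular/text",
    ".wav": "audio",
    ".png": "image",
    ".jpg": "image",
    ".jpeg": "image",
}


def inventory_samples(root):
    """Group every file under `root` by modality, using its extension."""
    groups = defaultdict(list)
    for path in sorted(Path(root).rglob("*")):
        if path.is_file():
            # Unknown extensions are collected under "other".
            modality = EXTENSION_TO_MODALITY.get(path.suffix.lower(), "other")
            groups[modality].append(path)
    return dict(groups)
```

Calling `inventory_samples("path/to/unpacked/samples")` returns a dictionary keyed by modality, which you can use to decide which loader to apply to each group.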
Sample datasets are provided under a research-only license. For full-pack access or commercial use, we’ll follow up to discuss terms and pricing.
Absolutely—use the “Custom Data Packs” option in the catalog to describe your requirements, and our team will work with you to assemble the right dataset.
You’ll receive curated sample files and metadata, followed by outreach from our team to discuss full-pack access, volume, pricing, and any custom adjustments.
Request your data packs today and accelerate your research.