Join us for "Own Your AI" night on 10/1 in SF featuring Meta, Uber, Upwork, and AWS. Register here

Snorkel Logo Mark

Snorkel Mistral PairRM DPO

A fine-tuned version of the Mistral-7B model developed by Snorkel using PairRM for response ranking and Direct Preference Optimization (DPO) for model adaptation and refinement.

Try Model

Fireworks Features

Fine-tuning

Snorkel Mistral PairRM DPO can be customized with your data to improve responses. Fireworks uses LoRA to efficiently train and deploy your personalized model

Learn More

On-demand Deployment

On-demand deployments give you dedicated GPUs for Snorkel Mistral PairRM DPO using Fireworks' reliable, high-performance system with no rate limits.

Learn More

Info & Pricing

Provider

Snorkel

Model Type

LLM

Context Length

32768

Fine-Tuning

Available

Pricing Per 1M Tokens

$0.2