Before launching, review Training Prerequisites & Validation for requirements, validation checks, and common errors.
When to use the Web UI
Start with the UI to learn the options, then switch to CLI for faster iteration and automation.
| Feature | CLI (eval-protocol) | Web UI |
|---|---|---|
| Best for | Experienced users, automation | First-time users, exploration |
| Parameter discovery | Need to know flag names | Guided with tooltips |
| Speed | Fast - single command | Slower - multiple steps |
| Automation | Easy to script and reproduce | Manual process |
| Batch operations | Easy to launch multiple jobs | One at a time |
| Reproducibility | Excellent - save commands | Manual tracking needed |
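To make the automation and batch-operations rows concrete, here is a minimal Python sketch of launching several jobs in a loop. The command name and flag are placeholders, not the real eval-protocol invocation; substitute the exact command and flags from your own working single-job launch.

```python
# A minimal sketch of scripting batch launches. "my-launch-command" and
# "--learning-rate" are hypothetical placeholders, NOT the real
# eval-protocol CLI; replace them with your own working launch command.
import subprocess

learning_rates = ["1e-4", "5e-5", "2e-5"]

for lr in learning_rates:
    cmd = ["my-launch-command", "--learning-rate", lr]  # hypothetical flags
    print("launching:", " ".join(cmd))
    subprocess.run(cmd, check=True)
```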
Launch training via Web UI
Step 1: Navigate to Fine-Tuning
- Go to Fireworks Dashboard
- Click Fine-Tuning in the left sidebar
- Click Fine-tune a Model

Step 2: Select Reinforcement Fine-Tuning
- Choose Reinforcement as the tuning method
- Select your base model from the dropdown
Not sure which model to choose? Start with llama-v3p1-8b-instruct for a good balance of quality and speed.
Step 3: Configure Dataset
- Upload new dataset or select existing from your account
- Preview dataset entries to verify format
- The UI validates your JSONL format automatically

Each entry must be a JSON object containing a messages array (see the sketch below).
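A minimal sketch of that shape, assuming the common chat-style JSONL schema of one JSON object per line; the file name and message contents are illustrative only:

```python
# Write a tiny JSONL dataset: one JSON object per line, each with a
# "messages" array. Contents below are illustrative assumptions.
import json

entries = [
    {
        "messages": [
            {"role": "system", "content": "You are a helpful assistant."},
            {"role": "user", "content": "Summarize Hamlet in one sentence."},
        ]
    },
]

with open("dataset.jsonl", "w") as f:
    for entry in entries:
        f.write(json.dumps(entry) + "\n")
```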
Step 4: Select Evaluator
- Choose from your uploaded evaluators
- Preview evaluator code and test results
- View recent evaluation metrics
For remote evaluators, you’ll enter your server URL in the environment configuration section.
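Conceptually, an evaluator maps a model response to a score the trainer can optimize. The sketch below illustrates only that idea; it is not the eval-protocol interface, and the scoring rule is a toy assumption.

```python
# A conceptual sketch of what an evaluator does: score a response so
# the trainer has a reward signal. NOT the eval-protocol API; the
# conciseness heuristic below is a toy assumption.
def evaluate(prompt: str, response: str) -> float:
    """Return a reward in [0, 1]; here, a toy conciseness heuristic."""
    return 1.0 if len(response) <= 200 else 0.5

print(evaluate("Summarize Hamlet.", "A prince avenges his father's murder."))
```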
Step 5: Set Training Parameters
Configure how the model learns. Core parameters (consolidated in the sketch after this list):
- Output model name: Custom name for your fine-tuned model
- Epochs: Number of passes through the dataset (start with 1)
- Learning rate: How fast the model updates (use default 1e-4)
- LoRA rank: Model capacity (8-16 for most tasks)
- Batch size: Training throughput (use default 32k tokens)
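A sketch collecting the core parameters in one place. The key names and the model name are illustrative assumptions, not an official config schema; the values mirror the recommendations above.

```python
# Core training parameters as one config. Key names are assumed for
# illustration; values follow the recommendations in the list above.
training_config = {
    "output_model_name": "my-rft-model",  # custom name (illustrative)
    "epochs": 1,                  # start with a single pass
    "learning_rate": 1e-4,        # default
    "lora_rank": 8,               # 8-16 covers most tasks
    "max_batch_tokens": 32_000,   # default 32k tokens
}
```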
Step 6: Configure Rollout Parameters
Control how the model generates responses during training (see the sketch after this list):
- Temperature: Sampling randomness (0.7 for balanced exploration)
- Top-p: Probability mass cutoff (0.9-1.0)
- Top-k: Token candidate limit (40 is standard)
- Number of rollouts (n): Responses per prompt (4-8 recommended)
- Max tokens: Maximum response length (2048 default)
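Why several rollouts per prompt help: the trainer can compare rewards across samples of the same prompt. The toy example below shows one common scheme, mean-centering rewards into advantages; it is an illustration, not necessarily the exact algorithm used here, and the reward values are assumed.

```python
# Toy illustration: with n = 4 rollouts of one prompt, mean-centering
# the evaluator's rewards yields per-rollout advantages. One common
# scheme, shown for intuition only; reward values are assumed.
rewards = [0.2, 0.9, 0.5, 0.8]
mean_reward = sum(rewards) / len(rewards)
advantages = [r - mean_reward for r in rewards]
print(advantages)  # rollouts above the mean get a positive learning signal
```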
Step 7: Review and Launch
- Review all settings in the summary panel
- Review the estimated training time and cost
- Click Start Fine-Tuning to launch