Overview
Fireworks provides a CLI tool to export comprehensive billing metrics for all usage types including serverless inference, on-demand deployments, and fine-tuning jobs. The exported data can be used for cost analysis, internal billing, and usage tracking.Exporting billing metrics
Use the Fireworks CLI to export a billing CSV that includes all usage:Examples
Export all billing metrics for an account:Output format
The exported CSV includes the following columns:- email: Account email
- start_time: Request start timestamp
- end_time: Request end timestamp
- usage_type: Type of usage (e.g., TEXT_COMPLETION_INFERENCE_USAGE)
- accelerator_type: GPU/hardware type used
- accelerator_seconds: Compute time in seconds
- base_model_name: The model used
- model_bucket: Model category
- parameter_count: Model size
- prompt_tokens: Input tokens
- completion_tokens: Output tokens
Sample row
Automation
You can automate exports in cron jobs and load the CSV into your internal systems:Run
firectl export billing-metrics --help to see all available flags and options.Coverage
This export includes:- Serverless inference: All serverless API usage
- On-demand deployments: Deployment usage (see also Exporting deployment metrics for real-time Prometheus metrics)
- Fine-tuning jobs: Fine-tuning compute usage
- Other services: All billable Fireworks services
For real-time monitoring of on-demand deployment performance metrics (latency, throughput, etc.), use the Prometheus metrics endpoint instead.
See also
- firectl CLI overview
- Exporting deployment metrics - Real-time Prometheus metrics for on-demand deployments
- Rate Limits & Quotas - Understanding spend limits and quotas