Introducing Qwen3-4B-Instruct-2507, with improved instruction following, reasoning, coding, multilingual knowledge, user alignment, and 256K long-context understanding.
Fine-tuningDocs | Qwen 3 4B Instruct 2507 can be customized with your data to improve responses. Fireworks uses LoRA to efficiently train and deploy your personalized model |
On-demand DeploymentDocs | On-demand deployments allow you to use Qwen 3 4B Instruct 2507 on dedicated GPUs with Fireworks' high-performance serving stack with high reliability and no rate limits. |