LLM
StarCoder 15.5B is a 15.5B-parameter model trained on 80+ programming languages from The Stack (v1.2), with opt-out requests excluded. It uses Multi-Query Attention, has a context window of 8,192 tokens, and was trained with the Fill-in-the-Middle objective on 1 trillion tokens.
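Because the model was trained with the Fill-in-the-Middle objective, it can be prompted to complete code between a known prefix and suffix. The following is a minimal sketch of how such a prompt might be assembled, assuming the `<fim_prefix>`, `<fim_suffix>`, and `<fim_middle>` sentinel tokens from the StarCoder tokenizer; the helper name `build_fim_prompt` is hypothetical, and how the resulting string is sent to a deployment is not shown here.

```python
# Sentinel tokens assumed from the StarCoder tokenizer vocabulary.
FIM_PREFIX = "<fim_prefix>"
FIM_SUFFIX = "<fim_suffix>"
FIM_MIDDLE = "<fim_middle>"


def build_fim_prompt(prefix: str, suffix: str) -> str:
    """Hypothetical helper: arrange code before and after the gap in
    prefix-suffix-middle order, so the model generates the missing middle."""
    return f"{FIM_PREFIX}{prefix}{FIM_SUFFIX}{suffix}{FIM_MIDDLE}"


# Code before the gap, and code after it.
prefix = "def add(a, b):\n    return "
suffix = "\n\nprint(add(1, 2))\n"

prompt = build_fim_prompt(prefix, suffix)
print(prompt)
```

The text the model generates after `<fim_middle>` is the candidate infill. Keep the combined prefix, suffix, and expected completion within the 8,192-token context window.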
On-demand deployments let you run StarCoder 15.5B on dedicated GPUs using Fireworks' high-performance serving stack, with high reliability and no rate limits.
See the On-demand deployments guide for details.