Skip to main content

Document Inlining lets you use any LLM to process documents, providing higher quality and more input flexibility! Read more

FireAttention V2: 12x faster to make Long Contexts practical for Online Inference

FireAttention V2: 12x faster to make Long Contexts practical for Online Inference

By Dmytro Ivchenko|6/20/2024

Loading...