Together AI
Full-stack inference platform with 200+ models, fine-tuning, and custom training
Our Take
Together AI is the most full-featured independent inference provider. You get serverless inference, dedicated endpoints, fine-tuning (LoRA and full), custom training, and raw GPU cloud — all on 200+ models. Their in-house research (FlashAttention, Mamba) directly improves inference performance. Enterprise tiers include HIPAA compliance and 99.9% SLA. If you need a single platform for everything from experimentation to production with open-weight models, Together is the default choice.
Pros
- + Broadest open-weight model catalog (200+)
- + Full stack: inference, fine-tuning, training, GPU cloud
- + FlashAttention creators — research feeds product
- + HIPAA compliant with 99.9% SLA
Cons
- - Not the cheapest for pure inference
- - No self-hosted option
- - Can be complex for simple use cases
Details
- Pricing Model
- usage-based
- Starting Price
- $0 (free credits)
- Self-Hosted
- No
- Cloud Hosted
- Yes
- Founded
- 2022
Best For
- • Open-weight model inference
- • Fine-tuning (LoRA and full)
- • Custom model training
- • High-throughput batch workloads
Integrations
OpenAI API compatible LangChain LlamaIndex Llama Mixtral DeepSeek
Articles featuring Together AI
Last updated: