Trinity Mini is a 26B-parameter (3B active) sparse mixture-of-experts language model with 128 experts, 8 of which are active per token. It is engineered for efficient reasoning over long contexts (131K tokens), with robust function calling and support for multi-step agent workflows.
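To illustrate what "128 experts with 8 active per token" means in practice, here is a minimal sketch of generic top-k softmax gating in Python. It is not Trinity Mini's actual router, whose details are not given here; the function name `route_token` and the random router logits are assumptions made purely for illustration.

```python
import math
import random

NUM_EXPERTS = 128  # total experts, from the model description
TOP_K = 8          # experts active per token, from the model description


def route_token(router_logits, k=TOP_K):
    """Hypothetical helper: pick the k highest-scoring experts and
    softmax-normalize their scores into mixing weights."""
    if len(router_logits) < k:
        raise ValueError("need at least k router logits")
    # Indices of the k largest logits, highest first.
    top = sorted(range(len(router_logits)),
                 key=lambda i: router_logits[i], reverse=True)[:k]
    # Numerically stable softmax over the selected logits only.
    peak = max(router_logits[i] for i in top)
    exps = [math.exp(router_logits[i] - peak) for i in top]
    total = sum(exps)
    return [(i, e / total) for i, e in zip(top, exps)]


# Stand-in router scores for one token (assumption: random values).
rng = random.Random(0)
logits = [rng.gauss(0.0, 1.0) for _ in range(NUM_EXPERTS)]

selected = route_token(logits)
print(len(selected), "of", NUM_EXPERTS, "experts selected")
```

Only the selected experts run for a given token, which is why the active parameter count (3B) is much smaller than the total (26B).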
by Arcee-ai|131K context|$0.04/M input tokens|$0.15/M output tokens
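As a rough guide to what the listed rates mean per request, the sketch below computes an estimated cost from token counts. The helper name `estimate_cost_usd` is hypothetical, and it assumes the listed prices apply uniformly; individual providers may charge different rates.

```python
# Listed prices in USD per million tokens (from the line above).
INPUT_USD_PER_M = 0.04
OUTPUT_USD_PER_M = 0.15


def estimate_cost_usd(input_tokens, output_tokens):
    """Hypothetical helper: estimated request cost in USD at the listed rates."""
    return (input_tokens * INPUT_USD_PER_M
            + output_tokens * OUTPUT_USD_PER_M) / 1_000_000


# Example: a 100,000-token prompt with a 2,000-token reply.
print(f"${estimate_cost_usd(100_000, 2_000):.4f}")
```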
Endpoints
Available providers for this model, with details on pricing, context limits, and real-time health metrics.