Currently Supported Models

Model Description Company Context Window Cost
ChatGPT o1 An innovative language model crafted for tackling complex reasoning, coding, and mathematical challenges. It's perfect for those who demand sophisticated chain-of-thought analysis and self-fact-checking abilities to handle intricate tasks with precision and dependability. OpenAI 128K High
ChatGPT o1-mini A budget-friendly language model focused on STEM reasoning, with exceptional capabilities in mathematics and coding. It's ideal for users who need dependable and advanced computational power in educational and scientific contexts without incurring high expenses. OpenAI 128K Moderate
GPT-4o A leading-edge multimodal AI model that merges text, vision, and soon audio functionalities. It's suited for users seeking superior real-time reasoning and enhanced function calling in various applications such as chatbots, content creation, and intricate data interpretation. OpenAI 128K Moderate
GPT-4o Mini A cost-effective yet high-performing language model featuring advanced text and vision capabilities. It's perfect for users requiring versatile, real-time interactions and expanded context handling for diverse uses without high costs. OpenAI 128K Low
DALL-E 3 A state-of-the-art AI tool for creating high-quality images from text descriptions. Suited for professionals in advertising, product design, and educational content creation, it offers detailed and varied visual outputs for innovative, impactful projects. OpenAI 4K Low
Claude Sonnet 3.5 A sophisticated AI model tailored for real-world software engineering, featuring enhanced reasoning and coding capabilities. It serves those needing reliable and advanced interaction with computer systems and extensive input handling for complex tasks like code generation and robotic process automation. Anthropic 200K Moderate
Claude Haiku 3.5 A rapid language model optimized for coding assistance, chatbots, and real-time data processing. It's designed for those who require fast processing, extensive context understanding, and advanced security for dynamic, complex applications across varied sectors. Anthropic 200K Low
Claude Opus 3 A highly capable multimodal AI model excelling in difficult reasoning, coding, and multilingual communication. It's ideal for users needing advanced text and image processing for intricate content creation, analysis, and decision-making across different applications. Anthropic 200K High
Gemini Pro A cutting-edge multimodal AI model by Google DeepMind, engineered for thorough data analysis and long-context comprehension across text, images, audio, and video. It's perfect for users tackling complex reasoning tasks and generating insights from varied data types in research and content production. Google 2M Moderate
Gemini Flash A high-speed multimodal AI model designed for swift, real-time applications involving text, images, audio, and video. It's perfect for those needing quick responses and high efficiency for tasks like chatbots, instant content generation, and real-time data analysis. Google 1M Low
Grok Beta A versatile AI model emphasizing enhanced speed and efficiency for text and code generation. It's suited for developers requiring strong capabilities for creating scalable applications with seamless function integration. xAI 131K Moderate
Grok Vision Beta An advanced model for understanding images, capable of processing a wide range of visual data. It's ideal for users needing precise visual analysis in tasks like document processing and interpreting visual data. xAI 8K Moderate
Llama 3.2 90B Vision A cutting-edge multimodal AI model from Meta, crafted for advanced visual reasoning and language processing. It caters to those needing robust capabilities in analyzing complex visual and textual data for applications like document comprehension, image captioning, and visual question answering. Meta 131K Low
Llama 3.2 11B Vision A robust multimodal AI model designed for high-performance image and text processing. It's perfect for users needing scalable, enterprise-ready solutions for tasks like image captioning and visual question answering, supporting high-resolution images and multilingual capabilities. Meta 131K Low
Llama 3.2 3B An efficient multilingual language model optimized for following instructions. It's ideal for users needing a lightweight yet potent solution for diverse NLP applications, including dialogue generation, summarization, and real-time text analysis across various languages. Meta 131K Low
Llama 3.1 405B A state-of-the-art multilingual language model with 405 billion parameters, engineered for advanced text generation and nuanced understanding. It's perfect for those requiring high accuracy and contextual awareness for sophisticated content creation and automated text-based applications across multiple fields. Meta 4K Moderate
FLUX 1.1 Pro A cutting-edge image generation model that produces high-quality images six times faster than its predecessor, making it ideal for rapid asset creation and enhanced creative workflows. Black Forest Labs 4K High
FLUX Realism LoRA A cutting-edge image generation model for generating photorealistic images, ideal for artists, marketers, and developers. Black Forest Labs 4K High
FLUX 1.0 Schnell Fast, high-quality image generation from text. Black Forest Labs 4K Low
FLUX 1.0 Dev A cutting-edge image generation model designed for developers, featuring high-quality outputs and open weights. Black Forest Labs 4K High
Stable Diffusion 3.5 Large Has unique features, including prompt adherence, customizability, efficiency, and high-quality image generation capabilities. Stability AI 256 High
Stable Diffusion 3 Medium Cutting-edge text-to-image model with enhanced performance, multi-subject handling, and resource efficiency for diverse creative applications. Stability AI 77 High