Explore AI Models
Browse 265 state-of-the-art foundation models. Filter by category, provider, or search by name.
265 models found
GPT-4o
OpenAI's most advanced multimodal model. Accepts text, image, and audio inputs with best-in-class reasoning. ~200B parameters (estimated).
DALL-E 3
Latest image generation model with improved prompt understanding and photorealistic outputs.
Whisper Large v3
Open-source speech recognition model. Robust multilingual transcription. 1.55B parameters.
Gemini 2.0 Flash
Google's fastest model with multimodal reasoning across text, image, and audio.
Claude 3.5 Sonnet
Anthropic's most intelligent model. Excels at complex reasoning, coding, and analysis.
Llama 3.1 405B
Meta's largest open-source model. State-of-the-art with 128K context. 405B parameters.
Mistral Large
Previous flagship model for reasoning and code generation. 70B parameters (estimated).
DeepSeek V3
671B MoE model with 37B active parameters. Rivals frontier models at a fraction of the cost.
Grok-2
xAI's frontier model with real-time information access and strong reasoning.
Command R+
Scalable enterprise model optimized for RAG, tool use, and multi-step agents. 104B parameters.
Stable Diffusion XL
Popular open-source image generation model with photorealistic output. 3.5B parameters.
Yi-Large
Flagship model with strong bilingual English-Chinese capabilities.
Yi-1.5 34B
Strong bilingual model excelling at coding and reasoning. 34B parameters.
Yi-1.5 9B
Compact bilingual model for diverse applications. 9B parameters.
Yi-Coder 9B
Specialized code model with strong performance for its size. 9B parameters.
StripedHyena Nous 7B
Alternative-architecture model using the Hyena operator. 7B parameters.
DBRX
Enterprise-grade MoE model optimized for efficiency. 132B total / 36B active parameters.
DBRX Instruct
Instruction-tuned DBRX for enterprise applications. 132B total parameters.
OpenELM 3B
Apple's efficient open-source language model family. 3B parameters.
Runway Gen-3 Alpha
Advanced video generation model with high fidelity and temporal coherence.
Pika 2.0
Text-to-video model with scene-level physics understanding.
Kling 1.5
Chinese video generation model with strong motion and physics simulation.
Mochi 1
Open-source text-to-video model with strong motion quality. 10B parameters.
LTX Video
Fast open-source video generation model based on DiT architecture. 2B parameters.
Midjourney v6
Industry-leading image generation with exceptional artistic quality and photorealism.
Adobe Firefly 3
Commercial-safe image generation trained on licensed content.
Ideogram 2.0
Image generation model with industry-best text rendering in images.
Playground v3
Image generation model optimized for graphic design and aesthetics.
GPT-4o Mini
Compact and cost-efficient variant of GPT-4o. Strong performance on everyday tasks. ~8B parameters (estimated).
GPT-4 Turbo
High-performance GPT-4 variant with 128K context window and improved instruction following. ~1.8T parameters (estimated).
GPT-4
OpenAI's flagship large language model with advanced reasoning. ~1.8T parameters (estimated MoE).
GPT-3.5 Turbo
Fast, affordable model optimized for chat and instruction-following. 175B parameters.
o1
OpenAI's reasoning model with chain-of-thought thinking for complex problem-solving. ~200B parameters (estimated).
o1-mini
Smaller reasoning model optimized for STEM and coding tasks. ~100B parameters (estimated).
o1-pro
Enhanced reasoning model with improved reliability for the most demanding tasks.
o3
Next-generation reasoning model with state-of-the-art benchmark performance.
o3-mini
Compact reasoning model balancing speed, cost, and strong reasoning.
o4-mini
Latest compact reasoning model with enhanced multimodal and agentic capabilities.
Whisper Large v3 Turbo
Faster variant of Whisper v3 with pruned decoder. 809M parameters.
Whisper Medium
Mid-size speech recognition model balancing accuracy and speed. 769M parameters.
Whisper Small
Compact speech recognition model for resource-constrained environments. 244M parameters.
Sora
Text-to-video generation model capable of creating realistic and imaginative scenes.
Codex
Code-focused model powering GitHub Copilot. Excels at code generation. 12B parameters.
GPT-4.1
Incremental improvement over GPT-4 with better coding performance and instruction following.
GPT-4.1 Mini
Compact variant of GPT-4.1 optimized for cost-effective deployments.
GPT-4.1 Nano
Ultra-compact GPT-4.1 variant designed for the fastest, cheapest inference at scale.
Gemini 2.5 Pro
Google's most advanced thinking model with enhanced reasoning and 1M token context.
Gemini 2.5 Flash
Fast, efficient Gemini model with thinking capabilities and multimodal understanding.
Gemini 1.5 Pro
Powerful multimodal model with 2M token context window for long-document understanding.
Gemini 1.5 Flash
Lightweight model optimized for speed and cost-efficiency with strong multimodal abilities.
Gemini 1.0 Ultra
Google's first Gemini flagship model for highly complex multimodal tasks.
Gemma 3 27B
Google's latest open-source model. State-of-the-art for its size class. 27B parameters.
Gemma 3 12B
Mid-size open-source Gemma 3 model balancing capability and efficiency. 12B parameters.
Gemma 3 4B
Compact open-source Gemma 3 model for mobile and edge deployment. 4B parameters.
Gemma 3 1B
Ultra-compact Gemma 3 model for on-device inference. 1B parameters.
Gemma 2 27B
Open-source text model with strong general capabilities. 27B parameters.
Gemma 2 9B
Efficient open-source model for balanced performance. 9B parameters.
Gemma 2 2B
Compact open-source model ideal for edge and mobile inference. 2B parameters.
CodeGemma 7B
Specialized code model based on Gemma for code generation and completion. 7B parameters.
PaLM 2
Google's advanced language model powering Bard and other AI features. 340B parameters (estimated).
Imagen 3
Highest-quality text-to-image model with photorealistic output and deep language understanding.
Veo 2
Most capable video generation model producing high-quality, realistic footage.
T5 XXL
Text-to-text transformer for NLP tasks, pre-trained on the C4 corpus. 11B parameters.
FLAN-T5 XXL
Instruction-tuned T5 model with strong zero-shot capabilities. 11B parameters.
BERT Large
Bidirectional encoder for NLU tasks. Foundation of modern NLP. 340M parameters.
SigLIP
Vision-language model for zero-shot image classification using sigmoid loss. 878M parameters.
Claude 4 Opus
Anthropic's most powerful model with unmatched reasoning, analysis, and creative capabilities.
Claude 4 Sonnet
Balanced Claude 4 model offering strong performance with improved speed and efficiency.
Claude 3.7 Sonnet
Hybrid thinking model combining instant responses with extended reasoning when needed.
Claude 3.5 Haiku
Fast, affordable Claude model for quick responses and high-throughput applications.
Claude 3 Opus
Previous flagship with exceptional performance on highly complex, open-ended tasks.
Claude 3 Sonnet
Balanced model with strong performance and practical speed for enterprise workloads.
Claude 3 Haiku
Fastest Claude 3 model for near-instant responses at scale.
Llama 4 Scout
Meta's latest MoE model with 17B active / 109B total parameters and 10M token context.
Llama 4 Maverick
Larger Llama 4 MoE variant: 17B active / 400B total parameters for complex reasoning.
Llama 3.3 70B
Cost-effective model delivering Llama 3.1 405B-level performance. 70B parameters.
Llama 3.1 70B
High-performance open-source model for commercial applications. 70B parameters.
Llama 3.1 8B
Compact Llama model for edge deployment and constrained environments. 8B parameters.
Llama 3 70B
Previous generation large Llama model with strong general performance. 70B parameters.
Llama 3 8B
Efficient previous-gen Llama model for everyday applications. 8B parameters.
Llama 2 70B
Large open-source model that kickstarted the open LLM revolution. 70B parameters.
Llama 2 13B
Mid-size Llama 2 model balancing capability and compute. 13B parameters.
Llama 2 7B
Compact foundational model for fine-tuning and experimentation. 7B parameters.
Code Llama 70B
Largest Code Llama variant for complex code generation. 70B parameters.
Code Llama 34B
Specialized coding model supporting code generation and infilling. 34B parameters.
Code Llama 7B
Compact code model for fast code completion and generation. 7B parameters.
Llama Guard 3
Safety classifier for content moderation in AI applications. 8B parameters.
Segment Anything 2
Foundation model for promptable visual segmentation in images and videos.
MusicGen Large
Music generation model creating audio from text descriptions. 3.3B parameters.
MusicGen Medium
Mid-size music generation model for text-to-music. 1.5B parameters.
SeamlessM4T v2
Multilingual multimodal translation model for speech and text. 2.3B parameters.
ImageBind
Embedding model binding 6 modalities: images, text, audio, depth, thermal, IMU.
Mistral Large 2
Mistral's flagship model for complex reasoning and multilingual tasks. 123B parameters.
Mistral Medium
Balanced model for business applications at moderate cost.
Mistral Small 3
24B parameter open-source model competitive with much larger alternatives.
Mistral Small
Efficient model for simple tasks, very low latency, and bulk processing.
Mistral Nemo
Open-source model co-developed with NVIDIA. 128K context. 12B parameters.
Mistral 7B
The original Mistral model that outperformed Llama 2 13B. 7.3B parameters.
Mixtral 8x22B
Large MoE model. 141B total / 39B active parameters. Strong multilingual.
Mixtral 8x7B
Efficient MoE model. 46.7B total / 12.9B active parameters.
Codestral 25.01
Latest specialized coding model for code generation and review. 22B parameters.
Codestral Mamba
Linear-time coding model using Mamba architecture. 7B parameters.
Pixtral Large
Frontier multimodal model for document understanding and image analysis. 124B parameters.
Pixtral 12B
Open-source multimodal model with vision capabilities. 12B parameters.
DeepSeek R1
Reasoning model using RL to achieve o1-level performance. 671B total / 37B active parameters.
DeepSeek R1 Distill Qwen 32B
R1 reasoning distilled into Qwen 2.5 32B architecture. 32B parameters.
DeepSeek R1 Distill Llama 70B
R1 reasoning distilled into Llama 3.1 70B architecture. 70B parameters.
DeepSeek R1 Distill Qwen 7B
R1 reasoning distilled into compact 7B architecture. 7B parameters.
DeepSeek R1 Distill Qwen 1.5B
Ultra-compact R1 distillation for edge deployment. 1.5B parameters.
DeepSeek V2.5
Previous generation MoE model with strong general capabilities. 236B total parameters.
DeepSeek Coder V2
Code model supporting 338 programming languages. 236B total / 21B active parameters.
DeepSeek Coder 33B
Specialized code generation model trained from scratch. 33B parameters.
DeepSeek Coder 6.7B
Compact code model for fast development assistance. 6.7B parameters.
Janus Pro 7B
Multimodal model unifying visual understanding and generation. 7B parameters.
Grok-3
xAI's most powerful model topping multiple benchmarks. Real-time information access.
Grok-3 Mini
Lightweight thinking model with controllable reasoning depth.
Grok-2 Mini
Compact Grok variant for fast, cost-effective inference.
Grok-1
xAI's first open-source model released as MoE. 314B total parameters.
Command A
Latest enterprise model with 256K context and multilingual agentic performance. 111B parameters.
Command R
Efficient enterprise model for production RAG workloads. 35B parameters.
Command R7B
Compact open-source model for fast enterprise inference. 7B parameters.
Embed v4
Latest embedding model for semantic search, classification, and clustering.
Embed v3
Multilingual embedding model supporting 100+ languages.
Aya 23 35B
Open-source multilingual model covering 23 languages. 35B parameters.
Aya 23 8B
Compact multilingual model for 23 languages. 8B parameters.
Aya Expanse 32B
Instruction-tuned multilingual model excelling in 23 languages. 32B parameters.
Stable Diffusion 3.5 Large
Latest Stable Diffusion with improved text rendering and composition. 8B parameters.
Stable Diffusion 3.5 Medium
Mid-size diffusion model balancing quality and speed. 2.5B parameters.
Stable Diffusion 1.5
Classic and widely-used image generation model with huge ecosystem. 860M parameters.
Stable Video Diffusion
Open-source video generation model for short clips from images. 1.5B parameters.
Stable Audio 2.0
AI music and sound generation from text descriptions. 1.1B parameters.
Stable LM 2 12B
Open-source language model for general-purpose text generation. 12B parameters.
Phi-4
Small language model excelling at reasoning and STEM tasks. 14B parameters.
Phi-3.5 MoE
Mixture-of-experts variant of Phi-3.5 for improved efficiency. 42B total / 6.6B active.
Phi-3.5 Mini
Compact model with 128K context for edge/mobile deployment. 3.8B parameters.
Phi-3 Medium
Mid-size Phi-3 model with strong reasoning capabilities. 14B parameters.
Phi-3 Mini
Compact language model for on-device AI applications. 3.8B parameters.
Phi-2
Efficient small model demonstrating emergent capabilities. 2.7B parameters.
Florence-2 Large
Unified vision model for captioning, detection, segmentation, and OCR. 770M parameters.
Florence-2 Base
Foundation vision model for diverse visual understanding tasks. 230M parameters.
Orca 2 13B
Research model trained with improved reasoning strategies. 13B parameters.
WizardLM 2 8x22B
MoE model with enhanced instruction following. 141B total parameters.
Nova Pro
Highly capable multimodal model balancing accuracy, speed, and cost.
Nova Lite
Fast, low-cost multimodal model processing images, video, and text.
Nova Micro
Text-only model with the lowest latency at very low cost.
Nova Canvas
Image generation model for creative and professional content creation.
Nova Reel
Video generation model for short-form content creation.
Titan Text Premier
Amazon's flagship text model for enterprise workloads.
Titan Embeddings v2
Text embedding model for search and retrieval applications.
Qwen 3 235B A22B
Largest Qwen 3 MoE model with hybrid thinking. 235B total / 22B active parameters.
Qwen 3 32B
Strong general-purpose model with hybrid thinking. 32B parameters.
Qwen 3 14B
Mid-size Qwen 3 model balancing power and efficiency. 14B parameters.
Qwen 3 8B
Compact Qwen 3 model for versatile applications. 8B parameters.
Qwen 3 4B
Small Qwen 3 model for edge deployment. 4B parameters.
Qwen 3 1.7B
Ultra-compact Qwen 3 model for mobile devices. 1.7B parameters.
Qwen 3 0.6B
Tiny Qwen 3 model for embedded and IoT applications. 0.6B parameters.
Qwen 2.5 72B
Large model competitive with Llama 3.1 405B on reasoning benchmarks. 72B parameters.
Qwen 2.5 32B
Strong mid-size model for diverse tasks. 32B parameters.
Qwen 2.5 14B
Efficient model for production deployments. 14B parameters.
Qwen 2.5 7B
Compact model for fast inference and fine-tuning. 7B parameters.
Qwen 2.5 3B
Small model for resource-constrained environments. 3B parameters.
Qwen 2.5 Coder 32B
Top open-source coding model rivaling GPT-4o on coding benchmarks. 32B parameters.
Qwen 2.5 Coder 7B
Compact code model for fast code completion. 7B parameters.
Qwen VL Max
Flagship vision-language model for document/chart understanding and visual QA.
Qwen VL Plus
Balanced vision-language model for image-based tasks.
QwQ 32B
Reasoning model by Alibaba matching o1-mini level performance. 32B parameters.
Yi-Lightning
Fast and powerful model rivaling frontier models at a fraction of cost.
Kolors
Open-source text-to-image model with strong Chinese text support. 8B parameters.
PixArt-Σ
Open-source DiT-based image generation with 4K resolution support. 900M parameters.
E5 Mistral 7B
LLM-based text embedding model with strong retrieval performance. 7B parameters.
BGE-M3
Multi-lingual, multi-granularity embedding model. 568M parameters.
GTE-Qwen2 7B
Text embedding model based on Qwen2 for retrieval tasks. 7B parameters.
NV-Embed v2
State-of-the-art generalist embedding model. 7B parameters.
Jina Embeddings v3
Multilingual multi-task embedding model with matryoshka support. 572M parameters.
Nomic Embed v1.5
Open-source embedding model with improved performance. 137M parameters.
RT-2
Vision-Language-Action model for robotic control. 55B parameters.
Octo
Open-source generalist robot policy for dexterous manipulation.
Jamba 1.5 Large
Hybrid SSM-Transformer model with 256K context for enterprise RAG. 398B total / 94B active.
Jamba 1.5 Mini
Compact Jamba for fast enterprise workloads. 52B total / 12B active parameters.
Nemotron 70B
Model fine-tuned with RLHF, excelling as a reward model and assistant. 70B parameters.
Nemotron 340B
Large-scale model optimized for enterprise synthetic data generation. 340B parameters.
NVLM 72B
Frontier multimodal LLM matching proprietary models on vision-language tasks. 72B parameters.
Llama 3.1 Nemotron 70B
NVIDIA-optimized Llama 3.1 70B with enhanced helpfulness. 70B parameters.
Falcon 180B
One of the largest open-source autoregressive models. 180B parameters.
Falcon 40B
Strong open-source model that once topped the Hugging Face Open LLM Leaderboard. 40B parameters.
Falcon 11B
Compact Falcon model for efficient deployment. 11B parameters.
Falcon 7B
Base Falcon model trained on the RefinedWeb dataset. 7B parameters.
Falcon 3 10B
Latest Falcon generation with improved training methodology. 10B parameters.
Falcon Mamba 7B
First large-scale pure Mamba architecture model. 7B parameters.
BLOOM
Open multilingual model supporting 46 languages, created by 1000+ researchers. 176B parameters.
BLOOMZ
Instruction-tuned BLOOM for cross-lingual zero-shot task generalization. 176B parameters.
StarCoder 2 15B
Open-source code LLM trained on The Stack v2, supporting 600+ programming languages. 15B parameters.
StarCoder 2 7B
Compact code model for fast code generation. 7B parameters.
StarCoder 2 3B
Small code model for local development tools. 3B parameters.
StarCoder
Original open-source code model trained on permissive data. 15.5B parameters.
SmolLM2 1.7B
Tiny but capable model for on-device AI. 1.7B parameters.
SmolLM2 360M
Ultra-compact model for embedded applications. 360M parameters.
SmolLM2 135M
Smallest capable instruction-following model. 135M parameters.
SmolVLM 2 2.2B
Compact vision-language model for image and video understanding. 2.2B parameters.
FLUX.1 [dev]
State-of-the-art open image generation with exceptional prompt following. 12B parameters.
FLUX.1 [schnell]
Fastest FLUX variant for real-time image generation. 12B parameters.
FLUX.1 [pro]
Premium FLUX model with highest quality outputs. 12B parameters.
Bark
Open-source TTS generating realistic speech, music, and sound effects.
Parler TTS Large
Controllable text-to-speech with natural speaker descriptions. 2.3B parameters.
Dia 1.6B
TTS model generating realistic dialogue with emotions. 1.6B parameters.
Moshi
Real-time speech-to-speech foundation model for natural conversation. 7B parameters.
Mars5 TTS
Novel two-stage TTS model excelling at prosody and expression.
F5-TTS
Flow-matching based text-to-speech with zero-shot voice cloning.
Pi
Personal AI assistant designed for emotional intelligence and supportive conversations.
Inflection 2.5
Delivers 97.5% of GPT-4's performance while retaining strong EQ capabilities.
Reka Core
Frontier multimodal model processing text, images, video, and audio natively.
Reka Flash
Fast multimodal model for cost-effective applications. 21B parameters.
Reka Edge
Compact multimodal model for edge deployment. 7B parameters.
Sonar Reasoning Pro
Advanced reasoning model with real-time web search and citations.
Sonar Pro
Enhanced search-augmented model with grounded, cited answers.
Sonar
Fast search-augmented model providing answers with web citations.
GLM-4 9B
Bilingual model with strong tool-use and 128K context. 9B parameters.
CogVideoX 5B
Open-source text-to-video model with strong generation quality. 5B parameters.
CogView 3 Plus
Text-to-image model with relay-based generation for high quality.
OpenELM 1.1B
Compact Apple language model for on-device AI. 1.1B parameters.
AIMv2 Large
Autoregressive vision encoder pre-trained with multimodal objectives. 304M parameters.
xLAM 2 70B
Large action model for function calling and multi-step agentic tasks. 70B parameters.
xLAM 2 8B
Compact action model excelling at tool use and function calling. 8B parameters.
CodeGen 2.5 7B
Multi-turn program synthesis model for code generation. 7B parameters.
Hunyuan Large
Tencent's largest open-source MoE model. 389B total / 52B active parameters.
HunyuanVideo
Open-source video generation model with strong physical understanding. 13B parameters.
Hunyuan3D 2.0
3D asset generation model creating high-quality meshes from images or text.
ERNIE 4.0
Baidu's flagship model with strong Chinese-English bilingual capabilities.
ERNIE-ViLG 2.0
Text-to-image model from Baidu with knowledge-enhanced generation.
MPT-30B
Open-source model trained from scratch with commercial license. 30B parameters.
MPT-7B
Commercially usable open-source base model. 7B parameters.
Pythia 12B
Research model suite for studying LLM training dynamics. 12B parameters.
GPT-NeoX 20B
Large open-source autoregressive model. 20B parameters.
GPT-Neo 2.7B
Early open-source GPT alternative trained on The Pile. 2.7B parameters.
MiniMax-01
Powerful model with 4M token context window. 456B total parameters.
MiniMax-Text-01
Lightning-fast text model with hybrid attention architecture.
Exaone 3.5 32B
Bilingual model excelling at instruction following. 32B parameters.
Exaone 3.5 7.8B
Compact bilingual model for diverse applications. 7.8B parameters.
Exaone 3.5 2.4B
Small but capable bilingual model. 2.4B parameters.
Vicuna 33B
Fine-tuned LLaMA model trained on ShareGPT conversations. 33B parameters.
Vicuna 13B
Popular fine-tuned chat model based on LLaMA. 13B parameters.
Vicuna 7B
Compact chat model fine-tuned on user conversations. 7B parameters.
Zephyr 7B
DPO-aligned model based on Mistral 7B with strong chat abilities. 7B parameters.
Nous Hermes 2 Mixtral 8x7B
Instruction-tuned Mixtral with strong general performance. 46.7B total parameters.
Nous Hermes 2 34B
Strong instruct model based on Yi-34B. 34B parameters.
OpenChat 3.5 7B
High-quality open model rivaling ChatGPT. 7B parameters.
Neural Chat 7B
Intel-optimized chat model based on Mistral 7B. 7B parameters.
Solar 10.7B
Depth-upscaled model with strong performance for its size. 10.7B parameters.
InternLM 2.5 20B
Strong open-source model with excellent tool use. 20B parameters.
InternLM 2.5 7B
Compact model with advanced reasoning and tool use. 7B parameters.
InternVL 2.5 78B
Open-source multimodal model matching GPT-4o on vision tasks. 78B parameters.
Baichuan 2 13B
Strong bilingual model with emphasis on Chinese NLP. 13B parameters.
Baichuan 2 7B
Compact bilingual model for Chinese-English tasks. 7B parameters.
ChatGLM3 6B
Bilingual chat model with function calling support. 6B parameters.
Aquila 2 70B
Bilingual model from Beijing Academy of AI. 70B parameters.
MAP-Neo 7B
Fully open-source bilingual model with transparent training. 7B parameters.
OLMo 2 13B
Fully open model with open data, weights, and code by Allen AI. 13B parameters.
OLMo 2 7B
Compact fully open model for research and deployment. 7B parameters.
Tülu 3 70B
State-of-the-art post-trained open model by Allen AI. 70B parameters.
Molmo 72B
Open multimodal model rivaling proprietary models on vision tasks. 72B parameters.
LeRobot
Open-source robotics framework with pretrained manipulation policies.
Med-PaLM 2
Medical-specialized model with expert-level clinical reasoning.
BioMistral 7B
Biomedical domain model fine-tuned on PubMed data. 7B parameters.
Codestral
Mistral's specialized coding model for code generation, review, and debugging.
Meditron 70B
Medical LLM adapted from Llama 2 for healthcare. 70B parameters.