AIToolRank
ModelsCompareReviewsNewsQuizCalculator
AIToolRank

AI model specs, pricing & comparisons.

© 2026 AIToolRank

Explore

ModelsCompareCalculatorLeaderboardMethodology

Top Providers

OpenAIAnthropicGoogleDeepSeekMeta

Resources

ReviewsNewsModel QuizMethodology
Home/Best Multimodal AI Models -- Vision, Audio & More

Best Multimodal AI Models -- Vision, Audio & More

AI models that accept more than just text input, including images, audio, and video.

#ModelProviderContextInput $/1MOutput $/1M
1Gemini 2.5 ProGoogle1.0M$1.25$10
2Gemini 3.1 Pro PreviewGoogle1.0M$2.00$12
3GPT-4.1 nanoOpenAI1.0M$0.10$0.40
4GPT-4.1OpenAI1.0M$2.00$8.00
5GPT-4.1 miniOpenAI1.0M$0.40$1.60
6Nova PremierAmazon1M$2.50$13
7Nova LiteAmazon300K$0.06$0.24
8Nova ProAmazon300K$0.80$3.20
9Grok 4xAI256K$3.00$15
10Claude 3.5 HaikuAnthropic200K$0.80$4.00
11Claude 3 HaikuAnthropic200K$0.25$1.25
12o3OpenAI200K$2.00$8.00
13Sonar ProFreePerplexity200KFreeFree
14o1-proOpenAI200K$150$600
15o1OpenAI200K$15$60
16Sonar Reasoning ProFreePerplexity128KFreeFree
17GPT-4 TurboOpenAI128K$10$30
18SonarFreePerplexity127KFreeFree