New Models from OpenAI, Google and Mistral and New Pricing

by Duncan Miller on July 24, 2024
We've made some changes at Shiro that you should know about. We've simplified our pricing and added a per-user Premium tier (instead of team-only Premium tier). The Premium tier offers unlimited Prompts, Deployments and Tests and provides access to state-of-the-art models from OpenAI, Google, Anthropic, Mistral, and Cohere. Check out our pricing page for more information and get a free 14 day trial on the Premium tier.

We still have our free tier available with up to 10 Prompts, 1 Deployment and unlimited Tests, and access to OpenAI models.

OpenAI GPT-4o and 4o mini

We've recently added support for OpenAI's new flagship model GPT-4o ("o" for "omni") as well as the lighter GPT-4o-mini model. We are continuing to support GPT-4, GPT-4-turbo and GPT-3.5 as well. All OpenAI models are available for both free tier and premium tier users.

GPT-4o matches GPT-4 Turbo performance on text in English and code, with significant improvement on text in non-English languages, while also being much faster and 50% cheaper in the API.

Language model name: gpt-4o

GPT-4o mini scores 82% on MMLU and currently outperforms GPT-4 on chat preferences in LMSYS leaderboard. It is priced at 15 cents per million input tokens and 60 cents per million output tokens, an order of magnitude more affordable than previous frontier models and more than 60% cheaper than GPT-3.5 Turbo.

Language model name: gpt-4o-mini

Mistral NeMo


Premium tier users can now create, test, and deploy prompts using the Mistral NeMo model in addition to Mistral Small, Medium, Large and 7b.

NeMo is Mistral's new best small model. A state-of-the-art 12B model with 128k context length, built in collaboration with NVIDIA, and released under the Apache 2.0 license.

Language model name: open-mistral-nemo

Google Gemini 1.5 Pro


Premium users can also use the Gemini 1.5 Pro model in addition to Gemini 1.0 Pro.

Gemini 1.5 delivers dramatically enhanced performance. It represents a step change in Google's approach, building upon research and engineering innovations making Gemini 1.5 more efficient to train and serve. It’s a mid-size multimodal model, optimized for scaling across a wide-range of tasks, and performs at a similar level to 1.0 Ultra, Google's largest model to date. It also introduces a breakthrough experimental feature in long-context understanding.

Language model name: gemini-1.5-pro
  • Photo of Duncan Miller

    Duncan Miller

    Founder, Software Developer

    Duncan is the founder and lead software developer for OpenShiro. He been running startups since 2006 and has been writing code for over 20 years. Duncan has an MBA from Babson College and lives with his wife and two children in Portland Oregon on an extinct cinder code volcano. He is passionate about artificial intelligence, climate solutions, public benefit companies and social entrepreneurship.

Subscribe to our newsletter

The latest prompt engineering best practices and resources, sent to your inbox weekly.