The 30 second story
You know how delivery services offer next-day, same-day, or instant options at different prices? Google just did the same thing with its Gemini AI service. They’ve added two new tiers called Flex and Priority that let businesses choose between cheap-and-slow or expensive-and-fast responses from their AI system. Google has not announced UK pricing yet, but the service is available to developers here who already use Gemini.
Why it matters
Most businesses using AI tools right now pay a fixed rate and get whatever speed the service delivers on that day. If the system is busy, you wait. If it’s quiet, you get fast responses. You have no control over either the cost or the speed. These new tiers change that completely. Flex mode costs less but might take longer when demand is high. Priority mode costs more but jumps the queue for instant responses. This matters because AI automation works differently depending on what you’re doing. If you’re automatically processing invoices overnight, slow and cheap makes sense. If you’re using AI to answer customer questions during business hours, you need the fast option.
What this means for your business
- Running AI automation during quiet periods becomes much cheaper, making it viable for tasks that weren’t worth the cost before
- Customer-facing AI tools can guarantee fast responses by paying for priority access, reducing the risk of frustrated customers waiting
- Switching between different speeds for different tasks means your overall AI costs can drop while maintaining quality where it matters
- Businesses no longer need to avoid AI automation during busy periods when costs were unpredictable