During its annual developer conference — Google I/O 2026, google revealed that instead of counting the number of prompts per day, Gemini will now measure usage based on compute consumed. This means that the complexity of prompts, features used, and chat length will determine how quickly a user’s quota is exhausted. Earlier, the Gemini users were restricted by a simple daily prompt cap. But now Google says limits will refresh within every five hours, but the amount of compute consumed will vary depending on the task. For instance, the basic text queries will use lesser or minimal compute, whereas resource-heavy tasks like video generation and deep research will consume higher limits.
“Gemini will move to compute-based usage limits that will refresh every 5 hours until you reach your weekly limit. Calculation of your usage will factor in the complexity of your prompt, the features you use, and the length of your chat. Paid users have higher limits than users without a Google AI subscription, said Google.
Premium models and features require more usage and may cause you to reach your limit faster. This would include things like:
- Media generation
- Deep Research
- Pro Model
- Extended thinking and Deep Think
Usage limits
Plan
| Limit
|
Without a plan
| Standard limits
|
AI Plus
| 2x higher than standard limits
|
AI Pro
| 4x higher than standard limits
|
AI Ultra
| 5x or 20x higher than AI Pro depending on your subscription
|
Impact of usage limit change on users
The casual users who rely on Google Gemini to basic day-to-day queries will not notice any major change. On the other hand, professionals who are using Gemini for coding, media creation, or enterprise-scale tasks will now have to monitor their compute usage more carefully. Google confirmed that limits for users under 18 remain unchanged.
Google said the new system helps manage resource-heavy features such as image, video, and music generation, while encouraging users to upgrade to higher-tier plans for intensive workloads.