Software Architecture

2026


We Misunderstood LLMs And It's Costing Us

Microsoft and GitHub recently introduced new usage limits and charges for premium AI models in Copilot. Under the new structure, standard subscriptions have a strict monthly quota for queries that use top-tier models such as GPT-4o or Claude 3.5 Sonnet. Once you hit the limit, you either pay extra or fall back to the base models.

They were completely honest about the reason: the massive operational cost of running advanced frontier models at scale.