More often than not, database administrators have to fight to get high loads out of their databases. It could be ad hoc queries, urgent reports, overflown jobs, or simply high frequency and volume of queries by the end users.
DBAs try their best with generous capacity planning to ensure optimal response time and throughput for the end users. There are various scenarios where it becomes very hard to predict the demand. Storage and processing needs in case of unpredictable load are hard to foretell in advance.
Cloud computing offers the promise of unmatched scalability for processing and storage needs. Amazon AWS has introduced a new service which gets closer to that ultimate scalability. Amazon Aurora is a hosted relational database service by AWS. You set your instance size and storage need while setting Aurora up. If your processing requirements change, you change your instance size and if you need more read throughput you add more read replicas.
But that is good for the loads we know about and can more ore less predict. What about the loads which appear out of blue? Maybe for a blogging site, where a post has suddenly gone viral and it has started getting millions of views instead of hundreds? And then the traffic disappears after some time suddenly, just like it appeared out of nowhere and may be after a few days the same thing happens for another post?
In this case if you are running Amazon Aurora, it would be fairly expensive to just increase the instance size or read replicas in the anticipation that some traffic burst would come. It might not, but then it might.
Ahead of this uncertainty, enters Amazon Aurora Serverless. With this Serverless Aurora, you don’t select your instance size. You simply specify an endpoint and then all the queries are routed to that endpoint. Behind that endpoint lies a a warm proxy fleet of database capacity which can scale as per your requirements within 5 seconds. It’s all on-demand and ideal for transient spiky loads. What’s even sweeter is that billing is on a second by second basis and deals in Aurora capacity units and the minimum is one minute for each newly added resource.