14
devs
2y

It's so difficult to scale for this type of load...

Comments
  • 3
    What is this graph what does it measure?
  • 2
    Is it routine? If you know it's coming at the same time you can scale 30 minutes before.
  • 5
    Isn’t this what serverless is for?
  • 1
    look like a random spike... is it some kind of shop or social media app?
  • 0
    is it unforeseen legitimate traffic? Perhaps you don't even need to scale for it -- just filter it out? And if it's legit and foreseen - scale for it in advance.
  • 2
    The load is expected. This specifically is is operations on our Redis database that we use for caching. Our platform is for conferences, so the majority of the time we have no users, until suddently we have thousands of users simultaneously online for it.

    We can sort of scale for it in advance, unless too much is going on at the same time (we're two developers managing the environment + managing other developers + prorgamming).

    Spiky load is always fun.

    Application wise we use Kubernetes and tend to run with a fairly large overhead, so auto scaling can catch up.
  • 2
    The E in EC2 stands for Elastic; EC2 instances accumulate CPU credit which can then be used to handle spikes.
  • 0
    @devs this is a fun project to plan and build for.

    If you have enough data you could actually train an AI to look for early indicators that would determine these spikes and scale out to accommodate.

    With a combination of data sources like the traffic you're showing along with app logs, database content based on venue dates it's possible to accomplish this.

    This is no different than how Netflix handles their own scalability problems.
  • 1
    @sariel we usually go from 0 users to >5000 users within 2-3 minutes.
  • 2
    @devs then there's plenty of time to scale up. 😂
  • 2
    @jonas-w my dicki
  • 1
    @karma your dicki in the morning
  • 2
    Hard wood ™️
  • 1
Add Comment