Auto-Scaling & Load Balancing
All deployment types support auto-scaling based on CPU and RAM thresholds. API and website deployments include built-in RoundRobin load balancing.
Capabilities
Intelligent Scaling & Traffic Distribution
MultiConnect Cloud automatically scales your containers up or down based on real-time CPU and RAM usage. For APIs and websites, incoming requests are distributed across all healthy instances using RoundRobin load balancing.
Server Auto-Scaling
Scale between min/max server count based on CPU/RAM thresholds.
API Load Balancing
Auto-scale instances with RoundRobin load balancing across all healthy containers.
Website Scaling
Auto-scale instances with RoundRobin load balancing (70% CPU default threshold).
10 Containers Per Region
Deploy up to 10 containers per region across all deployment types.
Zero-Downtime Deploys
Blue/green deployments with automatic traffic cutover – no interruptions.
How It Works
Load Balancing Flow
01
Request arrives
The platform’s managed reverse proxy receives the incoming HTTP request.
02
Next container
The proxy picks the next healthy container in rotation (Container 1 → 2 → 3 → back to 1).
03
Request forwarded
The request is forwarded to the selected container, which processes it and returns the response.
04
Health-aware
Unhealthy containers are automatically removed from rotation – requests only go to healthy instances.
05
Zero-downtime deploys
During blue/green deployments, new containers are verified healthy before traffic switches over.
Example: 3 API Containers in India
Even distribution of load, fault tolerance if Container B goes down, requests flow to A and C only.
Frequently Asked Questions
Everything you need to know about MultiConnect Cloud.
Up to 10 containers per region for all deployment types (servers, APIs, websites). This is a system-wide maximum regardless of plan. Plans define the resources (CPU, RAM, storage, bandwidth) per container.
No. RoundRobin load balancing is built-in for API and website deployments at no additional cost.