- Documentation
- Services
- Scaling
Scaling
Guara Cloud offers two types of scaling for your services: horizontal (more instances) and vertical (more resources per instance).
Horizontal scaling: replicas
Replicas are additional instances of your service running in parallel. When you increase the number of replicas, traffic is automatically distributed among them, improving availability and throughput.
With multiple replicas, your service stays online even if one instance fails — the other replicas continue serving requests while the instance is replaced.
Vertical scaling: CPU and memory
You can also increase the amount of CPU and memory allocated to each instance of your service. This is useful for applications that need more processing power or consume more memory.
Configuring in the dashboard
To adjust your service’s scaling:
- Go to the service page in the dashboard
- Navigate to the Resources tab
- Adjust the number of replicas, the CPU limit, and the memory limit
- Click Save
Changes are applied automatically. If you have more than one replica, the update is performed gradually (rolling update) to avoid downtime.
Limits by plan
Scaling limits depend on your project’s plan:
| Resource | Hobby | Pro | Business | Enterprise |
|---|---|---|---|---|
| CPU per service (request) | 0.25 vCPU | 0.5 vCPU | 1 vCPU | 4 vCPU |
| CPU per service (burst) | 0.5 vCPU | 1 vCPU | 2 vCPU | 8 vCPU |
| Memory per service (request) | 256 MB | 512 MB | 2 GB | 8 GB |
| Memory per service (burst) | 512 MB | 1 GB | 4 GB | 16 GB |
| Replicas per service | 1 | 2 | 3 | 50 |
| Services per project | 2 | 4 | 10 | Unlimited |
Graceful degradation
If you hit your plan’s limits, existing services keep running normally. However, new resource allocations will be rejected until you free up resources or upgrade your plan.
For example, if you are on the Pro plan with 4 active services, you will not be able to create a fifth service. But the 4 existing services continue running without interruption.