On this page

Scaling

Guara Cloud offers two types of scaling for your services: horizontal (more instances) and vertical (more resources per instance).

Horizontal scaling: replicas

Replicas are additional instances of your service running in parallel. When you increase the number of replicas, traffic is automatically distributed among them, improving availability and throughput.

With multiple replicas, your service stays online even if one instance fails — the other replicas continue serving requests while the instance is replaced.

Vertical scaling: CPU and memory

You can also increase the amount of CPU and memory allocated to each instance of your service. This is useful for applications that need more processing power or consume more memory.

Configuring in the dashboard

To adjust your service’s scaling:

  1. Go to the service page in the dashboard
  2. Navigate to the Resources tab
  3. Adjust the number of replicas, the CPU limit, and the memory limit
  4. Click Save

Changes are applied automatically. If you have more than one replica, the update is performed gradually (rolling update) to avoid downtime.

Limits by plan

Scaling limits depend on your project’s plan:

ResourceHobbyProBusinessEnterprise
CPU per service (request) 0.25 vCPU 0.5 vCPU 1 vCPU 4 vCPU
CPU per service (burst) 0.5 vCPU 1 vCPU 2 vCPU 8 vCPU
Memory per service (request) 256 MB 512 MB 2 GB 8 GB
Memory per service (burst) 512 MB 1 GB 4 GB 16 GB
Replicas per service 1 2 3 50
Services per project 2 4 10 Unlimited

Graceful degradation

If you hit your plan’s limits, existing services keep running normally. However, new resource allocations will be rejected until you free up resources or upgrade your plan.

For example, if you are on the Pro plan with 4 active services, you will not be able to create a fifth service. But the 4 existing services continue running without interruption.