- Set min replicas to 1 or more. You can configure this in your
cerebrium.toml file. This is typically the best option if you need to sustain a base load or
meet minimum SLAs with customers. Please note that you are charged for 24/7 usage of these instances.
- Set your cooldown period. You can configure this in your
cerebrium.toml file; it defaults to 60 seconds. This is the number of seconds of inactivity, measured from when your last
request finishes, that a container must experience before terminating. Every new request resets this timer. Note that you are charged
for the cooldown time, since your container keeps running throughout it.
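
As a rough sketch, both options might look like this in cerebrium.toml. The `[cerebrium.scaling]` section and the `min_replicas` and `cooldown` key names are assumptions here, not taken from the text above, so check them against the current Cerebrium configuration reference before relying on them.

```toml
# Assumed section and key names; verify against the Cerebrium docs.
[cerebrium.scaling]
# Keep at least one container running at all times (billed 24/7).
min_replicas = 1
# Seconds of inactivity after the last request finishes before a
# container is terminated. 60 is the default stated above.
cooldown = 60
```

A higher `min_replicas` value avoids cold starts for a steady base load at the cost of continuous billing, while a longer `cooldown` keeps containers available between bursts of requests and is billed only for the extra idle time.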