Skip to content

cortexlabs/cortex

Repository files navigation

WebsiteSlackDocs



Cost-effective serverless computing

Cortex is a highly-scalable serverless computing platform that is up to 90% less expensive than AWS Lambda. It is designed for scaling compute-intensive realtime, async, and batch workloads on your AWS account. Organizations worldwide use Cortex to scale microservices, data processing, machine learning, and other applications in production.


Maximize compute utilization

Workload autoscaling - set autoscaling policies per workload based on its traffic.

Resource requests - configure CPU, GPU, and memory requests per workload, without limits.

Container deployments - customize the runtime and request concurrency for each container.


Minimize compute costs

Cluster autoscaling - elastically scale your cluster to meet demand.

Spot instances - run workloads on spot instances without sacrificing reliability.

Multi-instance - use multiple instance types to optimize price-performance ratio per workload.


Visualize and control spend

Metrics dashboard - monitor latency and resource utilization with pre-built dashboards.

Spend visibility - visualize your spend using the latest AWS pricing information.

Resource limits - set limits on resource consumption globally and per workload.

About

Production infrastructure for machine learning at scale

Topics

Resources

License

Contributing

Stars

Watchers

Forks

Contributors 22