Cloud Service Mesh service routing APIs overview
This document is intended for mesh or platform administrators and service developers who have an intermediate to advanced level of familiarity with Cloud Service Mesh and service mesh concepts and who are deploying Cloud Service Mesh on Compute Engine with VM instances. This document applies to deployments using Envoy and gRPC clients. For more information on Cloud Service Mesh concepts, see the general overview.
Cloud Service Mesh provides service networking capabilities to your applications, including advanced traffic management, observability, and security. However, configuring and operating a service mesh is a complex task for mesh administrators and service developers.
This document describes the service routing APIs for configuring Cloud Service Mesh. These APIs are designed to simplify and improve your overall mesh configuration experience.
The service routing model uses API resources called Mesh, Gateway, and Route. These resources provide a contextually-relevant configuration experience when you define your service networking control plane.
This document introduces the following service routing API model and resources.
Meshresource- Service-to-service (east-west) traffic management and security configuration for Envoy sidecar proxies and proxyless gRPC clients.
-
- Traffic management and security configuration for Envoy proxies acting as ingress gateways, allowing external clients to connect to the service mesh (north-south).
Routeresources with the following types:
The Google Cloud console does not provide support for the service routing APIs. You must implement these API resources using the Google Cloud CLI or the REST APIs.
Use cases and benefits
The service routing APIs let you configure Cloud Service Mesh for both proxyless gRPC and Envoy proxy deployments. The service routing API model enables several key benefits.
In the following diagram, two services in the service mesh are connected by a Mesh resource. The two HTTPRoute resources configure routing. The mesh or platform administrator manages the Mesh resource and the two service owners create the routing configuration for their services.
Role-oriented API design enables clear separation of responsibilities
The service routing APIs let you separate mesh configuration responsibilities based on organizational roles:
- Mesh administrators can define the logical mesh as well as the ingress gateway infrastructure.
- Service owners (application developers) can independently define access patterns for their services. They can also define and apply traffic management policies for their services.
In the following diagram, Cloud Load Balancing and a Gateway resource provide an ingress gateway for traffic entering the mesh from a client that is not in the mesh. The mesh administrator configures and manages the Gateway resource, while the service owners configure and manage their own services and traffic routing.
Enhanced reliability with self-serve model
The service routing APIs use per-protocol, per-route resources that can be configured and owned by independent service owners. This approach has several advantages.
- Service owners have autonomy over how they want to configure policies and traffic management for the services they own.
- Updating one
Routeresource does not affect otherRouteresources in the mesh. The update process is more reliable because service owners manage small configurations. - The service owner who is responsible for the destination service or hostname owns each
Routeresource. - Service owners don't have to depend on mesh administrators to update routing.
Enable a service mesh spanning multiple projects in Shared VPC environments
The service routing API model lets service owners participate in a shared mesh infrastructure using Shared VPC and other means of connectivity while maintaining independent control over their services. For example, service owners can define the Route resources in their own projects. Platform administrators can define a Mesh in a centrally administered host project, then grant service owners IAM permissions to attach their Route resources to a shared Mesh or Gateway. The following diagram shows an example with Shared VPC.
The service routing APIs also let you have service mesh clients connected to different networks using VPC Network Peering.
Route traffic based on the server name indicator
The TLSRoute resource lets you route TLS-encrypted traffic based on the Server Name Indication (SNI) in the TLS handshake. You can configure TLS traffic to be routed to the appropriate backend services by configuring the SNI match in the TLSRoute resource. In these deployments, proxies only route traffic and the TLS session is terminated at the destination backend instance.
The TLSRoute resource is supported only with Envoy proxies that are deployed as sidecar proxies or gateways.
TLSRoute resource attached to a Mesh resource
The deployment shown in the following diagram routes any service mesh traffic where the SNI extension has the value service1 to the backend service service1. Additionally, any service mesh traffic where the SNI extension has the value service2 is routed to the backend service service2. The SNI extension value and the backend service name are independent of each other.
TLSRoute resource and Mesh resource (click to enlarge)TLSRoute resource attached to a Gateway resource
The deployment shown in the following diagram routes any inbound traffic to the Gateway resource where the SNI extension has the value serviceA to the backend service serviceA. Additionally, any inbound traffic to the Gateway where the SNI extension has the value serviceB is routed to the backend service serviceB. The SNI extension value and the backend service name are independent of each other. The SNI extension value and the header in HTTP requests are also independent.
The Gateway resource does not terminate the TLS connection at the Gateway's Envoy proxy. Instead, the TLS connection is terminated at the corresponding backend service. The Gateway cannot inspect any information encrypted in the TLS layer, other than seeing the ClientHello, which contains a plain text SNI extension. The Gateway performs TLS passthrough in this mode. Note that encrypted ClientHello is unsupported.
TLSRoute resource and Gateway resource (click to enlarge)Core gRPC support
You can configure proxyless gRPC clients by using core gRPC attributes such as matching by method.
Traffic splitting for TCP traffic
You can implement weight-based traffic splitting for TCP traffic across multiple backend services. You can configure patterns such as canary (blue- green) rollouts when you update your service. Traffic splitting also lets you migrate traffic in a controlled manner without downtime.
Traffic interception
When you use the service routing API Mesh and Gateway resources, all traffic is automatically intercepted. For more information, see Options for Compute Engine VM setup with automatic Envoy deployment.
Architecture and resources
This section describes the service routing API model and its resources, and helps you to understand how the service routing API resources work together.
Mesh resource
The Mesh resource represents an instance of a service mesh. You use it to create a logical service mesh in your project. Each Mesh resource must have a unique name in the project. After a Mesh resource is created, its name cannot be modified.
MeshAPI resource with Envoy sidecar and proxyless gRPC deployments (click to enlarge)The Mesh resource is referenced in the Route resource to add routes for services that are part of the mesh.
Envoy proxy and proxyless gRPC clients receive configuration from Cloud Service Mesh by joining the service mesh identified by the Mesh resource's name. The Mesh resource supports the following data plane deployments:
- Envoy running alongside the application as sidecar proxies
- Proxyless gRPC clients
- Mix of Envoy sidecar and proxyless gRPC clients
Route resource
The Route resource is used to set up routing to the services. There are four different types of the Route API resource. They define the protocol used to route traffic to a backend service.
HTTPRouteGRPCRouteTCPRouteTLSRoute
The API doesn't contain a Route API verbatim. The only configurable API resources are HTTPRoute, GRPCRoute, TCPRoute, and TLSRoute.
The Route resource references one or more Mesh and Gateway resources to add the routes that are part of the corresponding Mesh or Gateway configuration. A Route resource can reference both Gateway and Mesh resources.
The Route resource also references one or more backend service resources. The services are configured using the backend service API. You create a backend service resource that points to one or more MIG or NEG backends.
The following diagram shows the relationships among the Mesh, Gateway, and Route resources and the backend service resource and its backends.
Route API resources (click to enlarge)You define other traffic management capabilities, such as routing, header modifications, timeouts, and weight-based traffic splitting in Route resources. For example, in the following diagram, an HTTPRoute resource defines a 70% / 30% traffic split between two backend services.
To simplify administration of your service mesh, you can list all Route resources attached to a Mesh or Gateway resource.
TLSRoute resource
Use the TLSRoute resource to route TLS traffic to backend services based on SNI hostnames and Application-Layer Protocol Negotiation (ALPN) name. TLSRoute configuration implies TLS passthrough, in which the Envoy proxy does not terminate TLS traffic.
The TLSRoute resource references one or more Mesh and Gateway resources to add the routes that are part of the corresponding Mesh or Gateway configuration.
The TLSRoute resource also references one or more backend service resources. The services are configured using the backend service API resource.
Gateway resource
The Gateway resource is used to represent Envoy proxies acting as ingress gateways, allowing external clients to connect to the service mesh (north-south traffic). This resource has listening ports along with a scope parameter. The Envoy proxy that acts as an ingress gateway binds to the ports specified and to 0.0.0.0, which represents all of the IP addresses on the local VM. The following diagram shows Envoy proxies deployed as an ingress service and configured by the Gateway resource. In this particular example, Envoy proxies are configured to listen on port 80 for incoming connections from clients.
The Gateway API resource only supports the Envoy proxy data plane. It does not support proxyless gRPC. gRPCRoutes are supported in the Gateway resource, but the gRPC traffic is routed by the Envoy proxy, acting as a middle-proxy.
Gateway resource (click to enlarge)Gateway resource (click to enlarge)What are a Gateway scope and merged Gateway configuration?
A Gateway resource instance represents the ports and configuration specific to traffic received on those ports. The Gateway API resource has a parameter, scope, that is used to logically group and merge the configuration of two or more Gateway resources.
For example, If you want the Gateway proxies to listen on ports 80 and 443 to receive HTTP and HTTPS traffic respectively, you create two Gateway resources. Configure one Gateway resource with port 80, for HTTP traffic, and the other with 443, for HTTPS traffic. Give the scope field in each the same value. Cloud Service Mesh dynamically merges the individual configurations of all Gateways that have the same scope. On the data plane side, the Envoy proxies that run in the ingress gateway mode must also present the same scope parameter to Cloud Service Mesh to receive the Gateway configuration. Note that you specify the scope when you create the Gateway resource, and you specify the same scope as the bootstrap parameter for the proxies.
Gateway resource merge behavior (click to enlarge)The following are key considerations for the Gateway resource:
- The
Gatewayscope parameter is mandatory. Specify the scope in theGatewayresource and in the bootstrap configuration of the Envoy proxies even when only oneGatewayexists. - Creating a
Gatewayresource does not deploy a service with an Envoy proxy. Deploying the Envoy proxy is a separate step. - The
Gatewayresource has atypethat represents the type of ingress deployment. This field is reserved for future use. The only currently-supported value isOPEN_MESH, which is the default value and which cannot be modified.
Mesh deployments with mixed protocols and data planes
You can have a mixed data plane deployment, with Envoy proxy and proxyless gRPC in the same mesh. When you create such deployments, consider the following.
- Envoy sidecar deployments support all Routes (
HTTPRoute,GRPCRoute,TCPRoute, andTLSRoute). - Proxyless gRPC deployments only support
GRPCRoute. GRPCRouteis limited to features supported only by gRPC proxyless deployments.
Supported topologies in multi-project Shared VPC environments
Cloud Service Mesh supports adding Route resources that are defined in other projects to a Mesh or Gateway resource defined in a centralized administration project. Authorized service owners can directly add their service routing configurations to the Mesh or Gateway.
Mesh and Route resources (click to enlarge)In a typical cross-project scenario, you choose a project (host project or centrally controlled administration project) as the mesh administration project where you create a Mesh resource. The mesh administration project owner authorizes Route resources from other projects to reference the Mesh resource, allowing the routing configuration from other projects to be part of the mesh. A mesh data plane, whether Envoy or gRPC, requests configuration from the administration project and receives a union of all of the routes attached to the Mesh. For a Gateway, the routes are also merged across all Gateways that use the same scope.
The Mesh administration project can be any project that you choose, and the configuration works as long as the underlying projects have VPC network connectivity, either through Shared VPC or VPC Network Peering.
IAM permissions and roles
The following are the IAM permissions that are required to securely get, create, update, delete, list, and use the Mesh and Route resources.
- Mesh administrators need to have
networkservices.mesh.*permissions. - Gateway administrators need to have
networkservices.gateways.*permissions. - Service owners need to have
networkservices.grpcRoutes.*,networkservices.httpRoutes.*, ornetworkservices.tcpRoutes.*permissions.
Mesh administrators need to grant the networkservices.mesh.use permission to service owners so that the service owners can attach their Route resources to the Mesh resource. The same model applies to Gateway resources.
To see all IAM permissions for Mesh resources, go to the IAM permissions reference page and search for meshes.
There are no additional predefined roles required. The existing, predefined role Compute Network Admin (roles/compute.networkAdmin) has networkservices.* permissions by default. You might need to add the previously described permissions to your custom roles.
Considerations and limitations
- The Google Cloud console does not support the service routing APIs.
- Use the xDS API version 3 or later.
- Minimum Envoy version of 1.20.0 (since the service routing APIs are supported only on xDS version 3)
- Minimum gRPC bootstrap generator version of v0.14.0
- The
TLSRouteresource is supported only with Envoy proxies that are deployed as sidecar proxies or gateways. - Only Compute Engine VMs with automatic Envoy deployment and GKE Pods with automatic Envoy injection are supported. You cannot use manual deployment with the service routing APIs.
- The service routing APIs are not backward compatible with the older APIs.
- When a
TCPRouteresource is attached to aMeshresource, the port used to match TCP traffic cannot be used to serve anything except the traffic described by thisTCPRoute.- For example, your deployments might include a
TCPRouteresource that matches port "8000" and an HttpRoute resource. When both are attached to the sameMeshresource, traffic routed by theHTTPRouteresource cannot use port 8000 even when the underlying IP addresses are different. This limitation comes from Envoy proxy implementation, which assigns precedence to the port-matched route first.
- For example, your deployments might include a
- The
Gatewayresource does not provision a managed load balancer and it does not dynamically create an Envoy service. - Automatically deployed Envoys serving as ingress gateways must not have the
serving_portsargument to the--service-proxyflag. - Automatically deployed Envoy does not support providing a project number different from the project of the VM.
What's next
- For information about listing route resources associated with a
MeshorGatewayresource, see ListRouteresources. - For information about the service routing APIs, read the documentation for the network services APIs.