# GEP-3139: GRPCRoute Timeouts

* Issue: [#3139](https://github.com/kubernetes-sigs/gateway-api/issues/3139)
* Status: Implementable

(See status definitions [here](/geps/overview/#gep-states).)
## TLDR

Similar to HTTPRoute Timeouts (GEP-1742), the goal of this GEP is to create a design for implementing GRPCRoute timeouts.
## Goals

@arkodg (the original requester of this experimental feature) listed the following in the discussion, which is a good starting point for the GRPCRoute timeouts API:

- The ability to set a request timeout for unary RPCs
- The ability to disable timeouts (set to `0s`) for streaming RPCs
## Non-Goals

Creating a design for bidirectional streaming. Although this would be very useful, I propose that we leave it for a further iteration and use this GEP only to lay the groundwork for that discussion. Furthermore, we should look into streaming for HTTP and update GEP-1742 as well.
## Introduction

This GEP intends to find common timeouts that we can build into the Gateway API for GRPCRoute.

gRPC has the following four cases:
- Unary (single request, single response)
- Client streaming (the client sends a stream of messages, the server replies with a single response)
- Server streaming (the client sends a single request, the server replies with a stream)
- Bidirectional streaming

For this initial design, however, we'll focus on unary connections, and provide room for discussion on defining streaming semantics for HTTP, gRPC, etc. in a future iteration.
Most implementations have a proxy for gRPC, as listed in the table here. From the table, implementations rely on Envoy, Nginx, F5 BIG-IP, Pipy, HAProxy, LiteSpeed, or Traefik as the proxy in their dataplane.
For the sake of brevity, the flow of timeouts is shown in a generic flow diagram (the same diagram as in [GEP-1742](https://gateway-api.sigs.k8s.io/geps/gep-1742/#flow-diagrams-with-available-timeouts)):
```mermaid
sequenceDiagram
    participant C as Client
    participant P as Proxy
    participant U as Upstream
    C->>P: Connection Started
    C->>P: Starts sending Request
    C->>P: Finishes Headers
    C->>P: Finishes request
    P->>U: Connection Started
    P->>U: Starts sending Request
    P->>U: Finishes Headers
    P->>U: Finishes request
    U->>P: Starts Response
    U->>P: Finishes Headers
    U->>P: Finishes Response
    P->>C: Starts Response
    P->>C: Finishes Headers
    P->>C: Finishes Response
    Note right of P: Repeat if connection sharing
    U->>C: Connection ended
```

Some differences from HTTPRoute timeouts:

As noted by [@gnossen](https://github.com/kubernetes-sigs/gateway-api/discussions/3103#discussioncomment-9732739), a naive request-timeout implementation never fires for a bidirectional stream: the timer only starts once the request stream is finished (half-closed), and a bidirectional stream never reaches that state, so the timer is never started. Envoy uses the `grpc_timeout_header_max` config in order to start the timer when the first request message is initiated.
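As an illustration of that Envoy knob, a minimal route sketch might look like the following (the route match and the cluster name `grpc-backend` are placeholders, not part of this GEP; see Envoy's `RouteAction` documentation for the authoritative shape):

```yaml
routes:
- match:
    prefix: "/"
  route:
    cluster: grpc-backend   # placeholder cluster name
    max_stream_duration:
      # Caps the timeout taken from the client's grpc-timeout header and
      # times the whole stream rather than starting at request half-close.
      grpc_timeout_header_max: 10s
```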

Nginx uses `grpc_<>_timeout` directives to configure gRPC timeouts, which apply between the proxy and the upstream (`grpc_connect_timeout`, `grpc_send_timeout`, `grpc_read_timeout`).
## API

The majority of the proxy implementations used in dataplanes have some way to configure gRPC timeouts.
### Timeout Values

To remain consistent with HTTPRoute's timeouts, there will be the same `timeouts.request` and `timeouts.backendRequest` fields that can be configured. There is also a `timeouts.streamingRequest` field to capture the ability to disable timeouts for streaming RPCs.

Unary RPC
Remaining consistent with HTTPRoute's timeout values:
- `timeouts.request`
  The timeout for the Gateway API implementation to send a response to a client's gRPC request. The timer should start when the connection is started, since this will ideally align with the streaming option. This field is optional, with Extended support.
- `timeouts.backendRequest`
  The timeout for a single request from the gateway to the upstream. This field is optional, with Extended support.

Disabling timeouts for streaming RPC:
- `timeouts.streamingRequest`
  The timeout value for streaming. Currently, only the value `0s` will be allowed, but the field is left as a string to allow for future work around bidirectional streaming timers. This field is optional, with Extended support.

GO
```go
type GRPCRouteRule struct {
	// Timeouts defines the timeouts that can be configured for a GRPC request.
	//
	// Support: Extended
	//
	// +optional
	// <gateway:experimental>
	Timeouts *GRPCRouteTimeouts `json:"timeouts,omitempty"`

	// ...
}

// GRPCRouteTimeouts defines timeouts that can be configured for a GRPCRoute.
// Timeout values are represented with Gateway API Duration formatting.
// Specifying a zero value such as "0s" is interpreted as no timeout.
//
// +kubebuilder:validation:XValidation:message="backendRequest timeout cannot be longer than request timeout",rule="!(has(self.request) && has(self.backendRequest) && duration(self.request) != duration('0s') && duration(self.backendRequest) > duration(self.request))"
type GRPCRouteTimeouts struct {
	// Request specifies the maximum duration for a gateway to respond to a GRPC request.
	//
	// If the gateway has not been able to respond before this deadline is met, the gateway
	// MUST return a timeout error.
	//
	// For example, setting the `rules.timeouts.request` field to the value `10s` in a
	// `GRPCRoute` will cause a timeout if a client request is taking longer than 10 seconds
	// to complete.
	//
	// This timeout is intended to cover as close to the whole request-response transaction
	// as possible, although an implementation MAY choose to start the timeout after the entire
	// request stream has been received instead of immediately after the transaction is
	// initiated by the client.
	//
	// When this field is unspecified, request timeout behavior is implementation-specific.
	//
	// Support: Extended
	//
	// +optional
	Request *Duration `json:"request,omitempty"`

	// BackendRequest specifies a timeout for an individual request from the gateway
	// to a backend. This covers the time from when the request first starts being
	// sent from the gateway to when the full response has been received from the backend.
	//
	// An entire client GRPC transaction with a gateway, covered by the Request timeout,
	// may result in more than one call from the gateway to the destination backend,
	// for example, if automatic retries are supported.
	//
	// Because the Request timeout encompasses the BackendRequest timeout, the value of
	// BackendRequest must be <= the value of the Request timeout.
	//
	// Support: Extended
	//
	// +optional
	BackendRequest *Duration `json:"backendRequest,omitempty"`

	// StreamingRequest specifies the ability to disable timeouts for bidirectional
	// streaming. The only supported setting is `0s`, so users can disable timeouts
	// for streaming.
	//
	// Support: Extended
	//
	// +optional
	StreamingRequest *Duration `json:"streamingRequest,omitempty"`
}

// Duration is a string value representing a duration in time. The format is as specified
// in GEP-2257, a strict subset of the syntax parsed by Golang time.ParseDuration.
//
// +kubebuilder:validation:Pattern=`^([0-9]{1,5}(h|m|s|ms)){1,4}$`
type Duration string
```
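As a quick illustration of the `+kubebuilder:validation:Pattern` above, the GEP-2257 duration format can be checked with the same regular expression in plain Go (the sample values are ours):

```go
package main

import (
	"fmt"
	"regexp"
)

// gep2257 is the same pattern used in the kubebuilder validation above:
// one to four groups, each of one to five digits followed by h, m, s, or ms.
var gep2257 = regexp.MustCompile(`^([0-9]{1,5}(h|m|s|ms)){1,4}$`)

func main() {
	for _, v := range []string{"0s", "10s", "1h30m", "2s500ms", "10", "-5s"} {
		fmt.Printf("%q valid=%v\n", v, gep2257.MatchString(v))
	}
}
```

Note that `"0s"` is valid by the pattern; its "no timeout" meaning is a semantic rule layered on top, not part of the format itself.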
YAML
```yaml
apiVersion: gateway.networking.k8s.io/v1beta1
kind: GRPCRoute
metadata:
  name: timeout-example
spec:
  ...
  rules:
  - backendRefs:
    - name: some-service
      port: 8080
    timeouts:
      request: 10s
      backendRequest: 2s
      streamingRequest: 0s
```
## Conformance Details

The feature name for this feature is `GRPCRouteTimeout`, and its support is Extended.
Gateway implementations can indicate support for this feature using the following:
- `GRPCRouteRequestTimeout`
- `GRPCRouteRequestBackendTimeout`
- `GRPCRouteStreamingRequestTimeout`

## Alternatives

## References
apiVersion: internal.gateway.networking.k8s.io/v1alpha1
kind: GEPDetails
number: 3139
name: GRPCRoute Timeouts
status: Implementable
authors:
  - xtine
relationships:
  extendedBy:
    - number: 2257
      name: Gateway API Duration Format
      description: Adds a duration format for use in timeouts.