mongodb
diff --git a/source/connection-monitoring-and-pooling/connection-monitoring-and-pooling.md b/source/connection-monitoring-and-pooling/connection-monitoring-and-pooling.md
Lines changed: 11 additions & 0 deletions
diff --git a/source/connection-monitoring-and-pooling/tests/cmap-format/pool-create-min-size-error.json b/source/connection-monitoring-and-pooling/tests/cmap-format/pool-create-min-size-error.json
Lines changed: 1 addition & 1 deletion
diff --git a/source/connection-monitoring-and-pooling/tests/cmap-format/pool-create-min-size-error.yml b/source/connection-monitoring-and-pooling/tests/cmap-format/pool-create-min-size-error.yml
Lines changed: 1 addition & 1 deletion
diff --git a/source/load-balancers/tests/sdam-error-handling.json b/source/load-balancers/tests/sdam-error-handling.json
Lines changed: 2 additions & 2 deletions
diff --git a/source/load-balancers/tests/sdam-error-handling.yml b/source/load-balancers/tests/sdam-error-handling.yml
Lines changed: 2 additions & 2 deletions
diff --git a/source/logging/logging.md b/source/logging/logging.md
Lines changed: 1 addition & 1 deletion
diff --git a/source/server-discovery-and-monitoring/server-discovery-and-monitoring-tests.md b/source/server-discovery-and-monitoring/server-discovery-and-monitoring-tests.md
Lines changed: 37 additions & 0 deletions
diff --git a/source/server-discovery-and-monitoring/server-discovery-and-monitoring.md b/source/server-discovery-and-monitoring/server-discovery-and-monitoring.md
Lines changed: 33 additions & 23 deletions
diff --git a/source/server-discovery-and-monitoring/server-monitoring.md b/source/server-discovery-and-monitoring/server-monitoring.md
Lines changed: 8 additions & 1 deletion
diff --git a/source/server-discovery-and-monitoring/tests/unified/backpressure-network-error-fail.json b/source/server-discovery-and-monitoring/tests/unified/backpressure-network-error-fail.json
Lines changed: 140 additions & 0 deletions
@@ -284,6 +284,14 @@ Endpoint. The pool has the following properties:
 - **Rate-limited:** A Pool MUST limit the number of [Connections](#connection) being
     [established](#establishing-a-connection-internal-implementation) concurrently via the **maxConnecting**
     [pool option](#connection-pool-options).
+- **Backpressure-enabled:** The pool MUST add the error labels `SystemOverloadedError` and `RetryableError` to network
+    errors or network timeouts it encounters during connection establishment or the `hello` message. These labels are
+    used by the
+    [SDAM error handling](../server-discovery-and-monitoring/server-discovery-and-monitoring.md#error-handling-pseudocode)
+    to avoid clearing the pool. The pool MUST NOT add the backpressure error labels during an authentication step
+    after the `hello` message. For errors that the driver can identify as never caused by server overload, such as DNS
+    lookup failures, TLS-related errors, or errors encountered while connecting to a SOCKS5 proxy, the driver MUST
+    clear the connection pool and MUST mark the server Unknown.
 
 ```typescript
 interface ConnectionPool {
@@ -461,6 +469,7 @@ try:
   return connection
 except error:
   close connection
+  add `SystemOverloadedError` and `RetryableError` labels if appropriate (see "backpressure-enabled" in [Connection Pool](#connection-pool))
   throw error # Propagate error in manner idiomatic to language.
 ```
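As a rough illustration of the labeling rule in the backpressure-enabled bullet, the following Python sketch decides which labels a pool might attach to an error raised while establishing a connection. The `Phase` and `Cause` enums and the `backpressure_labels` function are hypothetical names introduced only for this example; they are not part of the specification or of any driver API.

```python
from enum import Enum, auto


class Phase(Enum):
    """Hypothetical phases of connection establishment."""
    CONNECT = auto()  # socket connect (and TLS, if enabled)
    HELLO = auto()    # hello or legacy hello exchange
    AUTH = auto()     # authentication step after hello


class Cause(Enum):
    """Hypothetical classification of the failure."""
    NETWORK_ERROR = auto()
    NETWORK_TIMEOUT = auto()
    DNS_FAILURE = auto()
    TLS_ERROR = auto()
    SOCKS5_ERROR = auto()


# Causes the bullet lists as never being due to server overload.
NEVER_OVERLOAD = {Cause.DNS_FAILURE, Cause.TLS_ERROR, Cause.SOCKS5_ERROR}


def backpressure_labels(phase: Phase, cause: Cause) -> set[str]:
    """Return the error labels to add, or an empty set for none."""
    if phase is Phase.AUTH:
        return set()  # labels are never added during authentication
    if cause in NEVER_OVERLOAD:
        return set()  # pool is cleared and server marked Unknown instead
    return {"SystemOverloadedError", "RetryableError"}
```

Under these assumptions, a timeout during `hello` receives both labels, while the same timeout during authentication, or a DNS failure at any point, receives none.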
 
@@ -1375,6 +1384,8 @@ to close and remove from its pool a [Connection](#connection) which has unread e
 
 ## Changelog
 
+- 2025-11-21: Add handling of backpressure error labels.
+
 - 2025-01-22: Clarify durationMS in logs may be Int32/Int64/Double.
 
 - 2024-11-27: Relaxed the WaitQueue fairness requirement.
 
@@ -11,7 +11,7 @@ failPoint:
   mode: { times: 50 }
   data:
     failCommands: ["isMaster","hello"]
-    closeConnection: true
+    errorCode: 91
     appName: "poolCreateMinSizeErrorTest"
 poolOptions:
   minPoolSize: 1
 
@@ -153,14 +153,14 @@ tests:
             mode: { times: 1 }
             data:
               failCommands: [isMaster, hello]
-              closeConnection: true
+              errorCode: 11600
               appName: *singleClientAppName
       - name: insertOne
         object: *singleColl
         arguments:
           document: { x: 1 }
         expectError:
-          isClientError: true
+          isError: true
     expectEvents:
       - client: *singleClient
         eventType: cmap
 
@@ -95,7 +95,7 @@ Drivers MUST support configuring where log messages should be output, including
     > - If the value is "stdout" (case-insensitive), log to stdout.
     > - If the value is "stderr" (case-insensitive), log to stderr.
     > - Else, if direct logging to files is supported, log to a file at the specified path. If the file already exists, it
-    >   MUST be appended to.
+    >     MUST be appended to.
     >
     > If the variable is not provided or is set to an invalid value (which could be invalid for any reason, e.g. the path
     > does not exist or is not writeable), the driver MUST log to stderr and the driver MAY attempt to warn the user about
 
@@ -172,3 +172,40 @@ This test requires failCommand appName support which is only available in MongoD
 5. Then verify that a ServerHeartbeatSucceededEvent and a ConnectionPoolReadyEvent (CMAP) are emitted.
 
 6. Disable the failpoint.
+
+## Connection Pool Backpressure
+
+This test ensures that connection establishment failures during the TLS handshake do not result in a pool clear
+event. We create a setup client to enable the ingress connection establishment rate limiter, and then induce a
+connection storm. After the storm, we verify that some of the connections failed to check out, but that the pool was
+not cleared.
+
+This test requires MongoDB 7.0+.
+
+1. Create a test client that listens to CMAP events, with maxConnecting=100. The higher maxConnecting will help ensure
+    contention for creating connections.
+
+2. Run the following commands to set up the rate limiter.
+
+    ```python
+    client.admin.command("setParameter", 1, ingressConnectionEstablishmentRateLimiterEnabled=True)
+    client.admin.command("setParameter", 1, ingressConnectionEstablishmentRatePerSec=20)
+    client.admin.command("setParameter", 1, ingressConnectionEstablishmentBurstCapacitySecs=1)
+    client.admin.command("setParameter", 1, ingressConnectionEstablishmentMaxQueueDepth=1)
+    ```
+
+3. Add a document to the test collection so that the sleep operations will actually block:
+    `client.test.test.insert_one({})`.
+
+4. Run the following find command on the collection in 100 parallel threads/coroutines. Run these commands concurrently
+    but block on their completion, and ignore errors raised by the command.
+    `client.test.test.find_one({"$where": "function() { sleep(2000); return true; }"})`
+
+5. Assert that at least 10 `ConnectionCheckOutFailedEvent` occurred.
+
+6. Assert that 0 `PoolClearedEvent` occurred.
+
+7. Sleep for 1 second to clear the rate limiter.
+
+8. Ensure that the following command runs at test teardown even if the test fails.
+    `client.admin.command("setParameter", 1, ingressConnectionEstablishmentRateLimiterEnabled=False)`.
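Steps 4 through 6 of the test above could be sketched in Python roughly as follows. The helper names `run_storm` and `check_events` are hypothetical. The sketch assumes that `operation` wraps the `find_one` call from step 4 and that `event_names` is a list of event class names collected by a CMAP listener.

```python
from concurrent.futures import ThreadPoolExecutor


def run_storm(operation, workers=100):
    """Run `operation` in `workers` threads, wait for all, and ignore errors.

    Returns the number of calls that raised an exception.
    """
    def attempt(_):
        try:
            operation()
            return False
        except Exception:
            return True  # errors are expected during the storm

    # Leaving the `with` block waits for every submitted call to finish.
    with ThreadPoolExecutor(max_workers=workers) as pool:
        return sum(pool.map(attempt, range(workers)))


def check_events(event_names):
    """Apply the assertions from steps 5 and 6 to collected CMAP event names."""
    failed = event_names.count("ConnectionCheckOutFailedEvent")
    cleared = event_names.count("PoolClearedEvent")
    assert failed >= 10, f"expected at least 10 checkout failures, got {failed}"
    assert cleared == 0, f"expected no pool clears, got {cleared}"
```

A real test would still need the rate limiter setup from step 2 and the teardown from step 8, ideally in a `try`/`finally` so the limiter is disabled even when an assertion fails.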
@@ -434,18 +434,18 @@ correspond to [replica set member states](https://www.mongodb.com/docs/manual/re
 some replica set member states like STARTUP and RECOVERING are identical from the client's perspective, so they are
 merged into "RSOther". Additionally, states like Standalone and Mongos are not replica set member states at all.
 
-| State           | Symptoms                                                                                                                  |
-| --------------- | ------------------------------------------------------------------------------------------------------------------------- |
-| Unknown         | Initial, or after a network error or failed hello or legacy hello call, or "ok: 1" not in hello or legacy hello response. |
-| Standalone      | No "msg: isdbgrid", no setName, and no "isreplicaset: true".                                                              |
-| Mongos          | "msg: isdbgrid".                                                                                                          |
-| PossiblePrimary | Not yet checked, but another member thinks it is the primary.                                                             |
-| RSPrimary       | "isWritablePrimary: true" or "ismaster: true", "setName" in response.                                                     |
-| RSSecondary     | "secondary: true", "setName" in response.                                                                                 |
-| RSArbiter       | "arbiterOnly: true", "setName" in response.                                                                               |
-| RSOther         | "setName" in response, "hidden: true" or not primary, secondary, nor arbiter.                                             |
-| RSGhost         | "isreplicaset: true" in response.                                                                                         |
-| LoadBalanced    | "loadBalanced=true" in URI.                                                                                               |
+| State           | Symptoms                                                                                                 |
+| --------------- | -------------------------------------------------------------------------------------------------------- |
+| Unknown         | Initial, or after a failed hello or legacy hello call, or "ok: 1" not in hello or legacy hello response. |
+| Standalone      | No "msg: isdbgrid", no setName, and no "isreplicaset: true".                                             |
+| Mongos          | "msg: isdbgrid".                                                                                         |
+| PossiblePrimary | Not yet checked, but another member thinks it is the primary.                                            |
+| RSPrimary       | "isWritablePrimary: true" or "ismaster: true", "setName" in response.                                    |
+| RSSecondary     | "secondary: true", "setName" in response.                                                                |
+| RSArbiter       | "arbiterOnly: true", "setName" in response.                                                              |
+| RSOther         | "setName" in response, "hidden: true" or not primary, secondary, nor arbiter.                            |
+| RSGhost         | "isreplicaset: true" in response.                                                                        |
+| LoadBalanced    | "loadBalanced=true" in URI.                                                                              |
 
 A server can transition from any state to any other. For example, an administrator could shut down a secondary and bring
 up a mongos in its place.
@@ -1055,7 +1055,10 @@ def handleError(error):
                 # next full scan.
                 if isNotWritablePrimary(error):
                     check failing server
-        elif isNetworkError(error) or (not error.completedHandshake and (isNetworkTimeout(error) or isAuthError(error))):
+        elif isNetworkError(error) or (not error.completedHandshake):
+            # Ignore errors that have a backpressure error label applied.
+            if error.hasLabel("SystemOverloadedError"):
+                return
             if type != LoadBalanced
               # Mark the server Unknown
               unknown = new ServerDescription(type=Unknown, error=error)
@@ -1139,16 +1142,20 @@ errors, network timeout errors, state change errors, and authentication errors.
 
 ##### Network error when reading or writing
 
-To describe how the client responds to network errors during application operations, we distinguish two phases of
+To describe how the client responds to network errors during application operations, we distinguish three phases of
 connecting to a server and using it for application operations:
 
-- *Before the handshake completes*: the client establishes a new connection to the server and completes an initial
-    handshake by calling "hello" or legacy hello and reading the response, and optionally completing authentication
+- *Connection establishment and hello*: the client establishes a new connection to the server and completes an initial
+    handshake by calling "hello" or legacy hello and reading the response
+- *Authentication step*: the client optionally completes an authentication step
 - *After the handshake completes*: the client uses the established connection for application operations
 
-If there is a network error or timeout on the connection before the handshake completes, the client MUST replace the
-server's description with a default ServerDescription of type Unknown when the TopologyType is not LoadBalanced, and
-fill the ServerDescription's error field with useful information.
+If there is a network error or timeout during connection establishment or the hello, the client MUST NOT change the
+server's description.
+
+If there is a network error or timeout during the authentication step, the client MUST replace the server's description
+with a default ServerDescription of type Unknown when the TopologyType is not LoadBalanced, and fill the
+ServerDescription's error field with useful information.
 
 If there is a network error or timeout on the connection before the handshake completes, and the TopologyType is
 LoadBalanced, the client MUST keep the ServerDescription as LoadBalancer.
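The branch of the error handling that these paragraphs describe can be summarized in a small Python sketch. The function `sdam_action` and its string return values are hypothetical and exist only for illustration. The sketch assumes the pool has already attached any backpressure labels, and it does not model pool clearing or cancellation of in-progress checks.

```python
def sdam_action(labels, is_network_error, completed_handshake, load_balanced):
    """Return a hypothetical action name for a connection error.

    labels: set of error label strings attached to the error.
    """
    if not is_network_error and completed_handshake:
        return "not_covered"  # other error kinds are handled elsewhere
    if "SystemOverloadedError" in labels:
        return "no_change"  # backpressure: leave the ServerDescription alone
    if load_balanced:
        return "keep_load_balancer"  # description stays LoadBalancer
    return "mark_unknown"  # replace description with type Unknown
```

For example, an unlabeled network error during authentication on a replica set member maps to `"mark_unknown"`, while a labeled timeout during `hello` maps to `"no_change"`.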
@@ -1253,11 +1260,12 @@ if and only if the error is "node is shutting down" or the error originated from
 and [other transient errors](#other-transient-errors) and
 [Why close connections when a node is shutting down?](#why-close-connections-when-a-node-is-shutting-down).)
 
-##### Authentication and Handshake errors
+##### MongoDB Handshake errors
 
-If the driver encounters errors when establishing application connections (this includes the initial handshake and
-authentication), the driver MUST mark the server Unknown and clear the server's connection pool if the TopologyType is
-not LoadBalanced. (See [Why mark a server Unknown after an auth error?](#why-mark-a-server-unknown-after-an-auth-error))
+If the driver encounters errors that do not have the backpressure error label (`SystemOverloadedError`) applied when
+establishing application connections (this includes the initial handshake and authentication), the driver MUST mark the
+server Unknown and clear the server's connection pool if the TopologyType is not LoadBalanced. (See
+[Why mark a server Unknown after an auth error?](#why-mark-a-server-unknown-after-an-auth-error))
 
 ### Monitoring SDAM events
 
@@ -2027,6 +2035,8 @@ oversaw the specification process.
 - 2025-01-22: Add error messages when a new primary is elected or a primary with a stale electionId or setVersion is
     discovered.
 
+- 2025-11-21: Add handling of backpressure error labels.
+
 ______________________________________________________________________
 
 [^1]: "localThresholdMS" was called "secondaryAcceptableLatencyMS" in the Read Preferences Spec, before it was superseded
 
@@ -163,7 +163,8 @@ MUST be used to satisfy the check and update the topology.
 When a client successfully calls hello or legacy hello to handshake a new connection for application operations, it
 SHOULD use the hello or legacy hello reply to update the ServerDescription and TopologyDescription, the same as with a
 hello or legacy hello reply on a monitoring socket. If the hello or legacy hello call fails, the client SHOULD mark the
-server Unknown and update its TopologyDescription, the same as a failed server check on monitoring socket.
+server Unknown and update its TopologyDescription, the same as a failed server check on monitoring socket, unless the
+connection pool has added the `SystemOverloadedError` label to the error.
 
 ##### Clients use the streaming protocol when supported
 
@@ -254,6 +255,12 @@ default lastUpdateTime "infinity ago", so it scans them in random order. This ra
 many clients start at once. A client's subsequent scans of the mongoses are always in the same order, since their
 lastUpdateTimes are always in the same order by the time a scan ends.
 
+##### Handling of backpressure labels
+
+Because the scan may occur on an authenticated connection in single-threaded monitors, the server may apply backpressure
+by failing the command with a `SystemOverloadedError` label. The driver MUST NOT close the connection when this label is
+encountered.
+
 #### minHeartbeatFrequencyMS
 
 If a client frequently rechecks a server, it MUST wait at least minHeartbeatFrequencyMS milliseconds since the previous