[2/n] [Serve] Refactor replica rank to prepare for node local ranks #58473

abrarsheikh · 2025-11-08T06:01:16Z

Summary

This PR refactors the replica rank system to support multi-dimensional ranking (global, node-level, and local ranks) in preparation for node-local rank tracking. The ReplicaRank object now contains three fields instead of being a simple integer, enabling better coordination of replicas across nodes.

Motivation

Currently, Ray Serve only tracks a single global rank per replica. For advanced use cases like tensor parallelism, model sharding across nodes, and node-aware coordination, we need to track:

Global rank: Replica's rank across all nodes (0 to N-1)
Node rank: Which node the replica is on (0 to M-1)
Local rank: Replica's rank on its specific node (0 to K-1)

This PR lays the groundwork by introducing the expanded ReplicaRank schema while maintaining backward compatibility in feature.

Changes

Core Implementation

schema.py: Extended ReplicaRank to include node_rank and local_rank fields (currently set to -1 as placeholders)
replica.py: Updated replica actors to handle ReplicaRank objects
context.py: Changed ReplicaContext.rank type from Optional[int] to ReplicaRank

Current Behavior

node_rank and local_rank are set to -1 (placeholder values). Will change in future
Global rank assignment and management works as before
All existing functionality is preserved

Breaking Changes

Rank is changing from int to ReplicaRank

Signed-off-by: abrar <abrar@anyscale.com>

[Serve] Refactor replica rank to prepare for node local ranks

0f7ae3c

Signed-off-by: abrar <abrar@anyscale.com>

abrarsheikh added the go add ONLY when ready to merge, run all tests label Nov 8, 2025

abrarsheikh mentioned this pull request Nov 8, 2025

[1/n] [Serve] Refactor replica rank to prepare for node local ranks #58471

Draft

abrarsheikh changed the title ~~[Serve] Refactor replica rank to prepare for node local ranks~~ [2/n] [Serve] Refactor replica rank to prepare for node local ranks Nov 8, 2025

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

[2/n] [Serve] Refactor replica rank to prepare for node local ranks #58473

[2/n] [Serve] Refactor replica rank to prepare for node local ranks #58473

abrarsheikh commented Nov 8, 2025 •

edited

Loading

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

[2/n] [Serve] Refactor replica rank to prepare for node local ranks #58473

Are you sure you want to change the base?

[2/n] [Serve] Refactor replica rank to prepare for node local ranks #58473

Conversation

abrarsheikh commented Nov 8, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Summary

Motivation

Changes

Core Implementation

Current Behavior

Breaking Changes

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

abrarsheikh commented Nov 8, 2025 •

edited

Loading