Skip to content

Conversation

@AlonKellner-RedHat
Copy link
Collaborator

@AlonKellner-RedHat AlonKellner-RedHat commented Oct 29, 2025

Summary

This PR adds over-saturation stopping to the GuideLLM CLI.
It's based on the OSD (Over-Saturation Detection) algorithm we developed and evaluated at Jounce.
Use --stop-over-saturated or --stop-osd to enable.

Details

This PR adds:

  • Over-saturation stopping (--stop-over-saturated)
  • Comprehensive OSD unit tests

Test Plan


  • "I certify that all code in this PR is my own, except as noted below."

Use of AI

  • Includes AI-assisted code completion
  • Includes code generated by an AI application
  • Includes AI-generated tests (NOTE: AI written tests should have a docstring that includes ## WRITTEN BY AI ##)

@AlonKellner-RedHat AlonKellner-RedHat mentioned this pull request Oct 30, 2025
9 tasks
@AlonKellner-RedHat AlonKellner-RedHat force-pushed the feat/over-saturation-stopping branch 3 times, most recently from 65bf98c to 5b4dcca Compare November 6, 2025 11:02
AlonKellner-RedHat and others added 24 commits November 6, 2025 11:07
Signed-off-by: Alon Kellner <akellner@redhat.com>
Signed-off-by: Alon Kellner <akellner@redhat.com>
Signed-off-by: Alon Kellner <akellner@redhat.com>
Signed-off-by: Alon Kellner <akellner@redhat.com>
Signed-off-by: Alon Kellner <akellner@redhat.com>
Signed-off-by: Samuel Monson <smonson@redhat.com>
Signed-off-by: Alon Kellner <akellner@redhat.com>
Signed-off-by: Samuel Monson <smonson@redhat.com>
Signed-off-by: Alon Kellner <akellner@redhat.com>
Signed-off-by: Samuel Monson <smonson@redhat.com>
Signed-off-by: Alon Kellner <akellner@redhat.com>
Signed-off-by: Samuel Monson <smonson@redhat.com>
Signed-off-by: Alon Kellner <akellner@redhat.com>
Signed-off-by: Samuel Monson <smonson@redhat.com>
Signed-off-by: Alon Kellner <akellner@redhat.com>
Signed-off-by: Samuel Monson <smonson@redhat.com>
Signed-off-by: Alon Kellner <akellner@redhat.com>
Signed-off-by: Samuel Monson <smonson@redhat.com>
Signed-off-by: Alon Kellner <akellner@redhat.com>
Signed-off-by: Samuel Monson <smonson@redhat.com>
Signed-off-by: Alon Kellner <akellner@redhat.com>
Signed-off-by: Alon Kellner <akellner@redhat.com>
Signed-off-by: guangli.bao <guangli.bao@daocloud.io>
Signed-off-by: Alon Kellner <akellner@redhat.com>
Signed-off-by: guangli.bao <guangli.bao@daocloud.io>
Signed-off-by: Alon Kellner <akellner@redhat.com>
Signed-off-by: guangli.bao <guangli.bao@daocloud.io>
Signed-off-by: Alon Kellner <akellner@redhat.com>
Signed-off-by: guangli.bao <guangli.bao@daocloud.io>
Signed-off-by: Alon Kellner <akellner@redhat.com>
Signed-off-by: Alon Kellner <akellner@redhat.com>
Signed-off-by: Alon Kellner <akellner@redhat.com>
Signed-off-by: Alon Kellner <akellner@redhat.com>
Signed-off-by: Alon Kellner <akellner@redhat.com>
Signed-off-by: dalthecow <dalcowboiz@gmail.com>
Signed-off-by: Alon Kellner <akellner@redhat.com>
Signed-off-by: guangli.bao <guangli.bao@daocloud.io>
Signed-off-by: Alon Kellner <akellner@redhat.com>
tukwila and others added 9 commits November 6, 2025 11:07
Signed-off-by: guangli.bao <guangli.bao@daocloud.io>
Signed-off-by: Alon Kellner <akellner@redhat.com>
Signed-off-by: dalthecow <dalcowboiz@gmail.com>
Signed-off-by: Alon Kellner <akellner@redhat.com>
Signed-off-by: Alon Kellner <akellner@redhat.com>
Signed-off-by: Samuel Monson <smonson@redhat.com>
Signed-off-by: Alon Kellner <akellner@redhat.com>
Signed-off-by: Samuel Monson <smonson@redhat.com>
Signed-off-by: Alon Kellner <akellner@redhat.com>
Signed-off-by: guangli.bao <guangli.bao@daocloud.io>
Signed-off-by: Alon Kellner <akellner@redhat.com>
Signed-off-by: guangli.bao <guangli.bao@daocloud.io>
Signed-off-by: Alon Kellner <akellner@redhat.com>
Signed-off-by: Samuel Monson <smonson@redhat.com>
Signed-off-by: Alon Kellner <akellner@redhat.com>
Signed-off-by: Samuel Monson <smonson@redhat.com>
Signed-off-by: Alon Kellner <akellner@redhat.com>
@AlonKellner-RedHat AlonKellner-RedHat force-pushed the feat/over-saturation-stopping branch from 85cf65e to f996254 Compare November 6, 2025 11:11
sjmonson added a commit that referenced this pull request Nov 6, 2025
## Summary

E2E tests which check basic GuideLLM functionality, using vLLM
simulator.

## Details

- [x] Max requests test
- [x] Max duration test
- [ ] Over-saturation stopping test - skipped for now, will be enabled
when #438 lands

## Test Plan

- [x] Local testing
- [x] GitHub action

---

- [x] "I certify that all code in this PR is my own, except as noted
below."

## Use of AI

- [x] Includes AI-assisted code completion
- [ ] Includes code generated by an AI application
- [ ] Includes AI-generated tests (NOTE: AI written tests should have a
docstring that includes `## WRITTEN BY AI ##`)
Signed-off-by: Alon Kellner <akellner@redhat.com>
Signed-off-by: Alon Kellner <akellner@redhat.com>
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

4 participants