feat(controller): support serverless serving with keda support by k8s scale subresource. by X1aoZEOuO · Pull Request #500 · InftyAI/llmaz

X1aoZEOuO · 2025-09-28T11:59:41Z

What this PR does / why we need it

Detailed Explanation of Commit

This commit introduces a guide for configuring serverless environments on Kubernetes, focusing on integrating Prometheus for monitoring and KEDA for autoscaling. The guide aims to optimize resource efficiency through event-driven scaling while maintaining observability for AI/ML workloads.

Prometheus Integration: Configured with namespaceSelector for cross-namespace monitoring
KEDA Autoscaling: Custom metric scaling with Prometheus triggers
Scale-to-Zero: Activator pattern with request buffering and CloudEvents

Which issue(s) this PR fixes

Fixes #

Special notes for your reviewer

Does this PR introduce a user-facing change?

cc @pacoxu @kerthcet

X1aoZEOuO · 2025-09-28T12:20:57Z

/kind feature

X1aoZEOuO · 2025-09-29T13:29:05Z

@pacoxu @googs1025 @carlory @kerthcet Hello all! Could you spare a few minutes to review my PRs when you have a chance?

Other ref PRs:

pacoxu · 2025-10-09T05:43:39Z

/assign
I will take a look this week or early next week.

X1aoZEOuO · 2025-10-15T10:37:26Z

@pacoxu @kenwoodjw Friendly ping, do you have some time to take a look at my PRs? Thanks a lot for your assistance!

kerthcet · 2025-10-27T09:48:47Z

/assign

kerthcet

seems some docs are duplicated with https://github.com/InftyAI/llmaz/pull/499/files, can we just put one here and refer to it in another one.

X1aoZEOuO · 2025-10-29T17:47:19Z

seems some docs are duplicated with https://github.com/InftyAI/llmaz/pull/499/files, can we just put one here and refer to it in another one.

@kerthcet Thank you for catching this! I've refactored the documentation structure to eliminate duplication, Now focuses specifically, and reference link to the main serverless documentation (PR #499)

kerthcet · 2025-10-30T18:30:45Z

The test is always failing ...

pacoxu · 2025-10-31T03:50:01Z

/retest

Signed-off-by: X1aoZEOuO <nizefeng2002@outlook.com>

X1aoZEOuO · 2025-10-31T04:29:37Z

@pacoxu I have resolved the conflict. :)

X1aoZEOuO · 2025-10-31T04:41:56Z

/retest

InftyAI-Agent added needs-triage Indicates an issue or PR lacks a label and requires one. needs-priority Indicates a PR lacks a label and requires one. do-not-merge/needs-kind Indicates a PR lacks a label and requires one. labels Sep 28, 2025

InftyAI-Agent requested review from carlory and googs1025 September 28, 2025 11:59

X1aoZEOuO force-pushed the feat/1-n-keda-support branch from fea7121 to b73e6f0 Compare September 28, 2025 12:18

InftyAI-Agent added feature Categorizes issue or PR as related to a new feature. and removed do-not-merge/needs-kind Indicates a PR lacks a label and requires one. labels Sep 28, 2025

InftyAI-Agent assigned pacoxu Oct 9, 2025

InftyAI-Agent assigned kerthcet Oct 27, 2025

pacoxu mentioned this pull request Oct 28, 2025

doc: add serverless doc with keda and activator. #499

Open

kerthcet reviewed Oct 28, 2025

View reviewed changes

X1aoZEOuO force-pushed the feat/1-n-keda-support branch from 6ff79f5 to 9b47451 Compare October 29, 2025 17:49

X1aoZEOuO mentioned this pull request Oct 31, 2025

Github e2e ci always failed in recent weeks. #505

Open

X1aoZEOuO added 3 commits October 31, 2025 12:26

feat: add label for prometheus query.

c0cd133

Signed-off-by: X1aoZEOuO <nizefeng2002@outlook.com>

feat: add serverless config for keda.

b096ddf

Signed-off-by: X1aoZEOuO <nizefeng2002@outlook.com>

feat: add serverless usage doc for llmaz.

66faed7

Signed-off-by: X1aoZEOuO <nizefeng2002@outlook.com>

X1aoZEOuO force-pushed the feat/1-n-keda-support branch from 9b47451 to 66faed7 Compare October 31, 2025 04:28

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

feat(controller): support serverless serving with keda support by k8s scale subresource.#500

feat(controller): support serverless serving with keda support by k8s scale subresource.#500
X1aoZEOuO wants to merge 3 commits intoInftyAI:mainfrom
X1aoZEOuO:feat/1-n-keda-support

X1aoZEOuO commented Sep 28, 2025

Uh oh!

X1aoZEOuO commented Sep 28, 2025

Uh oh!

X1aoZEOuO commented Sep 29, 2025 •

edited

Loading

Uh oh!

pacoxu commented Oct 9, 2025

Uh oh!

X1aoZEOuO commented Oct 15, 2025

Uh oh!

kerthcet commented Oct 27, 2025

Uh oh!

kerthcet left a comment

Uh oh!

X1aoZEOuO commented Oct 29, 2025

Uh oh!

kerthcet commented Oct 30, 2025

Uh oh!

pacoxu commented Oct 31, 2025

Uh oh!

X1aoZEOuO commented Oct 31, 2025

Uh oh!

X1aoZEOuO commented Oct 31, 2025

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

Uh oh!

Conversation

X1aoZEOuO commented Sep 28, 2025

What this PR does / why we need it

Detailed Explanation of Commit

Which issue(s) this PR fixes

Special notes for your reviewer

Does this PR introduce a user-facing change?

Uh oh!

X1aoZEOuO commented Sep 28, 2025

Uh oh!

X1aoZEOuO commented Sep 29, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

pacoxu commented Oct 9, 2025

Uh oh!

X1aoZEOuO commented Oct 15, 2025

Uh oh!

kerthcet commented Oct 27, 2025

Uh oh!

kerthcet left a comment

Choose a reason for hiding this comment

Uh oh!

X1aoZEOuO commented Oct 29, 2025

Uh oh!

kerthcet commented Oct 30, 2025

Uh oh!

pacoxu commented Oct 31, 2025

Uh oh!

X1aoZEOuO commented Oct 31, 2025

Uh oh!

X1aoZEOuO commented Oct 31, 2025

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

X1aoZEOuO commented Sep 29, 2025 •

edited

Loading