feat(controller): support serverless serving with keda support by k8s scale subresource. #500
X1aoZEOuO wants to merge 3 commits into InftyAI:main from
fea7121 to b73e6f0
/kind feature
@pacoxu @googs1025 @carlory @kerthcet Hello all! Could you spare a few minutes to review my PRs when you have a chance? Other ref PRs:
/assign
@pacoxu @kenwoodjw Friendly ping, do you have some time to take a look at my PRs? Thanks a lot for your assistance!
/assign
kerthcet left a comment
Seems some docs are duplicated with https://github.com/InftyAI/llmaz/pull/499/files; can we just put one here and refer to it in the other one?
@kerthcet Thank you for catching this! I've refactored the documentation structure to eliminate the duplication. The docs in this PR are now more narrowly focused and link to the main serverless documentation (PR #499).
6ff79f5 to 9b47451
The test is always failing ...
/retest
Signed-off-by: X1aoZEOuO <nizefeng2002@outlook.com>
9b47451 to 66faed7
@pacoxu I have resolved the conflict. :)
/retest
What this PR does / why we need it
Detailed Explanation of Commit
This commit introduces a guide for configuring serverless environments on Kubernetes, focusing on integrating Prometheus for monitoring and KEDA for autoscaling. The guide aims to optimize resource efficiency through event-driven scaling while maintaining observability for AI/ML workloads.
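For readers unfamiliar with how event-driven scaling arrives at a replica count, here is a minimal sketch in Go. It is not code from this PR: the function name `desiredReplicas` and the sample numbers are hypothetical. It assumes the average-value rule the Kubernetes HPA applies to external metrics (which KEDA feeds it), namely `ceil(total metric / target per replica)`, clamped to the configured minimum and maximum. The resulting count is what ends up written to the workload's scale subresource.

```go
package main

import (
	"fmt"
	"math"
)

// desiredReplicas is a hypothetical helper illustrating the HPA
// average-value rule: ceil(metricTotal / targetPerReplica), clamped
// to [minReplicas, maxReplicas]. A minimum of 0 permits scale-to-zero.
func desiredReplicas(metricTotal, targetPerReplica float64, minReplicas, maxReplicas int32) int32 {
	if targetPerReplica <= 0 {
		// An invalid target gives no basis for scaling; stay at the minimum.
		return minReplicas
	}
	d := int32(math.Ceil(metricTotal / targetPerReplica))
	if d < minReplicas {
		d = minReplicas
	}
	if d > maxReplicas {
		d = maxReplicas
	}
	return d
}

func main() {
	// Assumed example: the metric is total in-flight requests reported
	// by Prometheus, with a target of 10 requests per replica.
	fmt.Println(desiredReplicas(25, 10, 0, 5))  // 3
	fmt.Println(desiredReplicas(0, 10, 0, 5))   // 0 (scaled to zero)
	fmt.Println(desiredReplicas(100, 10, 0, 5)) // 5 (clamped to max)
}
```

In a real deployment KEDA additionally handles the 0-to-1 activation step itself, so this sketch only approximates the steady-state calculation.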
Which issue(s) this PR fixes
Fixes #
Special notes for your reviewer
Does this PR introduce a user-facing change?
cc @pacoxu @kerthcet