Add a metric to track InferenceModels ready to serve by the epp #598

Open
@ahg-g

What would you like to be added:

A metric that tracks the InferenceModels the epp is ready to serve.

Why is this needed:

When a new model adapter is rolled out, nothing in the InferenceModel status currently indicates whether the epp has become aware of it. We decided that the epp should not update the status of the object, since that is the responsibility of a control plane component. However, it is still operationally important to expose which InferenceModels the epp is aware of.

Metadata

Assignees

No one assigned

    Labels

    good first issue: Denotes an issue ready for a new contributor, according to the "help wanted" guidelines.
    help wanted: Denotes an issue that needs help from a contributor. Must meet "help wanted" guidelines.
    triage/accepted: Indicates an issue or PR is ready to be actively worked on.
