Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Enhance tgi README for ArtifactHUB. #517

Closed
wants to merge 3 commits into from
Closed

Conversation

Yugar-1
Copy link
Contributor

@Yugar-1 Yugar-1 commented Oct 31, 2024

Description

The summary of the proposed changes as long as the relevant motivation and context.

Enhance tgi README for ArtifactHUB.

Issues

List the issue or RFC link this PR is working on. If there is no such link, please mark it as n/a.

n/a

Type of change

List the type of change like below. Please delete options that are not relevant.

  • Bug fix (non-breaking change which fixes an issue)
  • New feature (non-breaking change which adds new functionality)
  • Breaking change (fix or feature that would break existing design and interface)

n/a

Dependencies

List the newly introduced 3rd party dependency if exists.

n/a

Tests

Describe the tests that you ran to verify your changes.

n/a

Signed-off-by: Yugar-1 <xiaoyu.zhang@intel.com>
helm install tgi tgi --set global.modelUseHostPath=${MODELDIR} --set LLM_MODEL_ID=${MODELNAME} --set global.HUGGINGFACEHUB_API_TOKEN=${HFTOKEN}
# To deploy on Gaudi enabled kubernetes cluster
# helm install tgi tgi --set global.modelUseHostPath=${MODELDIR} --set LLM_MODEL_ID=${MODELNAME} --set global.HUGGINGFACEHUB_API_TOKEN=${HFTOKEN} --values gaudi-values.yaml
helm repo add opea https://opea-project.github.io/GenAIInfra
Copy link
Collaborator

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

As there are discussions about where to release the helm charts, I think it would be better to put only chart options and verification part in the README.
The deployment part might be different if users are getting the helm chart from various places.
Let's involve @poussa @eero-t to bring more experience. (For a better readme before v1.1 release)

Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

The less duplication, the better. There could e.g. be a link to project README instead:
https://github.com/opea-project/GenAIInfra/blob/main/helm-charts/README.md#deploy-with-helm-charts

@yongfengdu yongfengdu requested a review from poussa November 5, 2024 02:55
Comment on lines +39 to +40
| horizontalPodAutoscaler.enabled | bool | false | Enable HPA autoscaling for the service deployment based on metrics it provides. See [HPA instructions](../../HPA.md) before enabling! |
| global.monitoring | bool | false | Enable usage metrics for the service. Required for HPA. See [monitoring instructions](../../monitoring.md) before enabling! |
Copy link
Contributor

@eero-t eero-t Nov 5, 2024

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Please leave this part out, there's a more complete fix in: #523

Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

#523 is merged. Please remove this part and rebase.

@poussa
Copy link
Collaborator

poussa commented Nov 5, 2024

As we are still discussing where to host the OPEA helm charts, let's not merge PRs related to this area before we have reached a conclusion. The top candidates for Helm chart hosting are currently ghcr.io and artifacthub.io.

@yongfengdu
Copy link
Collaborator

Close this PR as we decided to use ghcr.io

@yongfengdu yongfengdu closed this Nov 25, 2024
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

4 participants