
[Feature] Helm Charts for Txt2Img and SearchQnA. #596

Closed
joshuayao opened this issue Nov 21, 2024 · 11 comments

@joshuayao (Collaborator)

No description provided.

@joshuayao joshuayao added this to the v1.2 milestone Nov 21, 2024
@joshuayao joshuayao added this to OPEA Nov 21, 2024
@joshuayao joshuayao added the feature New feature or request label Nov 21, 2024
@yongfengdu (Collaborator)

Is there a priority list for the GenAIExamples? If we don't have enough resources to add all examples, we need a priority list to decide which to do first.
Besides adding more examples, I think the most important thing is to fix issues and enhance the high-priority Examples/Components to make them production-ready.

@joshuayao (Collaborator, Author)

Hi @yongfengdu, is your team working on this?

@yongfengdu (Collaborator)

As far as I know, GenAIExamples is being refactored for 1.2, and the helm charts will need to change accordingly.
It's not wise to add more examples if they will be merged soon.
I'd propose postponing the "more examples support" to the 1.3 release, since it would be a waste of engineering resources if we keep doing something like this:
#590 - Add more microservices for docsum
#659 - Reduce microservices in docsum

The latter one removed >80% of the code from the previous PR.

@joshuayao joshuayao moved this to In progress in OPEA Jan 8, 2025
@yongfengdu (Collaborator)

I walked through GenAIExamples' current list; here is a summary of the gaps:

  • Already supported, will follow up with updates:
    AgentQnA, AudioQnA, ChatQnA, CodeGen, CodeTrans, DocSum, FaqGen, VisualQnA

  • Not yet supported, but planned (5 examples):
    AvatarChatBot - AudioQnA + wav2lip + animation; 2 more microservices required.
    DBQnA - Uses an LLM to generate SQL. CPU only. More components required: postgres, text2sql-service.
    GraphRAG - A variant of ChatQnA; neo4j graph database required; Gaudi only.
    SearchQnA - Uses a web retriever. Web-retriever supported; CPU only.
    Text2Image - No compose file; components only.

  • Not supported, no plan to support:
    DocIndexRetriever - Not an E2E example; part of AgentQnA.
    EdgeCraftRag - Arc GPU only; all functions are implemented in edgecraftrag-server, not using GenAIComps. Less likely to run in a k8s environment; defer support until more discussion/requirements.
    InstructionTuning/RerankingTuning - Not an E2E example; no compose file.
    MultimodalQnA - To be merged with VisualQnA.
    ProductivitySuite - To be merged with ERAG.
    Translation - To be merged with CodeTrans.
    VideoQnA - To be merged with VisualQnA.

@eero-t (Contributor)

eero-t commented Jan 15, 2025

>   • Already supported, will follow up with updates:
>     AgentQnA, AudioQnA, ChatQnA, CodeGen, CodeTrans, DocSum, FaqGen, VisualQnA
>     ...
>   • Not supported, no plan to support:
>     DocIndexRetriever - Not an E2E example; part of AgentQnA.
>     EdgeCraftRag - Arc GPU only; all functions are implemented in edgecraftrag-server, not using GenAIComps. Less likely to run in a k8s environment; defer support until more discussion/requirements.

While EdgeCraftRag may not be important, I think Intel GPU support is important.

(Gaudi is too expensive for normal devs, so if Intel GPU support is not available, some other manufacturer's GPUs will be used for acceleration, and that dev is lost to Intel.)

FYI: there's an old open PR for adding vLLM OpenVINO / GPU support for ChatQnA: #403

@vrantala

To be added to the list: a Helm chart with vLLM support for the DocSum service is required.

@joshuayao joshuayao changed the title [Feature] Helm Charts for all remaining GenAIExamples [Feature] Helm Charts for remaining GenAIExamples Jan 16, 2025
@eero-t (Contributor)

eero-t commented Jan 16, 2025

> To be added to the list: a Helm chart with vLLM support for the DocSum service is required.

That was merged a few hours ago: #649

ChatQnA vLLM support was merged last month: #610

However, ChatQnA currently uses vLLM only for the LLM. Embedding, reranking, and guardrails still use TEI / TGI. There was a PR to add vLLM embedding support, but a review comment suggested it might not be needed: opea-project/GenAIExamples#1237

And the Helm charts for all other applications are lacking vLLM support.
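
For context, backend selection in charts like these is typically done with an alternate values file passed at install time. A minimal sketch of what such an override might look like, assuming subchart toggles of this shape; the key names and model ID here are illustrative assumptions, not values confirmed by this thread:

```yaml
# Hypothetical vllm-values.yaml fragment for a ChatQnA-style chart:
# serve the LLM with vLLM while keeping TEI for embedding and reranking.
# All key names below are assumptions for illustration.
vllm:
  enabled: true
  LLM_MODEL_ID: Intel/neural-chat-7b-v3-3   # assumed default model
tgi:
  enabled: false      # disable the default TGI LLM backend
tei:
  enabled: true       # embedding still served by TEI
teirerank:
  enabled: true       # reranking still served by TEI
```

Such a file would be applied with something like `helm install chatqna ./chatqna -f vllm-values.yaml` (chart path also assumed).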

@yongfengdu (Collaborator)

AgentQnA will add vLLM support (LLM only): #715

@yongfengdu (Collaborator)

We added 2 new examples for 1.2: Txt2Img and SearchQnA.

The other 3 are candidates for the v1.3 release (may change according to GenAIExamples' plan):
AvatarChatBot - AudioQnA + wav2lip + animation; 2 more microservices required.
DBQnA - Uses an LLM to generate SQL. CPU only. More components required: postgres, text2sql-service.
GraphRAG - A variant of ChatQnA; neo4j graph database required; Gaudi only.

@yongfengdu (Collaborator)

It should be easy to add vLLM OpenVINO support to the current vllm helm chart with just a newly defined openvino-values.yaml.
The trouble is making sure the parameters are set correctly, and testing (we have no GPU environment).

> While EdgeCraftRag may not be important, I think Intel GPU support is important.
>
> (Gaudi is too expensive for normal devs, so if Intel GPU support is not available, some other manufacturer's GPUs will be used for acceleration, and that dev is lost to Intel.)
>
> FYI: there's an old open PR for adding vLLM OpenVINO / GPU support for ChatQnA: #403
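
The openvino-values.yaml approach mentioned above might look roughly like the sketch below. The image name, environment variable, and resource key are assumptions for illustration (not taken from a merged chart); `gpu.intel.com/i915` is the resource name exposed by the Intel GPU device plugin for Kubernetes:

```yaml
# Hypothetical openvino-values.yaml for the vllm helm chart.
image:
  repository: opea/vllm-openvino   # assumed OpenVINO-enabled vLLM image
  tag: latest

# vLLM's OpenVINO backend is configured via environment variables.
extraEnv:
  - name: VLLM_OPENVINO_KVCACHE_SPACE   # GiB reserved for the KV cache
    value: "8"

resources:
  limits:
    gpu.intel.com/i915: 1   # request one Intel GPU from the device plugin
```

As noted, the hard part is less the values file itself than validating the parameters without a GPU test environment.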

@joshuayao joshuayao changed the title [Feature] Helm Charts for remaining GenAIExamples [Feature] Helm Charts for Txt2Img and SearchQnA. Jan 21, 2025
@github-project-automation github-project-automation bot moved this from In progress to Done in OPEA Jan 22, 2025
@mkbhanda (Collaborator)

Please note that AMD will help update multiple GenAIExamples to use vLLM in v1.3; to avoid duplicate work, let us create issues for each and ask them to self-assign half.
