tgi: revert xeon version to 2.2.0 #328

lianhao · 2024-08-21T02:21:46Z

Description

Due to bug opea-project/GenAIExamples#636, revert tgi xeon version to 2.2.0.

Issues

opea-project/GenAIExamples#636

Type of change

List the type of change like below. Please delete options that are not relevant.

Bug fix (non-breaking change which fixes an issue)

Dependencies

n/a.

Tests

Describe the tests that you ran to verify your changes.

Due to bug opea-project/GenAIExamples#636, revert tgi xeon version to 2.2.0. Signed-off-by: Lianhao Lu <lianhao.lu@intel.com>

daisy-ycguo

lgtm

lianhao · 2024-08-21T05:08:06Z

The reason why gmc xeon test fails is because the model used by gmc chatqna CI test bigscience/bloom-560m doesn't work with tgi 2.2.0 anymore. We use bigscience/bloom-560m in GMC xeon CI because of bug #258.

Signed-off-by: Lianhao Lu <lianhao.lu@intel.com>

KfreeZ · 2024-08-21T07:00:20Z

note: TGI 2.2.0 cannot work with bigscience/bloom-560m

KfreeZ

thanks for fixing this!!!

eero-t · 2024-08-21T11:43:31Z

The reason why gmc xeon test fails is because the model used by gmc chatqna CI test bigscience/bloom-560m doesn't work with tgi 2.2.0 anymore. We use bigscience/bloom-560m in GMC xeon CI because of bug #258.

Whereas 2.0.x & 2.1.x versions regressed TGI Prometheus support so one does not get any token metrics out of them (although they worked fine with TGI 1.x). That was fixed only in 2.2.0: huggingface/text-generation-inference#2184

* tgi: revert xeon version to 2.2.0 Due to bug opea-project/GenAIExamples#636, revert tgi xeon version to 2.2.0. * Retriever: Fix missing HF_TOKEN issue of v0.9 retriever-redis image Signed-off-by: Lianhao Lu <lianhao.lu@intel.com> (cherry picked from commit 076e81e)

tgi: revert xeon version to 2.2.0

1312570

Due to bug opea-project/GenAIExamples#636, revert tgi xeon version to 2.2.0. Signed-off-by: Lianhao Lu <lianhao.lu@intel.com>

daisy-ycguo approved these changes Aug 21, 2024

View reviewed changes

lianhao added this to the v0.9 milestone Aug 21, 2024

yongfengdu approved these changes Aug 21, 2024

View reviewed changes

lianhao force-pushed the tgi_2.2.0 branch from df32fc7 to eb0b1e2 Compare August 21, 2024 06:18

Retriever: Fix missing HF_TOKEN issue of v0.9 retriever-redis image

2f0166c

Signed-off-by: Lianhao Lu <lianhao.lu@intel.com>

lianhao force-pushed the tgi_2.2.0 branch from eb0b1e2 to 2f0166c Compare August 21, 2024 06:53

KfreeZ approved these changes Aug 21, 2024

View reviewed changes

KfreeZ merged commit 076e81e into opea-project:main Aug 21, 2024
14 checks passed

lianhao deleted the tgi_2.2.0 branch August 21, 2024 07:41

lianhao mentioned this pull request Aug 22, 2024

[ci-auto] GenAIExample ChatQnA compose.yaml got changed. #335

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

tgi: revert xeon version to 2.2.0 #328

tgi: revert xeon version to 2.2.0 #328

lianhao commented Aug 21, 2024

daisy-ycguo left a comment

lianhao commented Aug 21, 2024

KfreeZ commented Aug 21, 2024

KfreeZ left a comment

eero-t commented Aug 21, 2024

tgi: revert xeon version to 2.2.0 #328

tgi: revert xeon version to 2.2.0 #328

Conversation

lianhao commented Aug 21, 2024

Description

Issues

Type of change

Dependencies

Tests

daisy-ycguo left a comment

Choose a reason for hiding this comment

lianhao commented Aug 21, 2024

KfreeZ commented Aug 21, 2024

KfreeZ left a comment

Choose a reason for hiding this comment

eero-t commented Aug 21, 2024