
change model #13

Open
slowmagic10 opened this issue Jan 2, 2025 · 1 comment
Comments

@slowmagic10

Can I use other NIM models, like llama-3.1 8b instead of the 70b?

@omihub777

@slowmagic10
Yes, you can.

If you are using the NVIDIA-hosted LLMs, you just need to set an environment variable with export APP_LLM_MODELNAME=meta/llama-3.1-8b-instruct before running docker compose up.
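As a minimal sketch of the NVIDIA-hosted case (assuming the default compose workflow in this repo), the two steps would look like:

```shell
# Point the blueprint at the smaller 8B NIM model
export APP_LLM_MODELNAME=meta/llama-3.1-8b-instruct

# Start the stack; compose picks the variable up from the environment
docker compose up
```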

For a self-hosted deployment, you additionally need to change the image in deploy/compose/nims.yaml:

services:
  nemollm-inference:
    container_name: nemollm-inference-microservice
    # image: nvcr.io/nim/meta/llama-3.1-70b-instruct:1.1 # comment out
    image: nvcr.io/nim/meta/llama-3.1-8b-instruct:latest # use 8b instead

Be aware that llama-3.1-8b may not be performant enough to run this blueprint successfully.
