
Running Llama2-70B TP2 on L40s (with fp8 quantization) #709
