How to make the model load only once? #12
Hi @lattemj

```
# We use --files_extension txt to translate only files with this extension.
# Use an empty string to translate all files in the directory.
python3 translate.py \
  --sentences_dir sample_text/ \
  --output_path sample_text/translations \
  --files_extension txt \
  --source_lang en \
  --target_lang es \
  --model_name facebook/m2m100_1.2B
```

Is this what you are trying to do?
Any update on this? He is asking to keep the model loaded in memory so that the model does not have to be loaded again for every inference, as loading is time-consuming.
@twicer-is-coder the only solution is to put all your data in a single file (or multiple files) and make a single call to the code. If you want to run the code as an API, you can use libraries built for that purpose, such as vLLM https://github.com/vllm-project/vllm or TGI https://huggingface.co/docs/text-generation-inference/index
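Within a single process, the load-once pattern being asked for can be sketched like this. This is a minimal illustration, not the repository's code: `load_model` is a hypothetical stand-in for the expensive model load (in the real script that would be something like transformers' `M2M100ForConditionalGeneration.from_pretrained(...)`), and `translate` stands in for the generation step.

```python
import functools

LOAD_COUNT = 0  # counts how many times the expensive load actually runs


@functools.lru_cache(maxsize=1)
def load_model(model_name: str):
    """Load the model once; repeated calls return the cached instance.

    Hypothetical placeholder: a real implementation would call
    transformers' from_pretrained(...) here instead.
    """
    global LOAD_COUNT
    LOAD_COUNT += 1
    return {"name": model_name}  # placeholder for the loaded model object


def translate(sentence: str, model_name: str = "facebook/m2m100_1.2B") -> str:
    model = load_model(model_name)  # cached after the first call
    # Placeholder translation step; real code would run model.generate(...).
    return f"[{model['name']}] {sentence}"


# Many translate() calls, but the model is loaded only a single time.
for s in ["Hello", "How are you?"]:
    translate(s)
print(LOAD_COUNT)  # → 1
```

This is the same idea behind running the model as a long-lived server (as with vLLM or TGI): the weights stay resident across requests instead of being reloaded per invocation.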
Can the model be loaded only once instead of waiting for the load to complete each time?