You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Your work is amazing! However, I encountered some issues during the training process. I simply launched the first-stage training using your recommanded command. But a problem happens.
=========================================================
scripts.train_stage1 FAILED
---------------------------------------------------------
Failures:
<NO_OTHER_FAILURES>
---------------------------------------------------------
Root Cause (first observed failure):
[0]:
time : 2025-03-04_15:41:29
host : amax
rank : 0 (local_rank: 0)
exitcode : -11 (pid: 3182701)
error_file: <N/A>
traceback : Signal 11 (SIGSEGV) received by PID 3182701
=========================================================
My environment is same as yours.
There is no problem in my inference.
If you could spare some time to help me, I would be extremely grateful.
The text was updated successfully, but these errors were encountered:
Your work is amazing! However, I encountered some issues during the training process. I simply launched the first-stage training using your recommanded command. But a problem happens.
My environment is same as yours.
There is no problem in my inference.
If you could spare some time to help me, I would be extremely grateful.
The text was updated successfully, but these errors were encountered: