Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Signal 11 (SIGSEGV) #75

Open
stein-kristina opened this issue Mar 3, 2025 · 0 comments
Open

Signal 11 (SIGSEGV) #75

stein-kristina opened this issue Mar 3, 2025 · 0 comments

Comments

@stein-kristina
Copy link

stein-kristina commented Mar 3, 2025

Your work is amazing! However, I encountered some issues during the training process. I simply launched the first-stage training using your recommanded command. But a problem happens.

=========================================================
scripts.train_stage1 FAILED
---------------------------------------------------------
Failures:
  <NO_OTHER_FAILURES>
---------------------------------------------------------
Root Cause (first observed failure):
[0]:
  time      : 2025-03-04_15:41:29
  host      : amax
  rank      : 0 (local_rank: 0)
  exitcode  : -11 (pid: 3182701)
  error_file: <N/A>
  traceback : Signal 11 (SIGSEGV) received by PID 3182701
=========================================================

My environment is same as yours.
There is no problem in my inference.
If you could spare some time to help me, I would be extremely grateful.

@stein-kristina stein-kristina changed the title Segmentation fault Signal 11 (SIGSEGV) Mar 4, 2025
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

1 participant