Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Some questions about the principle level of the article #8

Open
Roveer opened this issue Dec 19, 2024 · 1 comment
Open

Some questions about the principle level of the article #8

Roveer opened this issue Dec 19, 2024 · 1 comment

Comments

@Roveer
Copy link

Roveer commented Dec 19, 2024

Hello and thank you for this excellent work !!!

I just started to get in touch with diffusion model, I would like to ask a few questions:
(1)The Loss used in the training process is the distance between ' x_ki-0 ' and ' x_0 '( || x_ki-0 - x_0 || ),
and ' x_ki-0 ' just like:
image
But the Loss during the real diffusion training process is || noise - ε_θ(x_t, t) ||^2 ,
It looks like your training process broke the Markov chain, isn't that a problem? (forgive my ignorance = _ =)

(2)If this method works (which is indeed the case from the effect point of view ^-^),
Can I understand like that : this training strategy is actually compressing the part of the inference process?

Grateful if you could answer my questions !!

@ZHAOZHIHAO
Copy link

(1). For 1, I think they didn't change anything in backward process. In the forward process, the predicted noise still follows Gaussian so all the equations still hold.
(2). Don't know what do you mean by 'compressing the part of the inference process'.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants