For some reason, lately the results from diffusers are bad #11201
Closed
nitinmukesh started this conversation in General
Replies: 2 comments, 5 replies
Hi! To make this easier, can you post an example of these bad results? I'd prefer SANA since it would be easier and faster to test. Sorry to ask, but I need to see the bad results so I can compare them.
Some sample output for the Sana model (using all 3 newly released models) is here: https://drive.google.com/file/d/1cDiONGMlp4eTHTGmDTLW-6fJ7XgIRGnY/view?usp=sharing NOTE: All images have generation data embedded. You need a metadata/EXIF reader. You can also use
I first ran into this problem a while back with Hunyuan: I quantized the models and generated output, but it was nowhere near what I was expecting. I assumed quantization was the cause and didn't post an issue.
I hit the same problem with Wan, assumed the same cause, and deleted it. Then I came across Diffsynth and started using it (I still do), and the output was very good, even though it also uses quantization. I have also started using Video-X for Wan, which introduced t2v, i2v, and control for 1.3B, and the results from the quantized version are amazing.
I then saw the same quality issue with the Sana v1.5 models (all 3 of them) and ignored it, thinking I was the only one affected or that I didn't know how to use them. Then I got notifications for several threads where other users reported similar issues.
I will post links to a few of those threads here:
Wan-Video/Wan2.1#292
Wan-Video/Wan2.1#303
NVlabs/Sana#203
Not all models have this issue, only the recent ones.
For example, LTX 0.9.1, Cog 3, Sana v1, Flux, Lumina, AuraFlow, etc. all produce good output.
The Sana thread mentions the scheduler implementation. Is the problem related to the scheduler, and if so, could we have a custom_scheduler just like custom_pipeline? I am not sure what the cause is (I'm not a developer), so I will leave it to you to figure out and help us.
My test env is 8 + 16 + 42 for all models