For many AMD GPUs, you must add --precision full --no-half
or --upcast-sampling
arguments to avoid NaN errors or crashing. If --upcast-sampling
works as a fix with your card, you should have 2x speed (fp16) compared to running in full precision.
- Some cards like the Radeon RX 6000 Series and the RX 500 Series will already run fp16 perfectly fine (noted here.)
- If your card is unable to run SD with the latest pytorch+rocm core package, you can try installing previous versions, by following a more manual installation guide below.