ComfyUI Intro & Install: SD on a Budget
ComfyUI is one of the best ways to run Stable Diffusion locally when you care about control, repeatability, and workflow visibility. It is also unforgiving when your hardware budget is tight. On 8 GB to 12 GB cards, every bad assumption shows up quickly in the form of failed loads, slow startup, or VRAM exhaustion. The good news is that the basic setup path is straightforward if you keep the workflow simple and organize the model files correctly from the start.
1. Install the Portable Build Cleanly
The least painful Windows setup is still the portable release from the official ComfyUI repository. Extract it into a short path without spaces, make sure your GPU drivers are current, and launch the correct batch file for your hardware. For NVIDIA systems that usually means run_nvidia_gpu.bat.
The portable package is useful because it carries its own Python environment. That cuts down on environment conflicts and makes it easier to recover from a bad custom-node install later.
2. Put Models in the Right Directories
Most first-time setup failures are not mysterious. They come from models living in the wrong directory or from mixing SDXL and SD 1.5 assets without paying attention to workflow compatibility.
- Checkpoints belong in
ComfyUI\models\checkpoints. - VAE files belong in
ComfyUI\models\vae. - LoRAs belong in
ComfyUI\models\loras.
If a checkpoint fails to load, confirm the path before you assume the file is broken. If you see tensor or key-mismatch errors, verify that you are not trying to drive an SDXL checkpoint with a workflow built for SD 1.5.
3. Generate a First Image Before You Optimize Anything
Start with a basic workflow and prove the stack works end to end. Load a small known-good graph, select the checkpoint, enter a short prompt, and queue a single image. Do not begin by importing a massive community workflow full of custom nodes. That only increases the number of variables when something fails.
- Load a simple SDXL or SD 1.5 example workflow.
- Select the model explicitly.
- Keep the resolution modest for the first run.
- Queue one image and watch the console output.
If that basic path fails, the useful debugging targets are model path issues, missing VAE files, and memory pressure. Solve those first.
4. Install ComfyUI Manager Early
Once the base install works, add ComfyUI Manager. It simplifies custom-node installs, updates, and dependency handling enough that it should be treated as standard tooling rather than an optional extra.
git clone https://github.com/ltdrdata/ComfyUI-Manager
Place it inside ComfyUI\custom_nodes, restart ComfyUI, and use the Manager interface for future node installs instead of manually dragging random repositories into the tree.
5. The VRAM Survival Kit
Low-VRAM operation is mostly about being disciplined. A few settings do the majority of the work:
- Tiled VAE Decode: slower, but dramatically more forgiving on large outputs.
- Sage Attention or similar memory-efficient attention patches: useful when the card is right on the edge.
- Conservative batch size: on 8 GB, start at 1 and assume that is the normal operating mode.
- Model offload or block swapping: acceptable when stability matters more than raw throughput.
It also helps to close every other GPU-hungry application before blaming ComfyUI. Browser tabs, game overlays, and video tools can quietly consume the margin you thought you had.
6. Common Failure Modes
CUDA out of memory: lower the resolution, drop the batch size, enable tiled decoding, and confirm you are not loading unnecessary models into memory.
Checkpoint key mismatch: you are probably combining the wrong model family with the wrong workflow.
Washed-out or broken output: check the VAE selection and make sure the workflow is not inheriting a bad default.
Slow startup: the first run often pays a model-load penalty. That is normal. Judge performance after the pipeline is warm.
7. A Reasonable Starter Stack
For most budget-conscious installs, a sensible starter stack is: one proven checkpoint, one simple workflow, ComfyUI Manager, and a short list of VRAM-saving options you actually understand. That is enough to get productive without turning the system into a dependency jungle.
Once that baseline is stable, then it makes sense to experiment with more aggressive workflows, LoRAs, or visual workflow builders such as Promptus for faster iteration and packaging.
More Readings
- Promptus AI for building and testing ComfyUI-oriented workflows.
- ComfyUI Official Repository for releases and setup notes.
- Understanding ComfyUI Workflows for Beginners
- Low-VRAM SDXL Optimization Tactics
- RTX Performance and VRAM Optimization
Created: 22 January 2026