stable diffusion clip aesthetic – Aesthetic gradients are proposed as a means to personalize a CLIP-conditioned diffusion model by steering the generating process towards specific aesthetics defined by the user from a group of photos.
Aesthetic gradients effectively interpose the aesthetic embedding provided by the user (i.e., by their contributed photos and single-word definition) in the conventional prompt>CLIP>noise>image process.
The donated pictures are ‘averaged out’ in the pipeline before being normalized to the typical Stable Diffusion text2img process’s ‘unitary norm’ – supplementing rather than replacing it.
stable diffusion clip aesthetic-1
![stable diffusion clip aesthetic](https://stablediffusionaigenerator.com/wp-content/uploads/2023/04/nB531dB2BIX620b_1682755362.png)
![stable diffusion clip aesthetic](https://stablediffusionaigenerator.com/wp-content/uploads/2023/04/6UCu2iXMyqFXuyf_1682755361.png)
prompt: intensive girl, Redhead curly hair, detailed intricate environment, winter dawn, detailed, intricate, [by loish and rossdraws and artgerm and Kamal Rao:0.1]
Negative prompt: 3d, (disfigured), (bad art), (deformed), (poorly drawn), (extra limbs), strange colours, blurry, boring, sketch, lacklustre, repetitive, cropped, hands,text,watermark,signature,tiled
Steps: 20, Sampler: DPM++ 2S a, CFG scale: 7, Seed: 1290542316, Size: 768×832, Model hash: 2c02b20a, Model: sd-v2.0-768-v-ema, Batch size: 4, Batch pos: 0
stable diffusion clip aesthetic-2
![stable diffusion clip aesthetic](https://stablediffusionaigenerator.com/wp-content/uploads/2023/04/sozkJAbUj0bLfA0_1682754540.png)
![stable diffusion clip aesthetic](https://stablediffusionaigenerator.com/wp-content/uploads/2023/04/9TSpey7FXX8O79U_1682754541.png)
- prompt: “Infinity pool with a tropical forest in the background, high resolution, detail, 8 k, DSLR, good lighting, ray tracing, realistic”
- size: 512×512
- guidance scale: 12
- steps: 50
- sampler: DDIM
stable diffusion clip aesthetic-3
We notice some intriguing effects when we add the same negative prompt as in the previous case. The negative cue, in particular, appears to be detrimental to SD 1 but uniformly beneficial to SD 2.
Each image from SD 2 improves with negative prompting, however, the caption alignment for SD 1 appears to decrease generally. Adding the negative prompt appears to push the generated images closer to photorealism.
![stable diffusion clip aesthetic](https://stablediffusionaigenerator.com/wp-content/uploads/2023/04/8OQ2dNSZedsqaxf_1682748829.png)
![stable diffusion clip aesthetic](https://stablediffusionaigenerator.com/wp-content/uploads/2023/04/ev4mTiqkkaQ7Em6_1682748830.png)
- prompt: “Roman city on top of a ridge, sci-fi illustration by Greg Rutkowski #sci-fi detailed vivid colors gothic concept illustration by James Gurney and Zdzislaw BeksiÅ„ski vivid vivid colorsg concept illustration colorful interior”
- size: 512×512
- guidance scale: 12
- steps: 50
- sampler: DDIM
stable diffusion clip aesthetic-4
![stable diffusion clip aesthetic](https://stablediffusionaigenerator.com/wp-content/uploads/2023/04/48lN1NIYRvJm0zi_1682749161.png)
![stable diffusion clip aesthetic](https://stablediffusionaigenerator.com/wp-content/uploads/2023/04/hkUqWUp8UqpOMyW_1682749161.png)
- prompt: “Roman city on top of a ridge, sci-fi illustration by Greg Rutkowski #sci-fi detailed vivid colors gothic concept illustration by James Gurney and Zdzislaw BeksiÅ„ski vivid vivid colorsg concept illustration colorful interior”
- size: 512×512
- guidance scale: 12
- steps: 50
- sampler: DDIM
stable diffusion clip aesthetic-5
![stable diffusion clip aesthetic](https://stablediffusionaigenerator.com/wp-content/uploads/2023/04/VFkAbysCXNhC65U_1682749090.png)
![stable diffusion clip aesthetic](https://stablediffusionaigenerator.com/wp-content/uploads/2023/04/3tdIdFelqhyDcCv_1682749091.png)
- prompt: “a gothic cathedral in a stunning landscape by Jean-HonorĂ© Fragonard”
- size: 512×512
- guidance scale: 12
- steps: 50
- sampler: DDIM
stable diffusion clip aesthetic-6
![stable diffusion clip aesthetic](https://stablediffusionaigenerator.com/wp-content/uploads/2023/04/l2z45PokVOKOnbl_1682748984.png)
![stable diffusion clip aesthetic](https://stablediffusionaigenerator.com/wp-content/uploads/2023/04/vqjCpFb78xjvTQb_1682748985.png)
- prompt: “Cyberpunk ikea, close up shot from the top, anime art, greg rutkowski, studio ghibli, dramatic lighting”
- size: 512×512
- guidance scale: 12
- steps: 50
- sampler: DDIM
stable diffusion clip aesthetic-7
![stable diffusion clip aesthetic](https://stablediffusionaigenerator.com/wp-content/uploads/2023/04/bIQR5jkmG4a6CSr_1682755038.png)
![stable diffusion clip aesthetic](https://stablediffusionaigenerator.com/wp-content/uploads/2023/04/WUA1v8z1PQ7PLHZ_1682755037.png)
- prompt: “A studio photograph of Robert Downey Jr., cinematic lighting, hyperdetailed, 8 k realistic, global illumination, radiant light, frostbite 3 engine, CryEngine, trending on artstation, digital art”
- size: 512×512
- guidance scale: 7
- steps: 50
- seed: 119-121
stable diffusion clip aesthetic-8
![stable diffusion clip aesthetic](https://stablediffusionaigenerator.com/wp-content/uploads/2023/04/vq9x5JcZU9kKMoK_1682755152.png)
![stable diffusion clip aesthetic](https://stablediffusionaigenerator.com/wp-content/uploads/2023/04/7U8532X8FgbEln5_1682755151.png)
- prompt: “A monster fighting a hero by greg rutkowski, romanticism, cinematic lighting, hyperdetailed, 8 k realistic, global illumination, radiant light, trending on artstation, digital art”
- size: 512×512
- guidance scale: 9
- steps: 50
- seed: 119-122
stable diffusion clip aesthetic-9
![stable diffusion clip aesthetic](https://stablediffusionaigenerator.com/wp-content/uploads/2023/04/ntTxdC6JCO1aPck_1682755325.png)
![stable diffusion clip aesthetic](https://stablediffusionaigenerator.com/wp-content/uploads/2023/04/sQWoeDaYMDT7NHz_1682755324.png)
PROMPT: digital painting of a (young) (woman) (medieval knight) portrait, beautiful eyes, forest in the background, intricate details, concept art, [art by ross tran and artgerm:0.1] [and leyendecker:0.2]