AUTOMATIC1111_stable-diffusion-webui: Stable Diffusion web UI

URL

https://github.com/AUTOMATIC1111/stable-diffusion-webui

Origin

github.com

Stable Diffusion web UI

A browser interface for Stable Diffusion, built on the Gradio library.

Features

Detailed feature showcase with images:

Original txt2img and img2img modes

One-click install and run script (you must still install Python and Git yourself)

Outpainting

Inpainting

Color Sketch

Prompt Matrix

Stable Diffusion Upscale

Attention, specify parts of the text that the model should pay more attention to

a man in a ((tuxedo)) - will pay more attention to tuxedo
a man in a (tuxedo:1.21) - alternative syntax
select text and press Ctrl+Up or Ctrl+Down to automatically adjust attention to the selected text (code contributed by an anonymous user)
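
The two syntaxes above can be related with a small sketch. It assumes that each pair of parentheses multiplies attention by 1.1, which is consistent with ((tuxedo)) and (tuxedo:1.21) being listed as alternatives; the function name `attention_weight` is hypothetical and this is not the web UI's actual parser.

```python
import re
from typing import Tuple

# Assumption: each pair of parentheses multiplies attention by 1.1,
# which matches ((tuxedo)) being equivalent to (tuxedo:1.21).
PAREN_MULTIPLIER = 1.1


def attention_weight(token: str) -> Tuple[str, float]:
    """Return (text, weight) for a single emphasized token."""
    # Explicit form: (text:number)
    explicit = re.fullmatch(r"\((.+):([0-9]*\.?[0-9]+)\)", token)
    if explicit:
        return explicit.group(1), float(explicit.group(2))
    # Nested form: count how many parenthesis pairs wrap the text
    depth = 0
    while token.startswith("(") and token.endswith(")"):
        token = token[1:-1]
        depth += 1
    return token, round(PAREN_MULTIPLIER ** depth, 4)
```

Under that assumption, `attention_weight("((tuxedo))")` and `attention_weight("(tuxedo:1.21)")` both yield a weight of 1.21 for the text `tuxedo`.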

Loopback, run img2img processing multiple times

X/Y/Z plot, a way to draw a 3-dimensional plot of images with different parameters
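
As a rough illustration of what a 3-dimensional parameter plot enumerates, the sketch below lists every combination of three value lists. The axis contents (step counts, a guidance scale, a sampler name) and the function name `xyz_grid` are hypothetical examples, not values or code taken from the web UI.

```python
from itertools import product


def xyz_grid(x_values, y_values, z_values):
    """Yield one (x, y, z) parameter combination per image in the plot."""
    # z varies slowest and x fastest, so each z value gives a full X/Y grid
    for z, y, x in product(z_values, y_values, x_values):
        yield x, y, z


# Hypothetical axes: step counts (X), guidance scales (Y), sampler names (Z)
cells = list(xyz_grid([20, 30], [5.0, 7.5], ["Euler a"]))
```

The number of images grows as the product of the three list lengths, so here the plot would contain 2 x 2 x 1 = 4 images.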

Textual Inversion

have as many embeddings as you want and use any names you like for them
use multiple embeddings with different numbers of vectors per token
works with half precision floating point numbers
train embeddings on 8GB (also reports of 6GB working)

Extras tab with:

GFPGAN, neural network that fixes faces
CodeFormer, face restoration tool as an alternative to GFPGAN
RealESRGAN, neural network upscaler
ESRGAN, neural network upscaler with a lot of third party models
SwinIR and Swin2SR (see here), neural network upscalers
LDSR, Latent diffusion super resolution upscaling

Resizing aspect ratio options

Sampling method selection

Adjust sampler eta values (noise multiplier)
More advanced noise setting options

Interrupt processing at any time

4GB video card support (also reports of 2GB working)

Correct seeds for batches

Live prompt token length validation

Generation parameters

parameters you used to generate images are saved with that image
in PNG chunks for PNG, in EXIF for JPEG
can drag the image to PNG info tab to restore generation parameters and automatically copy them into UI
can be disabled in settings
drag and drop an image/text-parameters to promptbox
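
To make "saved in PNG chunks" concrete, here is a minimal standard-library sketch that reads the text chunks of a PNG. It handles only uncompressed tEXt chunks, and the keyword `parameters` used in the sample file is an assumption for illustration; the helper names are hypothetical.

```python
import struct
import zlib

PNG_SIGNATURE = b"\x89PNG\r\n\x1a\n"


def _chunk(kind: bytes, data: bytes) -> bytes:
    """Serialize one PNG chunk: length, type, data, CRC."""
    crc = zlib.crc32(kind + data) & 0xFFFFFFFF
    return struct.pack(">I", len(data)) + kind + data + struct.pack(">I", crc)


def read_text_chunks(png_bytes: bytes) -> dict:
    """Return {keyword: text} for every tEXt chunk in a PNG byte string."""
    if not png_bytes.startswith(PNG_SIGNATURE):
        raise ValueError("not a PNG file")
    found, pos = {}, len(PNG_SIGNATURE)
    while pos + 8 <= len(png_bytes):
        length, kind = struct.unpack(">I4s", png_bytes[pos:pos + 8])
        data = png_bytes[pos + 8:pos + 8 + length]
        if kind == b"tEXt":
            # A tEXt chunk is keyword, NUL separator, then Latin-1 text
            keyword, _, text = data.partition(b"\x00")
            found[keyword.decode("latin-1")] = text.decode("latin-1")
        pos += 12 + length  # 4 length + 4 type + data + 4 CRC
    return found


# Build a 1x1 grayscale PNG in memory with an assumed "parameters" entry.
ihdr = struct.pack(">IIBBBBB", 1, 1, 8, 0, 0, 0, 0)
idat = zlib.compress(b"\x00\x00")  # filter byte + one pixel
sample = (PNG_SIGNATURE + _chunk(b"IHDR", ihdr)
          + _chunk(b"tEXt", b"parameters\x00a man in a tuxedo")
          + _chunk(b"IDAT", idat) + _chunk(b"IEND", b""))
params = read_text_chunks(sample)
```

Because the text travels inside the image file itself, any tool that can walk PNG chunks can recover it without the web UI.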

Read Generation Parameters Button, loads parameters in promptbox to UI

Settings page

Running arbitrary Python code from UI (must run with --allow-code to enable)

Mouseover hints for most UI elements

Possible to change defaults/min/max/step values for UI elements via text config

Tiling support, a checkbox to create images that can be tiled like textures

Progress bar and live image generation preview

Can use a separate neural network to produce previews with almost no VRAM or compute requirement

Negative prompt, an extra text field that allows you to list what you don’t want to see in the generated image

Styles, a way to save part of a prompt and easily apply it via dropdown later

Variations, a way to generate the same image but with tiny differences

Seed resizing, a way to generate the same image but at a slightly different resolution

CLIP interrogator, a button that tries to guess the prompt from an image

Prompt Editing, a way to change the prompt mid-generation, say to start making a watermelon and switch to anime girl midway

Batch Processing, process a group of files using img2img

Img2img Alternative, reverse Euler method of cross attention control

Highres Fix, a convenience option to produce high resolution pictures in one click without the usual distortions

Reloading checkpoints on the fly

Checkpoint Merger, a tab that allows you to merge up to 3 checkpoints into one

Custom scripts with many extensions from the community

Composable-Diffusion, a way to use multiple prompts at once

separate prompts using uppercase AND
also supports weights for prompts: a cat :1.2 AND a dog AND a penguin :2.2
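
The AND syntax above can be read as a list of weighted sub-prompts. The following sketch splits such a prompt; it assumes that a sub-prompt without an explicit weight counts as 1.0, and `split_composable_prompt` is a hypothetical helper rather than the web UI's own parser.

```python
import re
from typing import List, Tuple


def split_composable_prompt(prompt: str) -> List[Tuple[str, float]]:
    """Split a prompt on uppercase AND and read optional ':weight' suffixes."""
    parts = []
    # Only the uppercase keyword separates prompts; lowercase "and" is left alone
    for piece in re.split(r"\s+AND\s+", prompt.strip()):
        match = re.fullmatch(r"(.*?)\s*:\s*([0-9]*\.?[0-9]+)", piece)
        if match:
            parts.append((match.group(1).strip(), float(match.group(2))))
        else:
            # Assumption: a sub-prompt without an explicit weight counts as 1.0
            parts.append((piece.strip(), 1.0))
    return parts
```

For the example in the text, this gives three sub-prompts: `a cat` at 1.2, `a dog` at the assumed default of 1.0, and `a penguin` at 2.2.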

No token limit for prompts (original Stable Diffusion lets you use up to 75 tokens)

DeepDanbooru integration, creates danbooru style tags for anime prompts

xformers, major speed increase for select cards (add --xformers to commandline args)

via extension: History tab: view, direct and delete images conveniently within the UI

Generate forever option

Training tab

hypernetworks and embeddings options
Preprocessing images: cropping, mirroring, autotagging using BLIP or deepdanbooru (for anime)

Clip skip

Hypernetworks

Loras (same as Hypernetworks but more pretty)

A separate UI where you can choose, with preview, which embeddings, hypernetworks or Loras to add to your prompt

Can select to load a different VAE from settings screen

Estimated completion time in progress bar

API

Support for dedicated inpainting model by RunwayML

via extension: Aesthetic Gradients, a way to generate images with a specific aesthetic by using CLIP image embeds (implementation of https://github.com/vicgalle/stable-diffusion-aesthetic-gradients)

Stable Diffusion 2.0 support - see wiki for instructions

Alt-Diffusion support - see wiki for instructions

Now without any bad letters!

Load checkpoints in safetensors format

Eased resolution restriction: generated image’s dimension must be a multiple of 8 rather than 64
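
The practical effect of the eased restriction can be shown with a short check. This is a sketch with hypothetical helper names; rounding down to the nearest allowed size is an illustrative choice, not documented behavior of the web UI.

```python
def is_valid_size(width: int, height: int, multiple: int = 8) -> bool:
    """True if both dimensions are multiples of the required step."""
    return width % multiple == 0 and height % multiple == 0


def snap_dimension(value: int, multiple: int = 8) -> int:
    """Round a requested size down to the nearest allowed multiple (at least one step)."""
    return max(multiple, (value // multiple) * multiple)
```

A 520x512 image, for example, passes with a step of 8 but would have been rejected under the older step of 64, where 520 would have to drop to 512.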

Now with a license!

Reorder elements in the UI from settings screen

Installation and Running

Make sure the required dependencies are met and follow the instructions available for both NVidia (recommended) and AMD GPUs.

Alternatively, use online services (like Google Colab):

List of Online Services

Automatic Installation on Windows

Install Python 3.10.6, checking “Add Python to PATH”.

Install git.

Download the stable-diffusion-webui repository, for example by running git clone https://github.com/AUTOMATIC1111/stable-diffusion-webui.git.

Run webui-user.bat from Windows Explorer as a normal, non-administrator user.

Automatic Installation on Linux

Install the dependencies:

To install in /home/$(whoami)/stable-diffusion-webui/, run:

Run webui.sh.

Installation on Apple Silicon

Find the instructions here.

Contributing

Here’s how to add code to this repo: Contributing

Documentation

The documentation was moved from this README over to the project’s wiki.

Credits

Licenses for borrowed code can be found in the Settings -> Licenses screen, and also in the html/licenses.html file.

Stable Diffusion - https://github.com/CompVis/stable-diffusion, https://github.com/CompVis/taming-transformers

k-diffusion - https://github.com/crowsonkb/k-diffusion.git

GFPGAN - https://github.com/TencentARC/GFPGAN.git

CodeFormer - https://github.com/sczhou/CodeFormer

ESRGAN - https://github.com/xinntao/ESRGAN

SwinIR - https://github.com/JingyunLiang/SwinIR

Swin2SR - https://github.com/mv-lab/swin2sr

LDSR - https://github.com/Hafiidz/latent-diffusion

MiDaS - https://github.com/isl-org/MiDaS

Ideas for optimizations - https://github.com/basujindal/stable-diffusion

Cross Attention layer optimization - Doggettx - https://github.com/Doggettx/stable-diffusion, original idea for prompt editing.

Cross Attention layer optimization - InvokeAI, lstein - https://github.com/invoke-ai/InvokeAI (originally http://github.com/lstein/stable-diffusion)

Sub-quadratic Cross Attention layer optimization - Alex Birch (Birch-san/diffusers#1), Amin Rezaei (https://github.com/AminRezaei0x443/memory-efficient-attention)

Textual Inversion - Rinon Gal - https://github.com/rinongal/textual_inversion (we’re not using his code, but we are using his ideas).

Idea for SD upscale - https://github.com/jquesnelle/txt2imghd

Noise generation for outpainting mk2 - https://github.com/parlance-zz/g-diffuser-bot

CLIP interrogator idea and borrowing some code - https://github.com/pharmapsychotic/clip-interrogator

Idea for Composable Diffusion - https://github.com/energy-based-model/Compositional-Visual-Generation-with-Composable-Diffusion-Models-PyTorch

xformers - https://github.com/facebookresearch/xformers

DeepDanbooru - interrogator for anime diffusers - https://github.com/KichangKim/DeepDanbooru

Sampling in float32 precision from a float16 UNet - marunine for the idea, Birch-san for the example Diffusers implementation (https://github.com/Birch-san/diffusers-play/tree/92feee6)

Instruct pix2pix - Tim Brooks (star), Aleksander Holynski (star), Alexei A. Efros (no star) - https://github.com/timothybrooks/instruct-pix2pix

Security advice - RyotaK

UniPC sampler - Wenliang Zhao - https://github.com/wl-zhao/UniPC

Initial Gradio script - posted on 4chan by an Anonymous user. Thank you Anonymous user.

(You)
