Guide to Fine Tune Stable Diffusion XL
Introduction
Stable Diffusion XL 1.0 (SDXL 1.0) is an advanced deep generative model specifically engineered for generating high-quality images from textual descriptions. It represents a significant enhancement over its predecessor, the Stable Diffusion model.
Fine-tuning the Stable Diffusion XL model involves the process of refining its parameters and optimizing its performance for a particular task or dataset. This method leverages the pre-existing knowledge encoded within the model through prior training on a diverse dataset, then tailors it to better suit the nuances and intricacies of a specific domain or task. In essence, fine-tuning serves as a form of transfer learning, where the model's existing understanding is adapted and enhanced to excel in a targeted area of application.
What is Fine-Tuning?
Fine-tuning refers to the process of modifying a pre-existing, pre-trained model to cater to a new, specific task by training it on a smaller dataset related to the new task. This approach leverages the existing knowledge gained from the pre-training phase, thereby reducing the need for extensive data and resources.
In the Context of Neural Networks and Deep Learning:
In the specific context of neural networks and deep learning, fine-tuning is typically executed by adjusting the parameters of a pre-trained model. This adjustment is made using a smaller, task-specific dataset. The pre-trained model, having already learned a set of features from a large dataset, is further trained on the new dataset to adapt these features to the new task.
How to Create a Fine-Tuning Job?
To initiate the Fine-Tuning Job process, first, the user should navigate to the sidebar section and select Foundation Studio. Upon selecting Foundation Studio, a dropdown menu will appear, featuring an option labeled Fine-Tune Models.
-
Upon clicking the Fine-Tune Models option, the user will be directed to the "Manage Fine-Tuning Jobs" page.
-
After redirecting to the Manage Fine-Tuning Jobs, users can locate and click on the Create Fine-Tuning Job button or Click here button to create Fine-Tune models.
-
After clicking the Create Fine-Tuning Job button, the Create Fine-Tuning Job page will open. On this page, there are several options such as Job-Name, Model, and Hugging Face Token.
-
If the user already has an integration with a Hugging Face token, they can select it from the dropdown options. If the user does not have any integration setup with Hugging Face, they can click on the Create a new one, and the Create Integration page will open. After adding a token, the user can move to the next stage by clicking on the Create button.
If the user doesn't have a Hugging Face token, they will not be able to access certain services. In this case, you need to sign up for an account on Hugging Face and obtain an HF token to use their services.
To obtain a Hugging Face token, you can follow these steps:
- Go to the Hugging Face website and create an account if you haven't already.
- Once you have created an account, log in and go to your account settings.
- Click on the Tokens tab.
- Click on the "New Access Token" button.
- Give your token a name and select the permissions you want to grant to the token.
- Click on the "Create" button.
- Your new token will be displayed. Make sure to copy it and store it in a safe place, as you will not be able to see it again after you close the window.
Some models are available for commercial use but require access granted by their Custodian/Administrator (creator/maintainer of this model). You can visit the model card on Hugging Face to initiate the process.
How to Define Dataset Preparation?
After defining the Job Model configuration, users can move on to the next section for Dataset Preparation. The Dataset page will open, providing several options such as Select Task, Dataset Type, Choose a Dataset, Validation Split Ratio, and Prompt Configuration. Once these options are filled, the dataset preparation configuration will be set, and the user can move to the next section.
Dataset Type
In the Dataset Type, you can select either CUSTOM or HUGGING FACE as the dataset type. The CUSTOM Dataset Type allows training models with user-provided data, offering flexibility for unique tasks. Alternatively, the Hugging Face option provides a variety of pre-existing datasets, enabling convenient selection and utilization for model training.
Choosing a Custom Dataset
If you select the dataset type as CUSTOM, you will need to choose a user-defined dataset by clicking the CHOOSE button.
After clicking the CHOOSE button, you will see the screen below if you have already created objects in the selected dataset. You can create a new dataset by clicking the click here link.
Note: The listed datasets here use an EOS Bucket for data storage.
- For Models like Stable Diffusion XL, the dataset provided should be in the following format: one metadata file mapping images with the context text and a list of images in PNG format.
folder_name/metadata.jsonl
folder_name/0001.png
folder_name/0002.png
folder_name/0003.png
-
structure of metadata.jsonl file
{"file_name": "0001.png", "text": "This is a first value of a text feature you added to your image"}
{"file_name": "0002.png", "text": "This is a second value of a text feature you added to your image"}
{"file_name": "0003.png", "text": "This is a third value of a text feature you added to your image"}
Uploading incorrect dataset format will result into finetuning run failure
PROCESS TO UPLOAD DATASET
-
After selecting the dataset, you can upload objects in a particular dataset by selecting the dataset and clicking on the UPLOAD DATASET button.
-
Click on the UPLOAD DATASET button, upload the objects, and click on the UPLOAD button.
-
After uploading objects to a specific dataset, choose a particular file to continue and then click on the SUBMIT button.
How to define a Hyperparameter Configuration?
-
Upon providing the dataset preparation details, users are directed to the Hyperparameter Configuration page. This interface allows users to customize the training process by specifying desired hyperparameters, thereby facilitating effective hyperparameter tuning. The form provided enables the selection of various hyperparameters, including but not limited to training type, epoch, learning rate, and max steps. Please fill out the form meticulously to optimize the model training process.
-
In addition to the standard hyperparameters, the configuration page offers advanced options such as batch size and gradient accumulation steps. These settings can be utilized to further refine the training process. Users are encouraged to explore and employ these advanced options as needed to achieve optimal model performance.
-
Upon specifying the advanced settings, users are advised to leverage the WandB Integration feature for comprehensive job tracking. This involves filling in the necessary details in the provided interface. By doing so, users can effectively monitor and manage the model training process, ensuring transparency and control throughout the lifecycle of the job.
-
Users can also describe the debug option as desired for debugging purposes or quicker training.
-
Once the debug option has been thoroughly addressed, users are required to select their preferred machine configuration for the fine-tuning job. Clicking on the LAUNCH button will initiate or schedule the job, depending on the chosen settings. To ensure fast and precise training, a variety of high-performance GPUs, such as Nvidia H100 and A100, are available for selection. This allows users to optimize their resources and accelerate the model training process.
Viewing your Job parameters and Fine-tuned models
-
Upon completion of the job, a Fine-Tuned model will be created and displayed in the models section at the lower part of the page. This fine-tuned model repo will contain all checkpoints of model training as well as adapters built during training. Users can also directly go to the model repo page under inference to view it.
-
If desired, users can view job parameter details in the overview section of the job as shown below.
-
Users can effortlessly access the logs of the model training by clicking on the run of the fine-tuning job they've created. These logs serve as a valuable resource, aiding users in debugging the training process and identifying any potential errors that could impede the model's progress.
-
Upon successful completion of training, the model will be seamlessly pushed, ensuring smooth integration with our existing model infrastructure.
Running inference on Fine-tuned Stable Diffusion XL model
Upon successful completion of model training, the fine-tuned Stable Diffusion XL model can be conveniently accessed and viewed under the Models section.
.. image:: stableDiffusionXL_images/sdxl-view.png :class: bordered-image
Deployment of Stable Diffusion XL
Selecting the Deploy icon reveals a sidebar featuring different menus, including Resource details, Endpoint details, and Environment Variables.
-
Within the Resource details menu, users can specify their desired CPU/GPU configuration and set the number of replicas. Once configurations are finalized, users can proceed by clicking on the Next button.
-
In the Endpoint details section, users can either enter a custom name for the endpoint or opt to use the default name provided. We have used the default name.
-
Complete the Environment Variables section.
-
Following the completion of the aforementioned steps, proceed by selecting the Create option. This action will initiate the creation process for your model endpoint.
Running inference on the Model Endpoint
Navigate to the Inference section and select the Model Endpoint option to access the details of your created model endpoint. Here, you can monitor the status of your model endpoint. Once the status changes to Running, you can proceed by clicking on the API Request icon to make requests to your endpoint.
A sidebar menu titled Sample API Request will appear. Within this menu, you have two options for running inference:
-
Curl Script: Execute the provided curl script on your terminal of your local machine.
-
Python Script: Run the provided Python script on your terminal or any code editor of your choice.
Ensure to replace the placeholder $AUTH_TOKEN
in the script with your actual API authentication token. You can locate your authentication token in the "API Tokens" section. If you don't have an authentication token, you can generate a new one from the same section.
For Authentication, refer to this documentation: Click here.