# **DreamCanvas - AI-Powered Creativity**
DreamCanvas is an AI-powered image generator that lets users create high-quality, creative images through ComfyUI, with prompt assistance from a local Large Language Model (LLM) served via Ollama. The project includes a FastAPI backend, a dynamic web interface, and support for user-configurable models and servers.
---
## **Table of Contents**
- [Setup](#setup)
- [Requirements](#requirements)
- [Installation](#installation)
- [Running the Server](#running-the-server)
- [Running with Docker](#running-with-docker)
- [Functionality](#functionality)
- [Positive and Negative Prompts](#positive-and-negative-prompts)
- [LLM-Assisted Prompt Generation](#llm-assisted-prompt-generation)
- [Quick Prompts](#quick-prompts)
- [Image Caching and Navigation](#image-caching-and-navigation)
- [UI Reset](#ui-reset)
- [Architecture](#architecture)
- [Backend](#backend)
- [Key Endpoints](#key-endpoints)
- [Frontend](#frontend)
- [UI Components](#ui-components)
- [Tools and Libraries](#tools-and-libraries)
- [Testing](#testing)
- [Kill Server](#kill-server)
---
## **Setup**
### **Requirements**
1. **Conda Environment**:
- The project uses Conda for environment management. Make sure Conda is installed on your system.
2. **ComfyUI**:
- ComfyUI should be installed and running. You must have the checkpoint `realvisxlV50Lightning.Ng9I.safetensors` installed in the `checkpoints` folder for the workflow.
- Alternatively, you can modify `workflow.json` to use any other model/checkpoint.
3. **Ollama**:
- The Ollama LLM server should be installed and accessible.
4. **Configuration via `.env`**:
- The project uses a `.env` file to configure server addresses. Example settings:
```bash
COMFYUI_SERVER_ADDRESS=192.168.1.10:8188
OLLAMA_SERVER_ADDRESS=192.168.1.10:11436
```
- Adjust these values to match your environment; a sketch of how they are read at runtime follows this list.
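For reference, here is a minimal sketch of reading these settings at runtime. It assumes the `python-dotenv` package is available; the actual backend may load them differently, and the fallback addresses are only illustrative defaults.
```python
# Minimal sketch: read server addresses from .env (assumes python-dotenv;
# the real backend may do this differently).
import os

from dotenv import load_dotenv

load_dotenv()  # loads variables from .env in the current working directory

# Fallbacks below are illustrative defaults, not values used by the project.
comfyui_address = os.getenv("COMFYUI_SERVER_ADDRESS", "127.0.0.1:8188")
ollama_address = os.getenv("OLLAMA_SERVER_ADDRESS", "127.0.0.1:11434")

print(f"ComfyUI: http://{comfyui_address}")
print(f"Ollama:  http://{ollama_address}")
```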
### **Installation**
1. **Clone the Repository**:
```bash
git clone https://github.com/Teachings/DreamCanvas.git
cd DreamCanvas
```
2. **Set Up Conda Environment**:
Create and activate the Conda environment:
```bash
conda create --name dreamcanvas python=3.12
conda activate dreamcanvas
```
3. **Install Dependencies**:
Install the project dependencies listed in `requirements.txt`:
```bash
pip install -r requirements.txt
```
4. **Set Up `.env` File**:
Ensure the `.env` file exists in the project root and contains the correct server addresses for ComfyUI and Ollama.
```bash
COMFYUI_SERVER_ADDRESS=192.168.1.10:8188
OLLAMA_SERVER_ADDRESS=192.168.1.10:11436
```
---
## **Running the Server**
### **Local Environment**
To run the FastAPI server in your local environment, use the following command:
```bash
uvicorn backend.main:app --reload --host 0.0.0.0 --port 8000
```
This will start the app on `http://localhost:8000/`.
To verify that ComfyUI is reachable and functioning correctly, you can test the connection using the workflow defined in `workflow.json`; a quick scripted check is sketched below.
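If you want a lightweight connectivity check before running a full workflow, something like the following can be used. It assumes the `requests` package is installed and that your ComfyUI build exposes the `/system_stats` endpoint; adjust the addresses to match your `.env`.
```python
# Quick connectivity check for DreamCanvas and ComfyUI.
# Assumes `requests` is installed; adjust addresses to match your .env file.
import requests

COMFYUI_SERVER_ADDRESS = "192.168.1.10:8188"  # same value as COMFYUI_SERVER_ADDRESS in .env

# The DreamCanvas web UI should answer on port 8000.
r = requests.get("http://localhost:8000/", timeout=5)
print("DreamCanvas:", r.status_code)

# Most recent ComfyUI builds expose a lightweight /system_stats endpoint.
r = requests.get(f"http://{COMFYUI_SERVER_ADDRESS}/system_stats", timeout=5)
print("ComfyUI:", r.status_code)
```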
---
## **Running with Docker**
If you prefer to run the application inside a Docker container, the following steps guide you through building and running the containerized application.
### **1. Build the Docker Image**
Navigate to the project directory and build the Docker image:
```bash
docker build -t dreamcanvas .
```
### **2. Run the Docker Container**
Once the Docker image is built, run the container:
```bash
docker run -d -p 8000:8000 --env-file .env --name dreamcanvas dreamcanvas
```
This command will:
- Start the container in **detached mode** (`-d`).
- Map port **8000** of the container to port **8000** on your host.
- Use the `.env` file to set environment variables.
### **3. Access the Application**
You can now access the application at `http://localhost:8000/`.
---
## **Functionality**
### **Positive and Negative Prompts**
- **Positive Prompt**: Specifies the elements to include in the generated image (e.g., "4k, highly detailed, hyperrealistic").
- **Negative Prompt**: Specifies elements to avoid in the image (e.g., "blurry, watermark").
### **LLM-Assisted Prompt Generation**
- **Ask LLM for a Creative Idea**: The user can request a creative prompt suggestion from a locally hosted LLM (Ollama). The generated prompt can be applied to the positive prompt field.
### **Quick Prompts**
- **Preconfigured Prompts**: Both positive and negative quick prompts are available via buttons. Clicking a button auto-fills the corresponding input field.
- **Custom Prompts**: Themed prompts (like **Halloween** or **Christmas**) are dynamically loaded from the `quick_prompts.json` file. Adding a new theme is as simple as editing this file (see the sketch below).
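For example, a new theme can be added either by editing `quick_prompts.json` by hand or with a small script like the one below. The file path and the `"Winter"` entries are hypothetical; the entry format follows the `/quick_prompts/` response shown in [Key Endpoints](#key-endpoints).
```python
# Hypothetical example: append a "Winter" theme to quick_prompts.json.
# The file path is assumed to be the project root; adjust if your layout differs.
import json

with open("quick_prompts.json", "r+") as f:
    prompts = json.load(f)
    prompts["Winter"] = [
        {"label": "Snowfall", "value": "snowfall"},
        {"label": "Aurora", "value": "aurora borealis"},
    ]
    f.seek(0)
    json.dump(prompts, f, indent=2)
    f.truncate()
```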
### **Image Caching and Navigation**
- **Image History**: The app caches generated images within the session. Users can navigate through cached images using the **Previous** and **Next** buttons.
- **Cache Clearing**: Cached images are cleared when the browser is refreshed or when the **Reset** button is clicked.
### **UI Reset**
- The **Reset** button clears all input fields, resets generated images, and clears the image cache.
---
## **Architecture**
### **Backend**
The backend is powered by **FastAPI** and handles the following operations:
- Generating images using ComfyUI.
- Fetching creative suggestions from the local LLM.
- Serving quick prompts from configuration files.
#### **Key Endpoints**
The main API endpoints are listed below; a small client sketch exercising them follows the list.
1. **POST /generate_images/**
- **Description**: Generates an AI image using the provided prompts and image settings.
- **Request Example**:
```json
{
  "positive_prompt": "a beautiful sunset",
  "negative_prompt": "blurry",
  "steps": 25,
  "width": 512,
  "height": 512
}
```
- **Response**: A binary stream containing the generated image.
2. **POST /ask_llm/**
- **Description**: Requests a creative prompt from the local LLM server (Ollama).
- **Request Example**:
```json
{
  "positive_prompt": "a beautiful sunset"
}
```
- **Response**:
```json
{
  "assistant_reply": "How about a stunning sunset over the mountains with golden light reflecting on the water?"
}
```
3. **GET /quick_prompts/**
- **Description**: Retrieves quick prompt configurations from the `quick_prompts.json` file for dynamic UI updates.
- **Response Example**:
```json
{
  "Positive Quick Prompts": [
    { "label": "4K", "value": "4K" },
    { "label": "Highly Detailed", "value": "highly detailed" }
  ],
  "Negative Quick Prompts": [
    { "label": "Blurry", "value": "blurry" },
    { "label": "Watermark", "value": "watermark" }
  ],
  "Halloween": [
    { "label": "Black Background", "value": "black background" },
    { "label": "Witch", "value": "witch" }
  ]
}
```
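Taken together, the endpoints above can be exercised with a short client script. The sketch below is illustrative only: it assumes the `requests` package, a server running on `localhost:8000`, and that the endpoints accept JSON request bodies as shown in the examples above; the output filename is arbitrary.
```python
# Illustrative DreamCanvas API client (assumes `requests` is installed and the
# server is running on localhost:8000; JSON request bodies are an assumption
# based on the request examples above).
import requests

BASE = "http://localhost:8000"

# 1. Generate an image and save the returned binary stream to disk.
payload = {
    "positive_prompt": "a beautiful sunset",
    "negative_prompt": "blurry",
    "steps": 25,
    "width": 512,
    "height": 512,
}
r = requests.post(f"{BASE}/generate_images/", json=payload, timeout=300)
r.raise_for_status()
with open("generated.png", "wb") as f:  # filename is arbitrary
    f.write(r.content)

# 2. Ask the local LLM for a creative prompt suggestion.
r = requests.post(f"{BASE}/ask_llm/", json={"positive_prompt": "a beautiful sunset"}, timeout=120)
print(r.json()["assistant_reply"])

# 3. Fetch the quick prompt configuration used by the UI.
print(requests.get(f"{BASE}/quick_prompts/", timeout=10).json())
```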
### **Frontend**
The frontend is built with HTML, CSS, and JavaScript. It dynamically interacts with the backend for:
- Generating images.
- Fetching creative prompts from the LLM.
- Loading quick prompt configurations from `quick_prompts.json`.
#### **UI Components**
1. **Image Generation Form**:
- Includes fields for positive and negative prompts, image steps, width, and height.
- Quick prompt buttons for easy input.
2. **LLM Integration**:
- A section that allows users to request and apply creative prompts generated by the LLM.
3. **Image Display and Navigation**:
- Displays the generated images and includes buttons for navigating through cached images.
4. **Reset Functionality**:
- A **Reset** button to clear all input fields and generated image history.
### **Tools and Libraries**
1. **FastAPI**: Web framework for building the backend.
2. **Uvicorn**: ASGI server used to run the FastAPI application.
3. **Ollama**: Locally hosted LLM for generating creative prompts.
4. **Pillow**: Python Imaging Library used to handle image operations.
5. **Bootstrap**: CSS framework for styling the UI.
6. **JavaScript (Fetch API)**: Handles asynchronous requests to the backend.
---
## **Testing**
You can test the ComfyUI workflow by running the FastAPI server as described above. Use the `/generate_images/` endpoint to generate images and validate that the workflow is functioning correctly.
To test the LLM integration, use the `/ask_llm/` endpoint to request a creative prompt from the locally hosted Ollama LLM.
For Docker-based setups, ensure that the `.env` file is correctly set up with the server addresses and run the container as described in the [Running with Docker](#running-with-docker) section.
---
## **Kill Server**
If you need to force-kill the server process, you can use the following command (it kills whatever process is using port 8000):
```bash
sudo kill -9 $(sudo lsof -t -i :8000)
```