Sunday March 1, 2026

Home | Contact | Support | Artificial Intelligence (AI)... going beyond 'knowledge' ....

101 Generative AI Projects

Diffusion Models, Transformers, ChatGPT, and Other LLMs ...

Are you Ready? Ready to go beyond the hype and actually build with AI? This book is your complete hands-on guide to creating text generators, image transformers, voice models, chatbots, and more — all using open-source tools, local models, and step-by-step instructions.

Perfect for learners, makers, and engineers, this project-based guide walks you from the basics of running a simple language model on your laptop to advanced fine-tuning, multimodal workflows, and real deployment techniques.

Every chapter is a self-contained project — complete with Python code, environment setup, and friendly guidance to help you get it running, even if it's your first time touching machine learning.

The following gives you an overview of the chapters - also some insights of the code and expected generated results.

PART I: Getting Started – Text & CLI Projects (Beginner, Local, CPU-Friendly)

Hello AI: Your First Text Generator with GPT-2 (Local) Beginner +

Writing Prompts Auto-Finisher Using GPT-Neo +

Haiku Generator with Simple Markov Chains +

Story Generator Using GPT-J (Offline) +

Text Summarizer CLI with T5 Small

Synonym Replacer with Sentence Transformers

Extract Keywords from Text Using spaCy + Transformers

Simple Zero-Shot Classifier with DistilBERT

Compare Model Outputs Side-by-Side

Build a Dataset Cleaner CLI

PART II: Working with Audio – Essential Projects Only

Text-to-Speech with Bark or Coqui TTS +

Remove Background Noise from Audio (Demucs)

Whisper-Based Audio Transcriber (Offline)

AI-Powered Subtitler (Whisper + FFmpeg)

Generate Singing Voice from Text (VITS + RVC)

PART III: Text2Image & Image2Text – Foundational Visual AI

Text-to-Image Generator with Stable Diffusion (Low VRAM Mode) +

Basic Image Captioning with BLIP-2

Generate AI Stickers from Prompts

Image Tag Generator with CLIP

Describe an Image with MiniGPT-4

Generate Art from Text Using VQGAN+CLIP

Meme Generator: Text + Image + Captioning

ASCII Art Generator from Text

Emoji Art Generator from Text Prompts

PART IV: Intermediate Vision Projects – Local GPU Recommended

Inpaint with Text Prompts Using Stable Diffusion +

Upscale Images with Real-ESRGAN +

Create Image Variations with Diffusers +

Outpaint an Image Beyond Its Borders +

Turn a Child's Drawing into a Professional Illustration

AI-Powered Coloring Book from Sketches

Face Swap with Autoencoders

Convert Sketch to Art with ControlNet

Generate an AI Avatar from a Text Prompt

Super-Resolution Pipeline with SwinIR

PART V: Fine-Tuning and Model Training Projects

Fine-Tune GPT-2 on Your Custom Text Dataset

Fine-Tune T5 for Summarization

Train a Captioning Model on Your Own Photos

Fine-Tune a BERT Classifier with Hugging Face

Train a Custom Tokenizer with SentencePiece

Create a BLEU/ROUGE Evaluation Tool

Track Model Training with Weights & Biases

Train a Transformer from Scratch (Tiny Dataset)

Quantize a Model Using BitsAndBytes

Train a LoRA Adapter on Your Own Prompts

PART VI: LLMs on Your PC – Local-First Language Models

Run LLaMA 2 Locally

Run StableLM 3B on Consumer Hardware

Run Mistral-7B on a 16GB PC

Fine-Tune LLaMA 2 with QLoRA

Fine-Tune StableLM for Domain-Specific Tasks

Build a Local GPT Chatbot (CLI Only)

Build Your Own Retrieval-Augmented Chatbot (RAG + FAISS)

Run WizardCoder and CodeLLaMA for Code Gen

Compare Multiple Local LLMs Side-by-Side (LLaMA 2 vs StableLM vs Mistral)

Create a Multi-LLM Chat Router (AutoModel)

Compare 7B vs 13B vs 65B Models on the Same Task

Switch Between GGUF, GPTQ, and LoRA Models

How to Choose the Right LLM for Your GPU (Memory Guide)

Build a CLI Tool to Load Any HF Text Model Dynamically

Benchmark Speed and Token Output Rate of Popular LLMs

Swap Tokenizers and Prompt Formats (ChatML, Alpaca, Vicuna)

PART VII: Advanced Text2Image Projects & Model Variant Management

Run Stable Diffusion 1.5 vs 2.1 vs SDXL and Compare Outputs +

Compare Open Source Text2Image Models (SDXL, Kandinsky, DeepFloyd IF)

Guide: Choose the Best Text2Image Model for 8GB, 12GB, 24GB GPUs

Adjust Resolution, Batch Size, and Memory in Diffusers

Run Stable Diffusion with Diffusers vs CompVis vs InvokeAI

Fine-Tune Only UNet or Text Encoder (Selective Training)

Auto-Download Text2Image Models from CivitAI or HuggingFace

Animate a Still Portrait with AI Voice

Create a Comic Book Generator (Text → Panels + Bubbles)

Train Your Own Concept Using DreamBooth

Text-to-3D Object Generator +

Style Transfer Using Diffusion Models

Video Frame Interpolation with AI

Motion LoRA for Text-to-Video Experiments

Style-Based Face Generator with GANs

Generate a Music Video from Lyrics

PART VIII: Web Apps & Interfaces (Gradio / FastAPI)

Stable Diffusion Web UI with Gradio

Voice-Controlled Image Generator (Speech → Text → Image)

Multimodal Chatbot (Image + Text Inputs)

Custom AI Assistant Trained on Your Notes

Travel Planner: AI-Generated Itinerary Web App

Local Recipe Generator with OCR + GPT

AI-Powered Text Adventure Game

Convert Handwritten Notes to Markdown

Summarize PDF Files with LangChain & T5

PART IX: Deployment, Optimization, Scaling

Serve a Model with FastAPI

Create Dockerized AI App for Deployment

Use CUDA Efficiently for Local Inference

Monitor VRAM and Performance of AI Apps

Run Transformers with ONNX for Speed

Build a Task Queue for Model Serving (Celery + Redis)

Quantize and Compress Models for Local Use

Serve Multiple Models Concurrently

Deploy a Discord Bot Powered by AI

Create a CLI Installer for AI Apps (with env checks)

PART X: Ethical, Security, and Creative Use Cases

Detect AI-Generated Text with DetectGPT

Watermark AI-Generated Images

Bias Detection in Generated Text Outputs

Simulate AI vs. AI Dialogues (LLM vs. LLM)

Create a Parody Generator with Custom Prompts

Build a Secure AI API with Token Access

Resources

• Python Programming [LINK]

• Artificial Intellengence (AI) [LINK]

• Data Mining & Machine Learning [LINK]

Advert (Support Website)

Visitor:

Copyright (c) 2002-2025 xbdev.net - All rights reserved.
Designated articles, tutorials and software are the property of their respective owners.