www.xbdev.net
xbdev - software development
Monday February 9, 2026
Home | Contact | Support | Artificial Intelligence (AI)... going beyond 'knowledge' ....
     
 

101 Generative AI Projects

Diffusion Models, Transformers, ChatGPT, and Other LLMs ...

(Amazon)
 

Are you Ready? Ready to go beyond the hype and actually build with AI? This book is your complete hands-on guide to creating text generators, image transformers, voice models, chatbots, and more — all using open-source tools, local models, and step-by-step instructions.

Perfect for learners, makers, and engineers, this project-based guide walks you from the basics of running a simple language model on your laptop to advanced fine-tuning, multimodal workflows, and real deployment techniques.

Every chapter is a self-contained project — complete with Python code, environment setup, and friendly guidance to help you get it running, even if it's your first time touching machine learning.

The following gives you an overview of the chapters - also some insights of the code and expected generated results.

PART I: Getting Started – Text & CLI Projects (Beginner, Local, CPU-Friendly)
Hello AI: Your First Text Generator with GPT-2 (Local) Beginner +
Writing Prompts Auto-Finisher Using GPT-Neo +
Haiku Generator with Simple Markov Chains +
Story Generator Using GPT-J (Offline) +
Text Summarizer CLI with T5 Small
Synonym Replacer with Sentence Transformers
Extract Keywords from Text Using spaCy + Transformers
Simple Zero-Shot Classifier with DistilBERT
Compare Model Outputs Side-by-Side
Build a Dataset Cleaner CLI
PART II: Working with Audio – Essential Projects Only
Text-to-Speech with Bark or Coqui TTS +
Remove Background Noise from Audio (Demucs)
Whisper-Based Audio Transcriber (Offline)
AI-Powered Subtitler (Whisper + FFmpeg)
Generate Singing Voice from Text (VITS + RVC)
PART III: Text2Image & Image2Text – Foundational Visual AI
Text-to-Image Generator with Stable Diffusion (Low VRAM Mode) +
Basic Image Captioning with BLIP-2
Generate AI Stickers from Prompts
Image Tag Generator with CLIP
Describe an Image with MiniGPT-4
Generate Art from Text Using VQGAN+CLIP
Meme Generator: Text + Image + Captioning
ASCII Art Generator from Text
Emoji Art Generator from Text Prompts
PART IV: Intermediate Vision Projects – Local GPU Recommended
Inpaint with Text Prompts Using Stable Diffusion +
Upscale Images with Real-ESRGAN +
Create Image Variations with Diffusers +
Outpaint an Image Beyond Its Borders +
Turn a Child's Drawing into a Professional Illustration
AI-Powered Coloring Book from Sketches
Face Swap with Autoencoders
Convert Sketch to Art with ControlNet
Generate an AI Avatar from a Text Prompt
Super-Resolution Pipeline with SwinIR
PART V: Fine-Tuning and Model Training Projects
Fine-Tune GPT-2 on Your Custom Text Dataset
Fine-Tune T5 for Summarization
Train a Captioning Model on Your Own Photos
Fine-Tune a BERT Classifier with Hugging Face
Train a Custom Tokenizer with SentencePiece
Create a BLEU/ROUGE Evaluation Tool
Track Model Training with Weights & Biases
Train a Transformer from Scratch (Tiny Dataset)
Quantize a Model Using BitsAndBytes
Train a LoRA Adapter on Your Own Prompts
PART VI: LLMs on Your PC – Local-First Language Models
Run LLaMA 2 Locally
Run StableLM 3B on Consumer Hardware
Run Mistral-7B on a 16GB PC
Fine-Tune LLaMA 2 with QLoRA
Fine-Tune StableLM for Domain-Specific Tasks
Build a Local GPT Chatbot (CLI Only)
Build Your Own Retrieval-Augmented Chatbot (RAG + FAISS)
Run WizardCoder and CodeLLaMA for Code Gen
Compare Multiple Local LLMs Side-by-Side (LLaMA 2 vs StableLM vs Mistral)
Create a Multi-LLM Chat Router (AutoModel)
Compare 7B vs 13B vs 65B Models on the Same Task
Switch Between GGUF, GPTQ, and LoRA Models
How to Choose the Right LLM for Your GPU (Memory Guide)
Build a CLI Tool to Load Any HF Text Model Dynamically
Benchmark Speed and Token Output Rate of Popular LLMs
Swap Tokenizers and Prompt Formats (ChatML, Alpaca, Vicuna)
PART VII: Advanced Text2Image Projects & Model Variant Management
Run Stable Diffusion 1.5 vs 2.1 vs SDXL and Compare Outputs +
Compare Open Source Text2Image Models (SDXL, Kandinsky, DeepFloyd IF)
Guide: Choose the Best Text2Image Model for 8GB, 12GB, 24GB GPUs
Adjust Resolution, Batch Size, and Memory in Diffusers
Run Stable Diffusion with Diffusers vs CompVis vs InvokeAI
Fine-Tune Only UNet or Text Encoder (Selective Training)
Auto-Download Text2Image Models from CivitAI or HuggingFace
Animate a Still Portrait with AI Voice
Create a Comic Book Generator (Text → Panels + Bubbles)
Train Your Own Concept Using DreamBooth
Text-to-3D Object Generator +
Style Transfer Using Diffusion Models
Video Frame Interpolation with AI
Motion LoRA for Text-to-Video Experiments
Style-Based Face Generator with GANs
Generate a Music Video from Lyrics
PART VIII: Web Apps & Interfaces (Gradio / FastAPI)
Stable Diffusion Web UI with Gradio
Voice-Controlled Image Generator (Speech → Text → Image)
Multimodal Chatbot (Image + Text Inputs)
Custom AI Assistant Trained on Your Notes
Travel Planner: AI-Generated Itinerary Web App
Local Recipe Generator with OCR + GPT
AI-Powered Text Adventure Game
Convert Handwritten Notes to Markdown
Summarize PDF Files with LangChain & T5
PART IX: Deployment, Optimization, Scaling
Serve a Model with FastAPI
Create Dockerized AI App for Deployment
Use CUDA Efficiently for Local Inference
Monitor VRAM and Performance of AI Apps
Run Transformers with ONNX for Speed
Build a Task Queue for Model Serving (Celery + Redis)
Quantize and Compress Models for Local Use
Serve Multiple Models Concurrently
Deploy a Discord Bot Powered by AI
Create a CLI Installer for AI Apps (with env checks)
PART X: Ethical, Security, and Creative Use Cases
Detect AI-Generated Text with DetectGPT
Watermark AI-Generated Images
Bias Detection in Generated Text Outputs
Simulate AI vs. AI Dialogues (LLM vs. LLM)
Create a Parody Generator with Custom Prompts
Build a Secure AI API with Token Access



Resources


• Python Programming [LINK]

• Artificial Intellengence (AI) [LINK]

• Data Mining & Machine Learning [LINK]





 
Advert (Support Website)

 
 Visitor:
Copyright (c) 2002-2025 xbdev.net - All rights reserved.
Designated articles, tutorials and software are the property of their respective owners.