Create a private Q&A chatbot over your documents without relying on the internet. Leverage the power of local LLMs for complete privacy and security - none of your data ever leaves your local environment.
Inspired by imartinez/privateGPT
- 100% Private - All processing happens locally. Your documents never leave your machine
- No Internet Required - Works completely offline after initial setup
- Multiple Document Formats - Support for PDF, TXT, DOC, DOCX, and more
- Local LLM Support - Uses GPT4All for local language model inference
- Interactive UI - Clean web interface for document upload and chat
- Document Ingestion - Automatic chunking and embedding of your documents
- Source Citations - See which parts of your documents informed each answer
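The "Document Ingestion" feature above splits each document into overlapping chunks before embedding them. A minimal sketch of that idea in plain Python - the chunk size and overlap values are illustrative, not EmbedAI's actual defaults:

```python
# Sketch of fixed-size chunking with overlap, as done when ingesting
# documents for embedding. Values here are examples only.

def chunk_text(text: str, chunk_size: int = 500, overlap: int = 50) -> list[str]:
    """Split text into overlapping character chunks."""
    if chunk_size <= overlap:
        raise ValueError("chunk_size must exceed overlap")
    step = chunk_size - overlap
    chunks = []
    for start in range(0, len(text), step):
        chunk = text[start:start + chunk_size]
        if chunk:
            chunks.append(chunk)
    return chunks

chunks = chunk_text("a" * 1200, chunk_size=500, overlap=50)
print(len(chunks))  # 3 chunks: 0-500, 450-950, 900-1200
```

The overlap keeps sentences that straddle a chunk boundary from being lost to both neighbors.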
- Python 3.8 or later
- Node.js v18.12.1 or later
- Minimum 16GB RAM
- Clone the repository

  ```shell
  git clone https://github.com/SamurAIGPT/EmbedAI.git
  cd EmbedAI
  ```

- Start the client

  ```shell
  cd client
  npm install
  npm run dev
  ```

- Start the server (in a new terminal)

  ```shell
  cd server
  pip install -r requirements.txt
  python privateGPT.py
  ```

- Open the app

  Navigate to http://localhost:3000 and click "Download Model" to get the required LLM.
```
EmbedAI/
├── client/            # Next.js frontend
│   ├── components/    # React components
│   └── pages/         # App pages
└── server/            # Python backend
    ├── privateGPT.py  # Main server
    └── ingest.py      # Document processing
```
- Upload Documents - Drop your files into the web interface
- Automatic Processing - Documents are chunked and embedded locally
- Ask Questions - Chat naturally with your document corpus
- Get Answers - Receive responses with source citations
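The question-answering flow above can be pictured as: embed the question, rank every stored chunk by similarity, and return the best matches as the cited sources. A self-contained sketch with hand-made vectors - real embeddings come from a local embedding model, and the chunk names are invented for illustration:

```python
# Toy top-k retrieval over embedded chunks via cosine similarity.
# Vectors are hand-made so the example runs without any model.
import math

def cosine(a: list[float], b: list[float]) -> float:
    dot = sum(x * y for x, y in zip(a, b))
    na = math.sqrt(sum(x * x for x in a))
    nb = math.sqrt(sum(x * x for x in b))
    return dot / (na * nb)

def top_k_sources(query_vec: list[float],
                  chunk_vecs: dict[str, list[float]],
                  k: int = 2) -> list[str]:
    scored = sorted(chunk_vecs.items(),
                    key=lambda kv: cosine(query_vec, kv[1]),
                    reverse=True)
    return [name for name, _ in scored[:k]]

chunk_vecs = {
    "intro.txt#0": [0.9, 0.1, 0.0],
    "api.md#3":    [0.1, 0.9, 0.1],
    "notes.txt#7": [0.0, 0.2, 0.9],
}
print(top_k_sources([0.8, 0.2, 0.1], chunk_vecs))  # ['intro.txt#0', 'api.md#3']
```

The retrieved chunk identifiers are what surface in the UI as source citations alongside each answer.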
| Format | Extension |
|---|---|
| PDF | .pdf |
| Text | .txt |
| Word | .doc, .docx |
| Markdown | .md |
The default model is GPT4All. You can configure:
- Model selection
- Chunk size for document processing
- Number of source documents to retrieve
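As a rough picture, the settings above amount to a handful of values. The key names below are hypothetical and do not correspond to EmbedAI's actual configuration keys; the model filename is the GPT4All-J model commonly used by privateGPT-style projects, shown as an example:

```python
# Hypothetical configuration sketch - key names and values are
# illustrative, not EmbedAI's real settings.
config = {
    "model": "ggml-gpt4all-j-v1.3-groovy",  # example GPT4All model file
    "chunk_size": 500,           # characters per document chunk
    "chunk_overlap": 50,         # overlap between consecutive chunks
    "target_source_chunks": 4,   # number of source documents to retrieve
}
```

Smaller chunks and fewer retrieved sources reduce the context the LLM must process, trading recall for speed.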
Model download fails?
- Ensure you have a stable internet connection for the initial model download
- Check that you have enough disk space (~4GB for the model)
Slow responses?
- Local LLM inference requires significant CPU/RAM
- Consider using a machine with at least 16GB RAM
Context window errors?
- Try reducing the document chunk size
- Split large documents into smaller files
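Splitting a large document into smaller files can be done with a few lines of Python. The 2000-character limit is an arbitrary example, and `split_file` is a hypothetical helper, not part of EmbedAI:

```python
# Split one large text file into smaller numbered files before
# ingestion, so each piece stays well within the context window.
from pathlib import Path

def split_file(path: Path, max_chars: int = 2000) -> list[Path]:
    """Write path's contents into files of at most max_chars characters."""
    text = path.read_text(encoding="utf-8")
    parts = []
    for i in range(0, len(text), max_chars):
        part = path.with_name(f"{path.stem}_part{i // max_chars}{path.suffix}")
        part.write_text(text[i:i + max_chars], encoding="utf-8")
        parts.append(part)
    return parts
```

For example, a 5000-character `big.txt` becomes `big_part0.txt`, `big_part1.txt`, and `big_part2.txt`, which can then be uploaded individually.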
Contributions are welcome! Please feel free to submit a Pull Request.
- Open-Custom-GPT - Build Custom GPTs with the Assistants API
- Text-To-Video-AI - Generate videos from text
MIT License - see LICENSE for details.