Titan AI LogoTitan AI

h2ogpt

11,942
1,305
Python

Project Description

Private chat with local GPT with document, images, video, etc. 100% private, Apache 2.0. Supports oLLaMa, Mixtral, llama.cpp, and more. Demo: https://gpt.h2o.ai/ https://gpt-docs.h2o.ai/

h2ogpt: Private chat with local GPT with document, images, video, etc. 100% private, Apache 2.0. Supports oL

Project Title

h2ogpt — Private AI Chat and Document Summarization Tool

Overview

h2ogpt is an open-source project that enables private, offline chat and document summarization using local GPT models. It supports various document types and offers a range of features, including persistent databases, efficient context use, and parallel summarization. The project is designed to be 100% private, with no data leaving the user's local environment.

Key Features

  • Private offline database for documents (PDFs, Excel, Word, Images, Video Frames, YouTube, Audio, Code, Text, MarkDown, etc.)
  • Support for various models (LLaMa2, Mistral, Falcon, Vicuna, WizardLM) with GPU and CPU options
  • Gradio UI or CLI with streaming for all models
  • Open Web UI with h2ogpt as backend via OpenAI Proxy
  • Linux, Docker, macOS, and Windows support
  • Inference Servers support for multiple platforms

Use Cases

  • Enterprises requiring private, secure document summarization and chat capabilities
  • Individuals seeking a local, offline AI chat tool without data privacy concerns
  • Researchers and developers working with various document types and AI models

Advantages

  • 100% private and local processing, ensuring data privacy
  • Supports a wide range of document types and AI models
  • Offers both Gradio UI and Open Web UI for flexible access
  • Parallel summarization for efficient processing

Limitations / Considerations

  • Requires local hardware capable of running AI models, which may limit accessibility for some users
  • The project is Apache V2 licensed, but the specific license details are not provided
  • The project's complexity may require a steeper learning curve for new users

Similar / Related Projects

  • LangChain: A framework for building applications with LLMs, offering a different approach to document summarization and chat.
  • OpenAI's Codex: A code-generating AI model, which can be used for similar purposes but is not private or local.
  • Cohere: A platform providing AI models for various applications, including chat and document summarization, but with a focus on cloud-based solutions.

Basic Information


📊 Project Information

  • Project Name: h2ogpt
  • GitHub URL: https://github.com/h2oai/h2ogpt
  • Programming Language: Python
  • ⭐ Stars: 11,910
  • 🍴 Forks: 1,304
  • 📅 Created: 2023-03-24
  • 🔄 Last Updated: 2025-09-12

🏷️ Project Topics

Topics: [, ", a, i, ", ,, , ", c, h, a, t, g, p, t, ", ,, , ", e, m, b, e, d, d, i, n, g, s, ", ,, , ", g, e, n, e, r, a, t, i, v, e, ", ,, , ", g, p, t, ", ,, , ", g, p, t, 4, a, l, l, ", ,, , ", l, l, a, m, a, 2, ", ,, , ", l, l, m, ", ,, , ", m, i, x, t, r, a, l, ", ,, , ", p, d, f, ", ,, , ", p, r, i, v, a, t, e, ", ,, , ", p, r, i, v, a, t, e, g, p, t, ", ,, , ", v, e, c, t, o, r, s, t, o, r, e, ", ]


📚 Documentation


This article is automatically generated by AI based on GitHub project information and README content analysis

Titan AI Explorehttps://www.titanaiexplore.com/projects/h2ogpt-618613272en-USTechnology

Project Information

Created on 3/24/2023
Updated on 10/31/2025