raghost

RAG as a service.

Visit Website
raghost

Introduction

Overview of RagHost

What is RagHost?

RagHost is a platform that enables rapid building of internal tools powered by Retrieval-Augmented Generation (RAG) technology. It provides a simple API for uploading documents and querying them, handling tasks such as document parsing, chunking, and vector embeddings.

What is RAG?

Retrieval-Augmented Generation (RAG) is a technology that allows Large Language Models (LLMs) to access private data, such as internal company documentation, to generate more accurate answers. It works by retrieving relevant chunks of text from a database and providing them to the LLM along with the question.

Key Features of RagHost

Embedding Documents

RagHost provides an API for embedding documents, allowing users to upload files and configure chunk size and chunk overlap for optimal retrieval quality.

Asking Questions

RagHost's API allows users to ask questions and receive answers from the LLM, with built-in streaming for fast response times.

Model Flexibility

RagHost currently uses OpenAI's GPT 3.5 Turbo as its default model, but plans to add more models, including Anthropic's Claude 2, in the future.

Cost-Effective

RagHost aims to provide fair pricing, unlike OpenAI's pricing model, which charges per GB per day and per assistant.

Getting Started

RagHost provides a live demo, example app, and documentation to help users get started with building their internal tools.