Overview of RagHost
What is RagHost?
RagHost is a platform that enables rapid building of internal tools powered by Retrieval-Augmented Generation (RAG) technology. It provides a simple API for uploading documents and querying them, handling tasks such as document parsing, chunking, and vector embeddings.
What is RAG?
Retrieval-Augmented Generation (RAG) is a technology that allows Large Language Models (LLMs) to access private data, such as internal company documentation, to generate more accurate answers. It works by retrieving relevant chunks of text from a database and providing them to the LLM along with the question.
Key Features of RagHost
Embedding Documents
RagHost provides an API for embedding documents, allowing users to upload files and configure chunk size and chunk overlap for optimal retrieval quality.
Asking Questions
RagHost's API allows users to ask questions and receive answers from the LLM, with built-in streaming for fast response times.
Model Flexibility
RagHost currently uses OpenAI's GPT 3.5 Turbo as its default model, but plans to add more models, including Anthropic's Claude 2, in the future.
Cost-Effective
RagHost aims to provide fair pricing, unlike OpenAI's pricing model, which charges per GB per day and per assistant.
Getting Started
RagHost provides a live demo, example app, and documentation to help users get started with building their internal tools.