This is an example task to demonstrate how task pages look on ServedByAI. This is not a real task.

Develop a RAG system for our internal knowledge baseExample listing

Open
CB
Camille B.
FrancePosted 6 hours ago
AI DevelopmentRAGLangChainPythonVector DatabaseOpenAI APILLM

Task description

We are a 70-person professional services company with a significant amount of internal documentation including company policies, client SOPs, product and service specifications, and training materials, totaling roughly 400 documents in a mix of PDF and Google Docs format.

We want our team to be able to ask natural language questions and get accurate, cited answers drawn from these documents instead of manually searching through folders. We are not looking for a general-purpose chatbot, we specifically want a RAG system that only answers from our document corpus and clearly cites its sources.

Preferred infrastructure would be something we can run cost-effectively, either a hosted solution with low ongoing cost or a self-hosted option on our existing cloud account (AWS). We use Notion and Google Workspace heavily, so integration with those as document sources would be a significant plus.

The scope covers: document ingestion pipeline, embedding and vector store setup, retrieval and generation layer, and a simple web UI for staff to use. We are open to architecture proposals, this is partly why we have set the budget as open to offers.

Requirements

  • Proven experience building RAG systems in production
  • Familiarity with vector databases (Pinecone, Weaviate, Chroma, or similar)
  • Experience with LangChain, LlamaIndex, or equivalent orchestration framework
  • Google Workspace or Notion integration experience is a strong plus
  • Clear architecture documentation and deployment guide on handover

Proposals (3)

Only visible to the task poster

Proposals are private

Log in as the task poster to view submitted proposals

Log in to view proposals

Open to offers