1. Introduction
Organizations generate vast amounts of unstructured data daily, including emails, PDFs, wikis, logs, and various other forms. Traditional machine learning models (LLMs) often struggle to process these data silos effectively at scale due to several challenges, such as latency, governance issues, and token limit constraints. Retrieval-Augmented Generation (RAG)