What is describing is a local llm running what they call a rag system.
“Running a local LLM with a Retrieval-Augmented Generation (RAG) system gives you the power of AI without relying on the cloud. You keep full control of your data, run queries securely and privately, and enhance the model wit...