You can also use a self-hosted OpenSearch instance and build out the index. You can use local llm for embedding and have all your data in your local network. Give your agent the skill to query the index (I’ve found hybrid search works well) and it can pull relevant info into context.