RAG + VLLM Batch Processing Data Retrieval System App update

Hello, we have a rag based Data Retrieval app. Our goal is to enhance performance & reduce latency to near zero. I will provide my current front-end and back-end. Notes: -We will not change llm models... (Budget: $250 - $750 USD, Jobs: Deep Learning, LLM Prompt Engineering, Machine Learning (ML), Retrieval-Augmented Generation (RAG))

Apr 13, 2025 - 20:16
 0
RAG + VLLM Batch Processing Data Retrieval System App update
Hello, we have a rag based Data Retrieval app. Our goal is to enhance performance & reduce latency to near zero. I will provide my current front-end and back-end. Notes: -We will not change llm models... (Budget: $250 - $750 USD, Jobs: Deep Learning, LLM Prompt Engineering, Machine Learning (ML), Retrieval-Augmented Generation (RAG))