Plan
NVIDIA Technology Implementation
Computing Infrastructure: We plan to deploy NVIDIA DGX B200 for training and serving large AI models. Our products run on NVIDIA GPUs.
AI Development Frameworks:
• For data ingestion, we implement the NVIDIA NeMo Retriever, followed by a data curation process. We then utilize the synthetic data generation pipelines from NVIDIA NeMo Curator.
• The Query Orchestrator manages initial processing, creates pre-filters, and builds context through vector search. It then leverages NVIDIA NeMo Retriever and NIM microservices - including Deepseek R1, Riva NMT, Llama 3.1, NeMo Retriever reranking, and NeMo Retriever embedding NIM - to produce accurate and contextually relevant responses.