Cost-Optimized OCI & GPU Infrastructure Setup

Completed

This document details a cost-efficient deployment strategy for the AI Agent that combines Oracle Cloud Infrastructure (OCI) with AWS Spot GPUs. It covers tenancy planning, compartment-based access control, landing zone setup, and provisioning of compute environments. The hybrid design pairs OCI’s Always Free services with AWS g4dn Spot Instances to keep monthly GPU costs for RAG tasks under $20. The infrastructure is designed to support both inference and training workloads for the ERP agent.
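The under-$20 target can be sanity-checked with simple arithmetic. The sketch below is illustrative only: the hourly Spot price is a placeholder assumption (actual g4dn Spot prices vary by region, instance size, and time), and the function names are hypothetical rather than part of any existing tooling for this task.

```python
# Rough budget check for the monthly GPU cost target.
# ASSUMPTION: the hourly price below is a placeholder, not a quoted AWS price.
# Look up the current Spot price for the chosen region and instance size.
ASSUMED_SPOT_PRICE_USD_PER_HOUR = 0.16

# Budget ceiling taken from the task description.
MONTHLY_GPU_BUDGET_USD = 20.00


def affordable_gpu_hours(budget_usd: float, price_per_hour_usd: float) -> float:
    """Return how many GPU hours fit inside the budget at a given hourly price."""
    if price_per_hour_usd <= 0:
        raise ValueError("price_per_hour_usd must be positive")
    return budget_usd / price_per_hour_usd


def monthly_gpu_cost(hours_per_day: float, price_per_hour_usd: float, days: int = 30) -> float:
    """Return the projected monthly cost for a fixed daily usage pattern."""
    if hours_per_day < 0 or days < 0:
        raise ValueError("hours_per_day and days must be non-negative")
    return hours_per_day * days * price_per_hour_usd


def within_budget(hours_per_day: float, price_per_hour_usd: float, budget_usd: float) -> bool:
    """Check whether the projected monthly cost stays at or below the budget."""
    return monthly_gpu_cost(hours_per_day, price_per_hour_usd) <= budget_usd


if __name__ == "__main__":
    hours = affordable_gpu_hours(MONTHLY_GPU_BUDGET_USD, ASSUMED_SPOT_PRICE_USD_PER_HOUR)
    print(f"GPU hours available per month: {hours:.1f}")
    print("4 h/day fits budget:", within_budget(4, ASSUMED_SPOT_PRICE_USD_PER_HOUR, MONTHLY_GPU_BUDGET_USD))
```

At the assumed price, the budget covers roughly 125 GPU hours per month, so the target only holds if the Spot instance runs for a few hours per day and is stopped when idle. Storage, data transfer, and any non-free OCI usage are outside this estimate.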

Assigned To

Omkar

Assigned By

Ankit Sir

Due Date

May 12, 2025

Created At

May 12, 2025
