Staff Cloud & AI-Driven DevOps Engineer with 12+ years of experience in GCP, AWS, and enterprise infrastructure. Expert in building scalable cloud systems, automation frameworks, and integrating Generative AI (LLMs like Gemini, Claude) into real-world DevOps workflows. Proven track record of delivering multi-million dollar cloud cost optimizations, designing self-healing systems, and building AI-powered automation platforms for deployment and operations.
UKG
Managed large-scale GCP infrastructure (Compute, IAM, VPC, LB, Storage) Led cloud cost optimization initiatives achieving multi-million dollar savings Implemented Infrastructure as Code using Terraform and Ansible Automated operations using Python, Bash, and PowerShell (effort ↓ up to 80%) Built internal tools using APIs for monitoring and operational efficiency Integrated observability and security tools including Wiz and Cloud Monitoring Managed identity systems (ForgeRock, AD, SSO, DNS) AI Contributions: Built AI-powered automation workflows and self-healing systems Developed intelligent pipelines for deployment and incident handling
Decision Resources Group
Managed AWS infrastructure (EC2, S3, IAM, VPC) Deployed scalable cloud systems and enterprise applications Implemented identity solutions (Azure AD, ADFS, OneLogin) Led domain and SSL migrations for 300+ domains Optimized tools and reduced operational costs
HCL Technologies
Managed Linux systems and large-scale storage (multi-PB environments) Automated server patching and monitoring processes Improved operational efficiency significantly via automation
B.Tech
CGPA: 8.55
Built internal AI chatbot for engineers to execute tasks, troubleshoot issues, and retrieve knowledge Integrated LLM APIs for intelligent responses and automation Reduced dependency on manual documentation and senior engineers
Designed end-to-end automated deployment pipelines for services (DNS, WebProxy, SMTP, AVS) Integrated provisioning, validation, security checks, and rollback strategies Improved deployment speed and reduced manual errors significantly
Automated alert remediation using LLMs with RCA and fix execution Generated incident summaries and reduced MTTR significantly
Automated ticket-based access provisioning and de-provisioning Improved SLA compliance and reduced operational overhead
Built automated pipeline for content generation and publishing Automated script generation (LLMs), voiceover (TTS), and video processing Generated 300+ videos with 400K+ views and 4K+ watch hours Reduced manual effort by ~90%