Nirav Madhani

About

I am an ML engineer with interests in RL, agents, MLOps, and LLMOps, plus hands-on robotics projects.

Highlights

Application developer focused on cloud infrastructure and application architecture.
Open-source contributor to the poliastro astrodynamics library in Python.
Hyperspectral image super-resolution work during an internship at SAC-ISRO.
Hack DFW 2022 subcategory winner and top 7 overall.
Co-author on an Industry 5.0 paper and top 8.83% on LeetCode.
Built Hugging Face Spaces: CV MCP, Latex to PDF MCP, VLA Data Generator, Octo 1.5, Octo 1.5 Base, Gemini Live Voice Chat.

Vision Model + Agent in Simulated World

Achieved >85% task completion in 3D Unity/Gym environments using a custom ReAct agent. Optimized perception-to-action loop with LangChain.

Mobile Robotic Arm (ROS + DL Models)

Integrated Deep Learning perception models directly with ROS nodes for physical actuation. Solved Inverse Kinematics synchronization for a custom wheeled platform.

Engineering & Research Portfolio

2023 - Present

ML/DL · AI Agents · Systems · RAG

Production RAG System (ARGO DATA)

Achieved sub-100ms P50 latency for knowledge retrieval using LangChain + Pinecone + FastAPI.

View Details

ML/DL · Computer Vision · MLOps

ID Verification Vision System (ARGO DATA)

Deployed PyTorch inference on Azure GPU with autoscaling, cutting vendor API costs by ~60%.

View Details

ML/DL · AI Agents · Systems · Enterprise AI

Agentic AI Chatbot for SMS Surveys (ARGO DATA)

Architected an event-driven compound AI system using Microsoft AutoGen and LangGraph.

View Details

2025

ML/DL · Reasoning · GRPO · Training

RL for Mathematical Reasoning (Gemma-3 Fine-tuning)

Implemented GRPO (Group Relative Policy Optimization) to fine-tune Gemma-3-270M for Chain-of-Thought reasoning.

View Details

Robotics · VLA · Transformers · Policy

VLA-Adapter for NVIDIA GR00T Humanoid

Implemented BridgeAttention architecture to map multimodal inputs (vision + language + proprioception) to 43-D action spaces.

View Details

Robotics · AI Agents · MLOps · Model Serving

Octo Inference API Deployment

Deployed Octo-1.5 robotics foundation model on Hugging Face Spaces.

View Details

Robotics · AI Agents · Data Engineering

Specialized Robotics Augmented Dataset

Engineered a specialized VLA dataset that has heavily impacted the community (1,500+ downloads in 1 month).

View Details

2024

AI Agents · ML/DL · Embodied AI · Sim2Real

Vision Model + Agent in Simulated World

Built an autonomous agent navigating 3D Unity/Gym environments with >85% task completion rate.

View Details

Systems Engineering · AI Agents · WebSockets

Gemini Live Backend Proxy

Architected a production-ready backend handling WebSocket streaming with <150ms audio round-trip latency.

View Details

2023

Robotics · ROS · ML/DL · Hardware

Mobile Robotic Arm (ROS + DL Models)

Integrated Deep Learning perception models with ROS nodes for physical actuation.

View Details

2022

ML/DL · Computer Vision · HPC · Research

Hyperspectral Super-Resolution (ISRO Research)

Engineered a 4x super-resolution pipeline (SR-GAN) for hyperspectral satellite imagery.

View Details

ML/DL · AI Agents · Reinforcement Learning · Simulation

TrafficSwarm (Multi-Agent RL)

Implemented decentralized control policies for multi-intersection traffic management.

View Details

Robotics · Systems Programming · C++

Robotics Calculator

Built a custom offline C++/Python engine for calculating Forward/Inverse Kinematics.

View Details

2021

Robotics · Fundamentals · Kinematics · C++

Robotics Course & FK Visualization

Implemented and visualized 3D Forward Kinematics transformations from scratch.

View Details

Writing

May 5, 2025 · rag, ai-infra, system-design

When to Choose In-Memory RAG vs. Vector Database Services

Retrieval Augmented Generation (RAG) lets you enrich LLM prompts with your own knowledge base. You embed documents into vectors and search them at runtime to pull in context. The question is: do you really need an external vector database? If you're just experimenting or working with small datasets, keeping everything in memory might be all you need.

Read

May 4, 2025 · ai-infra, system-design

Welcome to AI Engineering Blog

Welcome to my AI Engineering blog! This space is dedicated to sharing practical insights, experiences, and deep technical knowledge about AI infrastructure, cloud architecture, and modern development tools.

Read

May 4, 2025 · rag, ai-infra, system-design

Practical RAG Implementation Patterns

An overview of practical patterns for implementing Retrieval Augmented Generation (RAG) in production systems.

Read

May 4, 2025 · azure-networking, ai-infra, system-design

Advanced Azure Networking Patterns for AI Workloads

Exploring networking patterns for deploying large-scale AI workloads in Azure.

Read

ML engineer building applied AI, agents, and robotics

About

Featured Videos

Vision Model + Agent in Simulated World

Mobile Robotic Arm (ROS + DL Models)

Engineering & Research Portfolio

Production RAG System (ARGO DATA)

ID Verification Vision System (ARGO DATA)

Agentic AI Chatbot for SMS Surveys (ARGO DATA)

RL for Mathematical Reasoning (Gemma-3 Fine-tuning)

VLA-Adapter for NVIDIA GR00T Humanoid

Octo Inference API Deployment

Specialized Robotics Augmented Dataset

Vision Model + Agent in Simulated World

Gemini Live Backend Proxy

Mobile Robotic Arm (ROS + DL Models)

Hyperspectral Super-Resolution (ISRO Research)

TrafficSwarm (Multi-Agent RL)

Robotics Calculator

Robotics Course & FK Visualization

Writing

When to Choose In-Memory RAG vs. Vector Database Services

Welcome to AI Engineering Blog

Practical RAG Implementation Patterns

Advanced Azure Networking Patterns for AI Workloads

Contact