Nirav Madhani

ML engineer building applied AI, agents, and robotics

I work on cloud infrastructure and AI systems, and I keep a strong hobby focus on astronomy and physics.

ML engineer focused on applied AI, agents, and robotics.

ML EngineerDallas, TX

About

I am an ML engineer with interests in RL, agents, MLOps, and LLMOps, plus hands-on robotics projects.

Highlights

  • Application developer focused on cloud infrastructure and application architecture.
  • Open-source contributor to the poliastro astrodynamics library in Python.
  • Hyperspectral image super-resolution work during an internship at SAC-ISRO.
  • Hack DFW 2022 subcategory winner and top 7 overall.
  • Co-author on an Industry 5.0 paper and top 8.83% on LeetCode.
  • Built Hugging Face Spaces: CV MCP, Latex to PDF MCP, VLA Data Generator, Octo 1.5, Octo 1.5 Base, Gemini Live Voice Chat.

Featured Videos

Vision Model + Agent in Simulated World

Achieved >85% task completion in 3D Unity/Gym environments using a custom ReAct agent. Optimized perception-to-action loop with LangChain.

Mobile Robotic Arm (ROS + DL Models)

Integrated Deep Learning perception models directly with ROS nodes for physical actuation. Solved Inverse Kinematics synchronization for a custom wheeled platform.

Engineering & Research Portfolio

2023 - Present
ML/DL · AI Agents · Systems · RAG

Production RAG System (ARGO DATA)

Achieved sub-100ms P50 latency for knowledge retrieval using LangChain + Pinecone + FastAPI.

View Details
ML/DL · Computer Vision · MLOps

ID Verification Vision System (ARGO DATA)

Deployed PyTorch inference on Azure GPU with autoscaling, cutting vendor API costs by ~60%.

View Details
ML/DL · AI Agents · Systems · Enterprise AI

Agentic AI Chatbot for SMS Surveys (ARGO DATA)

Architected an event-driven compound AI system using Microsoft AutoGen and LangGraph.

View Details
2025
ML/DL · Reasoning · GRPO · Training

RL for Mathematical Reasoning (Gemma-3 Fine-tuning)

Implemented GRPO (Group Relative Policy Optimization) to fine-tune Gemma-3-270M for Chain-of-Thought reasoning.

View Details
Robotics · VLA · Transformers · Policy

VLA-Adapter for NVIDIA GR00T Humanoid

Implemented BridgeAttention architecture to map multimodal inputs (vision + language + proprioception) to 43-D action spaces.

View Details
Robotics · AI Agents · MLOps · Model Serving

Octo Inference API Deployment

Deployed Octo-1.5 robotics foundation model on Hugging Face Spaces.

View Details
Robotics · AI Agents · Data Engineering

Specialized Robotics Augmented Dataset

Engineered a specialized VLA dataset that has heavily impacted the community (1,500+ downloads in 1 month).

View Details
2024
AI Agents · ML/DL · Embodied AI · Sim2Real

Vision Model + Agent in Simulated World

Built an autonomous agent navigating 3D Unity/Gym environments with >85% task completion rate.

View Details
Systems Engineering · AI Agents · WebSockets

Gemini Live Backend Proxy

Architected a production-ready backend handling WebSocket streaming with <150ms audio round-trip latency.

View Details
2023
Robotics · ROS · ML/DL · Hardware

Mobile Robotic Arm (ROS + DL Models)

Integrated Deep Learning perception models with ROS nodes for physical actuation.

View Details
2022
ML/DL · Computer Vision · HPC · Research

Hyperspectral Super-Resolution (ISRO Research)

Engineered a 4x super-resolution pipeline (SR-GAN) for hyperspectral satellite imagery.

View Details
ML/DL · AI Agents · Reinforcement Learning · Simulation

TrafficSwarm (Multi-Agent RL)

Implemented decentralized control policies for multi-intersection traffic management.

View Details
Robotics · Systems Programming · C++

Robotics Calculator

Built a custom offline C++/Python engine for calculating Forward/Inverse Kinematics.

View Details
2021
Robotics · Fundamentals · Kinematics · C++

Robotics Course & FK Visualization

Implemented and visualized 3D Forward Kinematics transformations from scratch.

View Details

Writing

May 5, 2025 · rag, ai-infra, system-design

When to Choose In-Memory RAG vs. Vector Database Services

Retrieval Augmented Generation (RAG) lets you enrich LLM prompts with your own knowledge base. You embed documents into vectors and search them at runtime to pull in context. The question is: do you really need an external vector database? If you're just experimenting or working with small datasets, keeping everything in memory might be all you need.

Read
May 4, 2025 · ai-infra, system-design

Welcome to AI Engineering Blog

Welcome to my AI Engineering blog! This space is dedicated to sharing practical insights, experiences, and deep technical knowledge about AI infrastructure, cloud architecture, and modern development tools.

Read
May 4, 2025 · rag, ai-infra, system-design

Practical RAG Implementation Patterns

An overview of practical patterns for implementing Retrieval Augmented Generation (RAG) in production systems.

Read
May 4, 2025 · azure-networking, ai-infra, system-design

Advanced Azure Networking Patterns for AI Workloads

Exploring networking patterns for deploying large-scale AI workloads in Azure.

Read

Contact

Open to work and collaborations. Reach me on GitHub, Hugging Face, or LinkedIn.