Shivam Lakhotia

About

Engineer by craft,
explorer by nature.

I'm Shivam — a Senior Software Engineer at NVIDIA, working on NVIDIA DeepStream and developer platforms. I build platforms that make advanced AI systems practical for real products — from C++ streaming frameworks to cloud-native tooling and multimodal AI agents, so teams can move from ideas to production quickly and reliably.

I enjoy designing systems that other engineers build on: Python interfaces over high-performance C++ runtimes, declarative tools for deploying microservices, and reusable RAG backends and AI agents. Before NVIDIA, I studied CS at UC San Diego and worked at Samsung R&D in Bangalore, building offline speech for Bixby. Before that, I completed my bachelor's at IIT Guwahati, where I published research on using Hybrid Memory Cube architectures to accelerate CNNs.

Outside of engineering, I'm usually chasing wind or waves, playing guitar, or planning the next trip.

AI / LLM Multimodal Agents RAG Systems Developer Platforms Systems Design

Experience

Where I've built things.

Senior Software Engineer

NVIDIA DeepStream · Santa Clara, CA

2021 — Present

Working on NVIDIA DeepStream and developer platforms — including Context Aware RAG, a modular RAG backend with Milvus, Elasticsearch, and Neo4j support, and the Video Search and Summarization (VSS) Agent, a multimodal AI agent that ingests long-form video for search, summarization, and Q&A using VLMs and LLMs. Working across the full stack: C++ streaming frameworks, cloud-native microservices, and Python-first developer APIs.

DeepStream Multimodal Agents RAG C++ / Python Video AI

MS Computer Science

UC San Diego · La Jolla, CA

2019 — 2021

Graduate studies in computer science, deepening expertise in systems, algorithms, and machine learning at one of the world's top CS programs.

Machine Learning Systems Algorithms

Software Engineer — Bixby

Samsung R&D Institute · Bangalore, India

2017 — 2019

Built offline speech recognition capabilities for Samsung's Bixby assistant. Users could issue voice commands without internet connectivity — a technically challenging problem at the intersection of NLP, on-device ML, and real-time systems.

NLP On-Device ML Speech Recognition

B.Tech Computer Science

IIT Guwahati · Guwahati, India

2013 — 2017

Bachelor's in Computer Science with a published research paper on using Hybrid Memory Cube (HMC) architecture to improve the efficiency of Convolutional Neural Networks on CPU — an early contribution to hardware-ML co-design.

Research CNN Optimization HMC Architecture

Beyond the terminal

Life outside
the screen.

Surfing

Chasing swells up and down the California coast — San Diego is home base. There's a particular kind of flow state that only a good wave delivers.

Windsurfing

Harnessing wind and water simultaneously at Shoreline, Mountain View. Windsurfing demands technical precision and physical endurance in equal measure.

Scuba Diving

Best dive so far: Hawaii, with my wife. Dropping below the surface into a world of silence — every dive is a reminder that most of the planet remains unexplored.

Guitar

Fingerpicking through everything from folk to classical. Music is the other language I've spent years learning to speak fluently.

Chess

Strategy, patience, and the joy of a well-calculated sacrifice. Chess teaches you to think three moves ahead — useful in engineering too.

Coffee & Cooking

Regulars at Backyard Brews in Palo Alto and Shoreline Cafe in Mountain View. Weekend mornings with a pour-over; evenings experimenting in the kitchen.

Moments

Shivam Lakhotia at a pottery wheel in a studio, shaping clay

Shivam Lakhotia enjoying ramen with chopsticks at a restaurant

Conversations I enjoy

What I think about.

Beyond code and waves, I'm drawn to the bigger questions — about how the mind works, what makes a good life, and how technology shapes human experience.

01

Psychology & Happiness

The science of wellbeing, cognitive biases, and what research actually says about living a fulfilling life.

02

Natural Language Understanding

How machines learn to parse meaning, and what the gap between language and thought reveals about both.

03

Wealth & Financial Systems

How capital flows, compounding works, and the structural forces that shape economic outcomes.

04

AI Agent Orchestration

The emerging dynamics of multi-agent systems — how autonomous agents coordinate, fail, and surprise us.

05

Multimodal AI & Video Understanding

Building agents that reason over video — combining VLMs, LLMs, and retrieval to make sense of the world frame by frame.

Writing & Research

Things I've published.

NVIDIA Developer Blog

Build a Video Search and Summarization Agent with NVIDIA AI Blueprint

How we built a multimodal AI agent that ingests long-form video and enables natural language search, summarization, and Q&A using VLMs, LLMs, and a modular RAG backend.

Read on NVIDIA Blog →

NVIDIA Developer Blog

Make Sense of Video Analytics by Integrating NVIDIA AI Blueprints

A practical guide to integrating NVIDIA AI Blueprints into video analytics pipelines — combining structured and unstructured data with multimodal retrieval at scale.