tensorsofthewall/README.md

👋 Hi, I'm Sandesh (@tensorsofthewall)

I'm a research engineer working on large-scale multimodal AI systems that see, reason, and operate under real-world constraints. My work spans computer vision, vision-language models, reinforcement learning, distributed training, and more recently, music/audio modeling.

I enjoy building systems that move beyond benchmarks, from autonomous navigation to distributed inference pipelines, and occasionally making robots smarter (or at least slightly less confused).


🛠️ Core Skills & Tools

Languages

Python C++ Java Bash


Multimodal AI

PyTorch HuggingFace TensorFlow OpenCV vLLM

  • Multimodal LLMs & Vision-Language Models
  • Long-context training (24K+ tokens)
  • SFT & DPO
  • Synthetic data generation
  • Large-scale evaluation pipelines

Distributed Training & Inference

SLURM Docker Apache Spark

  • PyTorch DDP & FSDP
  • Tensor / Pipeline / Context Parallelism
  • Multi-node GPU clusters (up to 32 nodes)
  • Distributed inference (100B+ models)
  • 3B+ tokens/day serving pipelines

Data & Systems

NumPy Pandas ElasticSearch SQL

  • Spark data pipelines
  • Retrieval-augmented systems (RAG)
  • Throughput & latency optimization
  • Heterogeneous GPU deployment (AMD & NVIDIA)

🌟 Featured Projects

  • BetterSearch
    Desktop application integrating LLMs, advanced RAG, and native OS functionality for semantic file search. Supports text-to-SQL via osquery, CPU/GPU customization, and local/cloud deployment modes.

  • Exo
    Distributed inference framework for running LLMs across heterogeneous device clusters. Contributed AMD GPU support improvements and llama.cpp Windows backend integration.

  • VidTune
    Generative AI web app for tailored music creation for videos, with genre, tempo, keyword, and dynamic mixing customization.

  • Unified Local-Cloud Decision-Making (UniLCD)
    Residual reinforcement learning framework for cloud-edge collaboration in embodied vision systems. Presented at ECCV 2024.

  • Autonomous Underwater Vehicle
    Led team to Singapore AUV Challenge finals. Built navigation, perception, and embedded systems; integrated enhanced underwater vision algorithms.


πŸ… Publications


🔬 Experience

Research

  • Mohamed bin Zayed University of Artificial Intelligence (MBZUAI) - Research Engineer I (Aug 2025 – Present, Abu Dhabi, UAE)
    Training and post-training multimodal LLMs and VLMs for domain-specific reasoning (Multimodal Energy GPT).

    • 70B-class model SFT and DPO across multi-node GPU clusters
    • Distributed inference for 235B models (3B+ tokens/day, 32 nodes)
    • Synthetic data generation and large-scale evaluation pipelines
  • LossFunk (Initialize Program) - Researcher (June 2025 – Present)
    Independent research on MIDAS, a modular framework for music source separation.

  • Human-to-Everything (H2X) Lab, Boston University - Research Engineer / Graduate Research Assistant (Mar 2023 – Mar 2025)
    Vision-language models for autonomous driving, cloud-edge routing, energy-aware navigation, reinforcement learning, and embodied AI systems.

  • Robert Bosch GmbH - Research Intern (Jan 2020 – June 2020)
    Open-set classification and predictive maintenance for vehicular systems.

  • Center for Development of Advanced Computing (C-DAC) - Research Intern (May 2019 – Oct 2019)
    Gait-based person re-identification (91.13% accuracy on CASIA-B).

Industry

  • Ottometric Inc. - Software Engineering Intern (June 2023 – Aug 2023)
    Submodular optimization for dataset summarization and efficient model training.

  • Ignitarium Technology Solutions - AI Engineer (Jan 2022 – July 2022)
    High-performance INT8 kernel development for accelerator hardware (30% inference improvement).

  • Synopsys India - Software Engineer (Jan 2021 – Jan 2022)
    Apache Spark pipelines, BERT-based anomaly detection, unified production log systems.

  • Thermo Fisher Scientific - Summer Intern (May 2018 – July 2018)
    Embedded C++ development on ARM hardware; Linux Yocto + Qt systems.


🎓 Education

  • Boston University
    M.S. in Computer Science (Sep 2022 – May 2024)
    Thesis: Efficient Vision and Language Models for Autonomous Systems

  • IIITDM Kancheepuram
    B.Tech. + M.Tech. in Electronics and Communication Engineering (July 2015 – June 2020)


📫 Socials


🎵 Fun Fact

“Hurricane Sandy” wasn’t a weather event; it was me, my spiky hair, and a guitar solo on stage.


Teaching machines to see.
Scaling models responsibly.
Occasionally arguing with SLURM.

Pinned

  1. DIASENGUPTA/UniLCD (Public)

    Unified Local-Cloud Decision-Making with Residual Reinforcement Learning

    Python · 3 stars · 1 fork

  2. tensorsofthewall.github.io (Public)

    Personal Website

    TypeScript

  3. BetterSearch (Public)

    Desktop search tool that brings natural language to traditional file search.

    Python · 5 stars

  4. VidTune (Public)

    Tailored Music For Your Videos

    Python · 3 stars · 1 fork

  5. RUFUS (Public)

    An intelligent web data extraction tool designed for seamless integration with Retrieval-Augmented Generation (RAG) pipelines

    Python

  6. exo-explore/exo (Public)

    Run frontier AI locally.

    Python · 42.4k stars · 2.9k forks