My overarching research goal is to develop robust and safe intelligent multimodal systems. My interests lie at the intersection of reinforcement learning, multi-agent systems, and computer vision. I am particularly eager to explore Vision-Language Models in the context of navigation and control.
Feel free to check out my resume and drop me an e-mail if you would like to chat with me!
Research Intern | University of California, Berkeley
Dec '22 - Jun '23
Worked under the supervision of Prof. Masayoshi Tomizuka. Proposed a novel metric called the Influence Index to quantify coordination levels in multi-agent reinforcment learning.
Research Engineer Intern | Indian Institute of Science Bangalore
May '22 - Aug '22
Worked under the supervision of Prof. Shishir KolathayaStochLab, IISc on implementing algorithms for quadruped locomotion.
I had the opportunity to work with real-world robots and benchmarked algorithms on the Stochlite quadruped.
Computer Vision Intern | Drive Analytics
Dec '21 - Feb '22
Worked with the Computer Vision team under the supervision of Mr. Pradeep Janakiraman on building a pipeline for real-time tracking of basketballs using YOLOv5.
Undergraduate Researcher | IIT Kharagpur
May '21 - Apr '24'
Adapting to Shifts in Vehicle Dynamics with Online Latent Optimization
Under Review at Confenrece on Robot Learning 2024 (CoRL)
More details to be shared soon
Entity Augmentation for Efficient Classification of Vertically Partitioned Data with Limited Overlap [paper]
Proposed Entity Augmentation, a novel approach that eliminates the need for private set intersection (PSI) and entity alignment in Vertical Federated Learning for categorical tasks
[Re] From Goals, Waypoints & Paths To Long Term Human Trajectory Forecasting [paper]
Reproduced the results of the seminal YNet architecture of social trajectory prediction. Proposed a novel transfer learning experiment that achieved state-of-the-art performance on the SDD dataset.
Inter IIT Tech Meet 2024
Member of the Gold Winning Team, IIT Kharagpur
Adobe Behavior Simulation Challenge: The task was two fold: (i) Given the contents of a tweet and any media files along with username, predict the number of likes (ii) Given the number of likes, as well as the username and any media files, predict the text contents of the tweet.
Our approach involved finetuning LLama, NeXT-GPT and LLaVA 1.5 on the dataset. We leveraged LLaVA for video-captioning followed by keyword extraction to engineer a custom prompt. We optimized the pipeline further using a bandit-informed routing algorithm to select the best LLM during inference. [code]
The task was to reproduce research papers from top AI conferences and extend the ideas for state-of-the-art performance. We successfully reproduced the results of "From Goals, Waypoints & Paths To Long Term Human Trajectory Forecasting" (ICCV '21). Further, we proposed a novel transfer learning experiment that achieved SOTA performance on the Stanford Drone Dataset (SDD) and the Intersection Drone Dataset (InD)
Member of the IIT Kharagpur - IUPUI, Indiana - USB Colombia collaborative team.
Designed tightly/loosely coupled high-speed localisation in for racecar localisation in pre-mapped LiDAR circuit. The localisation was using 3 static-state LiDARs.
Wrote ROS packages to construct the map from LiDAR scans, extract the local map and to implement Iterative Closest Point algorithm for localization in mapped environment. [code]
Online dynamics estimation in unforseen environments for autonomous driving
Toronto Intelligent Systems Lab, UofT
Implemented a two-stage pipeline for online dynamics estimation in unforseen environments. Proposed a novel encoder-decoder architecture for the first stage and performed real-time experiments on a 1/10th scale car. Work is under review at Conference on Robot Learning 2024 (CoRL)
Quantifying coordination levels in Multi-Agent Reinforcement Learning
Mechanical Systems Control Lab, UC Berkeley
Explored novel methods to quantify the coordination between two agents on the Meltingpot environments.
Proposed Influence Index, a scalar metric that measures the interaction level between agents in varying environments. Implemented Self-Play, Population-Play and Fictitious Co-play methods and obtained results.
Worked on the ROS control framework of the Stochlite Robot and benchmarked the Soft Actor-Critic and Advantage Actor-Critic algorithms on the same.
Explored gradient free methods such as augmented random search for end foot trajectory spline generation.
Reinforcement Learning for bipedal walking Term Project under Prof. Parta Pratim Chakrabarti
[Github]
Implemented the DQN, DDQN, PPO and TD3 algorithms to solve the LunarLanderv2 and BipedalWalker-v3 environments within OpenAI Gym. Explored gradient clipping, reward normalization and advantage estimation (GAE) to achieve rewards of over 200.
Student Volunteer | National Service Scheme
Dec 2020 - May 2022 [website]
The National Service Scheme is a public service initiative managed by the Ministry of Youth Affairs and Sports of the Government of India. Its overarching objective is to uplift the quality of life for underrepresented and underprivileged sectors in India, with a dual focus on environmental conservation and fostering a sense of civic duty. The program is dedicated to not only combating poverty but also actively engaging participants in initiatives that promote sustainability, environmental awareness, and a heightened civic responsibility..
During my involvement in the National Service Scheme (NSS), I played an active role in educating students in rural villages, teaching fundamental concepts in math and science. I took the initiative to design study materials to facilitate the learning process. Furthermore, I actively participated in various environmental campaigns as part of NSS, contributing to cleaning initiatives and tree-planting efforts to promote sustainable practices and community well-being.
This template is a modification to Jon Barron's website. Find the source code to my website here.