I am currently working as a Research engineer where my focus is to develop multi-modal large language models (LLMs). Previously, I was a research student at Computer Science & Engineering Department, Indian Institute of Technology, Madras ( IIT Madras ) ,
where I worked on Computer Vision and Deep Learning. I did my undergraduate studies in Computer Science at National Institute of Technology, Silchar( NIT Silchar )
My research interests mainly lie in the areas of computer vision and deep learning.
In partiuclar, my current work is particularly focused on vision language models and label-efficient (Semi-Supervised/ Unsupervised /Self-Supervised ) approaches for deep-learning across Images/Videos.
In addition, I am also interested in video understanding, representation learning ,domain adaptation and transfer learning.
We propose a contrastive framework for semi-supervised domain adaptation (SSDA) where we use instance alignment between unlabeled target samples and centroid alignment between source and target domains.
We propose a temporal contrastive learning framework for semi-supervised action recognition by using contrastive losses between different videos and groups of videos with similar actions.
We introduce a joint dataset repairment strategy by combining classifier with a GAN that makes up for the deficit of training examples from the minority class by producing additional examples.