About
Articles by Bhavin
Activity
-
Internet of Agents: Weaving a Web of Heterogeneous Agents for Collaborative Intelligence Chen et al.:…
Internet of Agents: Weaving a Web of Heterogeneous Agents for Collaborative Intelligence Chen et al.:…
Liked by Bhavin Jawade ✈️ NeurIPS
-
Chain of Continuous Thought: novel paradigm with enhanced LLM Reasoning in continuous latent space LLMs are restricted to reason in the “language…
Chain of Continuous Thought: novel paradigm with enhanced LLM Reasoning in continuous latent space LLMs are restricted to reason in the “language…
Liked by Bhavin Jawade ✈️ NeurIPS
-
We're hiring technical customer-facing rockstars at 🚀 Mistral AI 👇 If you have hands-on experience with ML/AI and enjoy working with customers to…
We're hiring technical customer-facing rockstars at 🚀 Mistral AI 👇 If you have hands-on experience with ML/AI and enjoy working with customers to…
Liked by Bhavin Jawade ✈️ NeurIPS
Experience & Education
Licenses & Certifications
Volunteer Experience
-
Mentor
Incubate IND
- 2 months
Science and Technology
Mentor for Mobility Developer Tech Camp organized by INCUBATEIND.
Publications
-
Multi Loss Fusion For Matching Smartphone Captured Contactless Finger Images
IEEE International Workshop on Information Forensics and Security
Fingerprint authentication generally requires the acquisition of fingerprint information through touch-based specialized sensors. However, the global spread of the contagious virus through contact of the surface has increased the attention towards contactless biometrics verification. Another reason for contactless fingerprint identification is the easy availability of low-cost camera sensors available in mobile devices. Traditionally, the enrollment images are captured using touch-based sensors…
Fingerprint authentication generally requires the acquisition of fingerprint information through touch-based specialized sensors. However, the global spread of the contagious virus through contact of the surface has increased the attention towards contactless biometrics verification. Another reason for contactless fingerprint identification is the easy availability of low-cost camera sensors available in mobile devices. Traditionally, the enrollment images are captured using touch-based sensors and the current era requires touch-less images. Therefore, it raises the problem of performing contactless vs contactless as well as cross fingerprint (contact vs. contact-less) matching for identity verification. In the literature limited work has been done so far for smartphone acquired contactless fingerprint matching and cross fingerprint matching and the existing algorithms are computationally challenging to be deployed on mobile devices
Therefore, in this paper, we propose a cost-effective end-to-end solution for user-operated smartphone-based contactless fingerprint enrollment and verification using a novel multi-stage pipeline that includes an automatic finger region segmentation technique, contactless fingerprint enhancement algorithm, and deep convolutional net with contrastive and minutiae loss for learning robust fingerprint representations. We show the effectiveness of our network on a publicly available fingerprint dataset consisting of both contact and contactless fingerprint images. The comparison with state-of-the-art shows that the proposed algorithm performs on par with the existing algorithms using a much lesser amount of data for training and by reducing a significant inference time computation cost. We have also developed a cross-platform mobile application for fingerprint enrollment, verification, and authentication designed keeping security, robustness, and accessibility in mindOther authors -
Low computation in-device geofencing algorithm using hierarchy-based searching for offline usage
IEEE
Most applications use external services and APIs to implement geofencing. This has a major drawback that the user location data is accessible to the external service provider. Another important drawback is the continuous requirement of network connection for geofencing. Typical implementation of geofencing cannot be done within the mobile device as they require high computation for repetitive searching. In this research paper we propose new geofencing architecture based on arranging geofences…
Most applications use external services and APIs to implement geofencing. This has a major drawback that the user location data is accessible to the external service provider. Another important drawback is the continuous requirement of network connection for geofencing. Typical implementation of geofencing cannot be done within the mobile device as they require high computation for repetitive searching. In this research paper we propose new geofencing architecture based on arranging geofences in a tree like structure (geo-tree). Due to the low computation cost of our parsing algorithm, it is fast and can be used directly within mobile devices reducing network cost and more importantly keeping user location data secure. This research paper also talks about the tested efficiency of the architecture and about the probable future scopes where the efficiency can be further increased.
Other authorsSee publication -
Attribute De-biased Vision Transformer (AD-ViT) for Long-Term Person Re-identification
IEEE International Conference on Advanced Video and Signal-Based Surveillance, 2022 (AVSS 2022)
We propose an Attribute De-biased Vision Transformer (AD-ViT) to provide direct supervision to learn identity-specific features. Specifically, we produce attribute labels for person instances and utilize them to guide our model to focus on identity features through gradient reversal. Our experiments on LTCC and NKUP datasets shows that the proposed work consistently outperforms the state-of-the-art methods.
-
Hear The Flow: Optical Flow-Based Self-Supervised Visual Sound Source Localization
IEEE/CVF Winter Conference on Applications of Computer Vision, 2023 (WACV 2023)
In a video, often-times, the objects exhibiting movement are the ones generating the sound. In this work, we capture this characteristic by modeling the optical flow in a video as a prior to better aid in localizing the sound source. We further demonstrate that the addition of flow-based attention substantially im- proves visual sound source localization. We benchmark our method on standard sound source localization datasets and achieve state-of-the-art performance on the SoundNet Flickr and…
In a video, often-times, the objects exhibiting movement are the ones generating the sound. In this work, we capture this characteristic by modeling the optical flow in a video as a prior to better aid in localizing the sound source. We further demonstrate that the addition of flow-based attention substantially im- proves visual sound source localization. We benchmark our method on standard sound source localization datasets and achieve state-of-the-art performance on the SoundNet Flickr and VGG Sound Source datasets.
-
NAPReg: Nouns as Proxies Regularization for Semantically Aware Cross-Modal Embeddings
IEEE/CVF Winter Conference on Applications of Computer Vision, 2023 (WACV 2023)
We proposed NAPReg, a novel regularization formulation that projects high-level semantic entities i.e. Nouns into the embedding space as shared learnable proxies. We show that using such a formulation allows the attention mechanism to learn better word-region alignment while also utilizing region information from other samples to build a more generalized latent representation for semantic concepts. Experiments on MS-COCO, Flickr30k and Flickr8k demonstrate that our method achieves…
We proposed NAPReg, a novel regularization formulation that projects high-level semantic entities i.e. Nouns into the embedding space as shared learnable proxies. We show that using such a formulation allows the attention mechanism to learn better word-region alignment while also utilizing region information from other samples to build a more generalized latent representation for semantic concepts. Experiments on MS-COCO, Flickr30k and Flickr8k demonstrate that our method achieves state-of-the-art results in cross-modal metric learning for text-image and image-text retrieval tasks
-
RidgeBase: A Cross-Sensor Multi-Finger Contactless Fingerprint Dataset
2022 {IEEE} International Joint Conference on Biometrics ({IJCB})
Contactless fingerprint matching using smartphone cameras can alleviate major challenges of traditional fingerprint systems including hygienic acquisition, portability and presentation attacks. However, development of practical and robust contactless fingerprint matching techniques is constrained by the limited availability of large scale real-world datasets. To motivate further advances in contactless fingerprint matching across sensors, we introduce the RidgeBase benchmark dataset. RidgeBase…
Contactless fingerprint matching using smartphone cameras can alleviate major challenges of traditional fingerprint systems including hygienic acquisition, portability and presentation attacks. However, development of practical and robust contactless fingerprint matching techniques is constrained by the limited availability of large scale real-world datasets. To motivate further advances in contactless fingerprint matching across sensors, we introduce the RidgeBase benchmark dataset. RidgeBase consists of more than 15,000 contactless and contact-based fingerprint image pairs acquired from 88 individuals under different background and lighting conditions using two smartphone cameras and one flatbed contact sensor. Unlike existing datasets, RidgeBase is designed to promote research under different matching scenarios that include Single Finger Matching and Multi-Finger Matching for both contactless-to-contactless (CL2CL) and contact-to-contactless (C2CL) verification and identification. Furthermore, due to the high intra-sample variance in contactless fingerprints belonging to the same finger, we propose a set-based matching protocol inspired by the advances in facial recognition datasets. This protocol is specifically designed for pragmatic contactless fingerprint matching that can account for variances in focus, polarity and finger-angles. We report qualitative and quantitative baseline results for different protocols using a COTS fingerprint matcher (Verifinger) and a Deep CNN based approach on the RidgeBase dataset. The dataset can be downloaded here: https://www.buffalo.edu/cubs/research/datasets/ridgebase-benchmark-dataset.html
Other authors
Courses
-
Analysis of Algorithm
CSE 531
-
Android Development
REP ID 4127
-
Biometrics and IoT Security
CSE 741
-
C/C++
-
-
Computer Vision and Image Processing
CSE 573
-
Deep Learning
CSE 656
-
Information Retrieval
CSE 535
-
Java
-
-
Machine Learning
CSE 574
-
Python
-
-
Reinforcement Learning
CSE 510
-
Web Development : Back-end Front-end
3
-
Web Development Via HTML CSS Javascript PHP
-
Projects
-
Full Feature Web based Image-Editor
Built during Wittyhacks Hackathon.
Currently being used at Wittyfeed (Vatsana).
Image Editor has all features, like drag and drop. It acts like small-scale web-based photoshop.
Project Link: https://github.com/bhavinjawade/Web-Image-Editor
Features:
Upload Image
Add Frames
Add Text
Drag and Drop on Web
Layers (Send back, bring front)
Colors, Fonts and Shapes
Save Image
-
Deep Learning - Attention Based Neural Image Captioning
-
Implemented Show Attend and Tell 's Neural Image Captioning model with attention.
Improved it my implementing Adaptive Attention Mechanism.
Used ResNet 101, DenseNet 201 and VGG 16 CNNs for encoder.
Used LSTM for decoder.
Evaluated score using BLEU-4.
Technologies:
Pytorch, Python, Sklearn, NLTK.
Keywords:
NLP, AI, Deep Learning, Machine Learning, Neural Networks, Pytorch -
Computer Vision and Image Processing - Virtual Wall
-
Touch sensing and interaction detection using Stereo Vision and Object tracking. Used a Mynt Eye S-1030 Stereo camera to detect when the users hand is close enough to a selected section of wall. Used OpenCV CSRT Detector to locate the exact position of the hand and performed an operation on the computer. The project allows a user to convert any wall into a virtual touch screen using stereo vision.
Technologies: Python, C++, OpenCV, SGBM, Mynteye SDK. -
Reinforcement Learning - Actor Critic | DQN | Multiagent RL | Atari Games
-
Trained a CNN based Deep Q Network, DDQN network, Dueling Network and Policy gradient algorithms like REINFORCE and Advantage Actor Critic (A2C) algorithm to play Atari Games (Road Runner and Breakout) at a human level performance.
For DDQN my implementation got the same normalized score as the original DDQN paper.
Final Project - Multiagent Reinforcement Learning algorithm to solve a Ship Docker Problem.
Technologies: Pytorch, Tensorflow, OpenAI Gym
Honors & Awards
-
Graduate Leadership Award
University at Buffalo
Graduate Leadership Award, University at Buffalo
-
IEEE Best Paper Award
IEEE
Best Paper Award, IEEE, IJCB 2023, Slovenia
Awarded for - “CoNAN - Conditional Neural Aggregation Network for Unconstrained Face Feature Aggregation”. -
UB CSE Student Innovation Award
University at Buffalo
Best Project - Contactless Fingerprint Authentication Using a Mobile Device.
Russell Agrusa Annual Awards. -
Blackstone Launchpad Ideas Competition
Blackstone Launchpad
Best Project - "VisionAll" for Social and Climate Change.
-
Maple Ridge City Hackathon
Maple Ridge City
Winner of Maple Ridge City hackathon.
https://drive.google.com/file/d/1jlSJTPf3aOfiOkE1jU6lCSNARVd0YgHZ/view?usp=sharing -
NSF DIBBS Grant
National Science Foundation
Awarded graduate funding under NSF DIBBS Grant.
Test Scores
-
TOEFL
Score: 109
-
GRE
Score: 324
Languages
-
English
Full professional proficiency
-
Hindi
Native or bilingual proficiency
Organizations
-
Ecell SGSITS
Head of Design
-
Facebook Developer Circle
Co-lead
-
HashInclude - Techno Learning Club
Head
Recommendations received
6 people have recommended Bhavin
Join now to viewMore activity by Bhavin
-
This fall, I have been attending in the LLM Agents MOOC offered by UC Berkeley. Last week’s lecture, delivered by Prof. Dong Song, was particularly…
This fall, I have been attending in the LLM Agents MOOC offered by UC Berkeley. Last week’s lecture, delivered by Prof. Dong Song, was particularly…
Liked by Bhavin Jawade ✈️ NeurIPS
-
Are you currently a PhD student working at the intersection of Culture and AI? Come join me and a bunch of wonderful folks - Erin MacMurray van…
Are you currently a PhD student working at the intersection of Culture and AI? Come join me and a bunch of wonderful folks - Erin MacMurray van…
Liked by Bhavin Jawade ✈️ NeurIPS
-
💡 Conference tip: Took a bunch of poster photos from #NeurIPS2024? Mistral AI Le Chat can help find the papers!
💡 Conference tip: Took a bunch of poster photos from #NeurIPS2024? Mistral AI Le Chat can help find the papers!
Liked by Bhavin Jawade ✈️ NeurIPS
-
🌌 James Webb & Hubble Confirm a Profound Mystery in Our Understanding of the Universe 🌌 Recent combined efforts by the James Webb Space Telescope…
🌌 James Webb & Hubble Confirm a Profound Mystery in Our Understanding of the Universe 🌌 Recent combined efforts by the James Webb Space Telescope…
Liked by Bhavin Jawade ✈️ NeurIPS
-
Who came up with the term "attention"? A fun read around the names of Karpathy, Bengio, Ilya, Dzmitry Bahdanau and many more. “So one day I had…
Who came up with the term "attention"? A fun read around the names of Karpathy, Bengio, Ilya, Dzmitry Bahdanau and many more. “So one day I had…
Liked by Bhavin Jawade ✈️ NeurIPS
-
Thrilled to present our work on Monty Hall inspired re-prompting and score optimization in conformal prediction (CP) to improve uncertainty…
Thrilled to present our work on Monty Hall inspired re-prompting and score optimization in conformal prediction (CP) to improve uncertainty…
Liked by Bhavin Jawade ✈️ NeurIPS
-
🎉 Great News! 🎉 I'm thrilled to share that our(Animesh Mukherjee, Paramita Das and Ritabrata Chakraborty) paper "On the effective transfer of…
🎉 Great News! 🎉 I'm thrilled to share that our(Animesh Mukherjee, Paramita Das and Ritabrata Chakraborty) paper "On the effective transfer of…
Liked by Bhavin Jawade ✈️ NeurIPS
-
ICPC India is seeking task/problem proposals for the upcoming ACM ICPC India Regionals. If you are interested in contributing, please submit your…
ICPC India is seeking task/problem proposals for the upcoming ACM ICPC India Regionals. If you are interested in contributing, please submit your…
Liked by Bhavin Jawade ✈️ NeurIPS
-
NeurIPS celebrates the Test of Time Awards. These are papers that introduced novel ideas, techniques, or frameworks that remain relevant and widely…
NeurIPS celebrates the Test of Time Awards. These are papers that introduced novel ideas, techniques, or frameworks that remain relevant and widely…
Liked by Bhavin Jawade ✈️ NeurIPS
-
I recently had the pleasure of reviewing the MEAP release of Build LLM Applications (From Scratch) by Hamza Farooq (https://lnkd.in/gZnqSaJ5)…
I recently had the pleasure of reviewing the MEAP release of Build LLM Applications (From Scratch) by Hamza Farooq (https://lnkd.in/gZnqSaJ5)…
Liked by Bhavin Jawade ✈️ NeurIPS
-
‼️ 𝗔𝗔𝗔𝗜 𝟮𝟬𝟮𝟱 I am happy to share that our work, 𝗦𝗲𝗴𝗙𝗮𝗰𝗲: 𝗙𝗮𝗰𝗲 𝗦𝗲𝗴𝗺𝗲𝗻𝘁𝗮𝘁𝗶𝗼𝗻 𝗼𝗳 𝗟𝗼𝗻𝗴-𝗧𝗮𝗶𝗹 𝗖𝗹𝗮𝘀𝘀𝗲𝘀, has…
‼️ 𝗔𝗔𝗔𝗜 𝟮𝟬𝟮𝟱 I am happy to share that our work, 𝗦𝗲𝗴𝗙𝗮𝗰𝗲: 𝗙𝗮𝗰𝗲 𝗦𝗲𝗴𝗺𝗲𝗻𝘁𝗮𝘁𝗶𝗼𝗻 𝗼𝗳 𝗟𝗼𝗻𝗴-𝗧𝗮𝗶𝗹 𝗖𝗹𝗮𝘀𝘀𝗲𝘀, has…
Liked by Bhavin Jawade ✈️ NeurIPS
-
Now Available - MAISI An #NVIDIANIM foundation model for state-of-the-art medical imaging synthesis. Generate high-fidelity CT scans with anatomical…
Now Available - MAISI An #NVIDIANIM foundation model for state-of-the-art medical imaging synthesis. Generate high-fidelity CT scans with anatomical…
Liked by Bhavin Jawade ✈️ NeurIPS
Other similar profiles
Explore collaborative articles
We’re unlocking community knowledge in a new way. Experts add insights directly into each article, started with the help of AI.
Explore More