Shrey Jain
MS, Computer Engineering, New York University
B.Tech Computer Science and Engineering, PES University
Email- jain.shrey98@gmail.com
I am a graduate student at New York University with major in Computer Engineering with a strong background in Machine Learning, OCR, and Software Development. I have had the opportunity to work on several projects during my graduate and undergraduate studies that involved developing an Image Captioning model for chest X-ray images using deep learning models, IPL match result prediction using Machine Learning algorithms, a hotel recommendation system using natural language processing and analyzing security issues in NFT marketplaces. I also have experience in software development, with a strong background in Java, Python, and ReactJS. I interned as a Full Stack Engineer at Audible in the summer of 2022, where I got the chance to work on several projects and gain practical experience in the field.
You will find here a sample of my most relevant projects, which will give you a glimpse of my technical skills, as well as my ability to take on challenges and deliver results. If you are interested in working together or have any questions, feel free to contact me.
Face Recognition in the age of CLIP & Billion image datasets
In this paper, an evaluation of the performance of various CLIP models as zero shot face recognizers is done. The findings show that CLIP models perform well on face recognition tasks, but increasing the size of the CLIP model does not necessarily lead to improved accuracy. Additionally, there is an investigation of the robustness of CLIP models against data poisoning attacks by testing their performance on poisoned data. Through this analysis, the main aim was to understand the potential consequences and misuse of search engines built using CLIP models, which could potentially function as unintentional face recognition engines.
Image Captioning for Chest Xray Images
This project is an image captioning model, which will take in Chest X-ray images as the input and automatically generate a caption that will contain the summary of findings in the image. For the image-based model (Encoder) – a Convolutional Neural Network model is used and for the language-based model (Decoder) – a Recurrent Neural Network is used. A pretrained CNN extracts the features from the input image. The feature vector is linearly transformed to have the same dimension as the input dimension of the RNN/LSTM network. This network is trained as a language model on the feature vector.
IPL-Match-Analysis
This project aims to predict the outcome of IPL matches using probability and machine learning. The project uses Big Data technologies such as MapReduce, Spark MLLib, and clustering in Scala. The data collection phase involved downloading and converting data from a website into a usable format, and then calculating runs and wickets for each batsman-bowler pair. Clustering was used to group similar players and make predictions for unknown pairs. The prediction phase uses probability to determine the outcome of each ball, and a machine learning approach using Spark MLLib and decision trees was used to improve the accuracy. The overall efficiency of the project was quite high, accurately predicting 8 out of 10 results.
Webapp Using Docker
Implements a Web Application using docker, flask and REST API
Instance-1 Load Balancer run in built-in Flask Server on port 80 (Needs to be give permission and stop all other services. Acts running in docker on port (8000 and above). Image-ccdocker1/acts:latest
Instance-2 Users app running on docker container(on port 80) in instance-2. Image- ccdocker1/users:latest