About
I am a Software Engineer at Meta, working in the AI Infrastructure organization…
Articles by Vishwa
Activity
-
I'm stepping down from PyTorch and leaving Meta on November 17th. tl;dr: Didn't want to be doing PyTorch forever, seemed like the perfect time to…
I'm stepping down from PyTorch and leaving Meta on November 17th. tl;dr: Didn't want to be doing PyTorch forever, seemed like the perfect time to…
Liked by Vishwa Karia
-
Honoured to attend the NIFT Gandhinagar Graduation Project Showcase 2025. A wonderful throwback to attending this inspiring event, alongside Mr…
Honoured to attend the NIFT Gandhinagar Graduation Project Showcase 2025. A wonderful throwback to attending this inspiring event, alongside Mr…
Liked by Vishwa Karia
-
My wholehearted recommendations go to the awesome SDEs and individuals, who were unfortunately impacted today due to reasons outside of their…
My wholehearted recommendations go to the awesome SDEs and individuals, who were unfortunately impacted today due to reasons outside of their…
Liked by Vishwa Karia
Experience
Education
Volunteer Experience
-
Social Initiative Volunteer
Technovanza
- 1 year 6 months
Education
Organized educational sessions at Mumbai Municipal schools and Non-Governmental Organizations to teach Basic Algebra and Science to underprivileged students
Publications
-
GenSample: A Genetic Algorithm for Oversampling in Imbalanced Datasets
See publicationImbalanced datasets are ubiquitous. Classification performance on imbalanced datasets is generally poor for the minority class as the classifier cannot learn decision boundaries well. However, in sensitive applications like fraud detection, medical diagnosis, and spam identification, it is extremely important to classify the minority instances correctly. In this paper, we present a novel technique based on genetic algorithms, GenSample, for oversampling the minority class in imbalanced…
Imbalanced datasets are ubiquitous. Classification performance on imbalanced datasets is generally poor for the minority class as the classifier cannot learn decision boundaries well. However, in sensitive applications like fraud detection, medical diagnosis, and spam identification, it is extremely important to classify the minority instances correctly. In this paper, we present a novel technique based on genetic algorithms, GenSample, for oversampling the minority class in imbalanced datasets. GenSample decides the rate of oversampling a minority example by taking into account the difficulty in learning that example, along with the performance improvement achieved by oversampling it. This technique terminates the oversampling process when the performance of the classifier begins to deteriorate. Consequently, it produces synthetic data only as long as a performance boost is obtained. The algorithm was tested on 9 real-world imbalanced datasets of varying sizes and imbalance ratios. It achieved the highest F-Score on 8 out of 9 datasets, confirming its ability to better handle imbalanced data compared to other existing methodologies.
Courses
-
Artificial Intelligence
-
-
Big Data Analytics
-
-
Cloud Computing
-
-
Compiler Construction
-
-
Computer Networks
-
-
Computer Organization and Architecture
-
-
Cryptography and Network Security
-
-
Current Topics in Computer Science: System Design/Architecture
CS 259
-
Data Mining and Data Warehousing
-
-
Data Structures and Algorithms
-
-
Digital Image Processing
-
-
Fundamentals of Database Systems
-
-
Internet of Things
-
-
Introduction to Data Management and Statistical Computing
BIOSTAT203A-1
-
Introduction to Operating Systems
-
-
Large Scale Data Mining
ECE 219
-
Learning from Text
-
-
Machine Learning
-
-
Neural Networks and Deep Learning
ECE239AS
-
Probabilistic Programming and Relational Learning
CS 267A
-
Software Design Patterns
-
-
Software Engineering
-
-
Statistics and Optimization
-
-
Web Information Management
CS 246
Projects
-
A Neural-based System for Question Answering on SQUAD 2.0
-
- Used a Bidirectional Attention Flow (BiDAF) network to build a question answering system for the SQuAD 2.0 data set
- Explored different embeddings like GloVe, FastText and BERT with the BiDAF model to improve performance
-
Code2Cap: Automated Code Captioning
-
See project- Constructed a novel model that produces a natural language description of an input code snippet written in Python or Java
- Used transformers for embedding the input data as well as predicting the output sequence of words
- Evaluated the effect of using an AST generated from the code snippet as an input feature
-
Automatic Colorization of Grayscale Images
-
- Developed an Autoencoder-based CNN model and a Generative Adversarial Network (GAN) based model for automatic colorization of grayscale images
- Performed prediction of pixel values for RGB and LAB channels to assess the effect of predicting 3 vs 2 values for each pixel
-
Mini Search Engine
-
- Built a small-scale search engine based on the Simple English Wikipedia dataset using a custom analyzer of Elasticsearch
- Used a probabilistic language model-based approach for spell correction to improve search accuracy
- Improved the ranking function by leveraging Latent Dirichlet Analysis to extract latent topics from the corpus
-
Prediction of Flight Delays
-
- Built different supervised learning models like Logistic Regression, Random Forests and Neural Network that predict the occurrence of airline delay from historical flight data
- Used Hadoop and Pig to partition the data and generate a feature matrix
- Analyzed and compared the results produced by these models
-
Probabilistic Database System
-
- Built a Probabilistic Database System to compute the probability of fully-quantified queries in first-order logic
- Implemented different approximation schemes like Monte Carlo sampling and Markov Chain Monte Carlo (MCMC) methods such as Metropolis-Hastings and Gibbs Sampling for evaluating #P-Hard queries and compared their results. -
Image Captioning using Object Detection
-
- Constructed annotation vectors using Deep CNN (based on the VGG deep neural network model) to recognize objects in an image and their locations
- Built a model to feed the annotation vectors into a Recurrent Neural Network to generate a short descriptive sentence
- Implemented the system on Keras API using Tensorflow framework -
Classification of Ball Bearings
-
- Applied Wavelet-based Denoising and Logistic Regression to detect the healthy ball bearings from the defective ones using Python
- Achieved an accuracy of 86% which was among the top 10 in the class
-
Online Gift Shop Website
-
- Made the Software Project Plan for an online store which can be used for buying and selling gift articles
- Developed the front end of the website using HTML, CSS, JavaScript, Bootstrap and MySQL
-
Maze Solving Bot
-
- Built a line following bot which could solve a grid and provide shortest path to destination using Graph Theory
- Programmed the bot using AVR Programming
Honors & Awards
-
Rank Holder at the end of BTech in Computer Engineering
-
Secured 5th Rank at the end of Btech in Computer Engineering in a class of 75 students
Test Scores
-
GRE
Score: 331/340
Quants: 169/170
Verbal: 162/170
AWA: 5.0/6.0 -
TOEFL
Score: 117/120
Reading: 29/30
Listening: 30/30
Writing: 30/30
Speaking: 28/30
Languages
-
English
Full professional proficiency
-
Hindi
Native or bilingual proficiency
-
Marathi
Professional working proficiency
-
Gujarati
Native or bilingual proficiency
-
Spanish
Elementary proficiency
Recommendations received
1 person has recommended Vishwa
Join now to viewMore activity by Vishwa
-
Looks like we're going to welcome two more Hugging Faces to the family next year. My wife is a hero 💛💛💛
Looks like we're going to welcome two more Hugging Faces to the family next year. My wife is a hero 💛💛💛
Liked by Vishwa Karia
-
It was an exciting journey to be a part of this amazing service launch and to keep working on improving this incredible product! Amazon Quick Suite…
It was an exciting journey to be a part of this amazing service launch and to keep working on improving this incredible product! Amazon Quick Suite…
Liked by Vishwa Karia
-
This is a project my team and I have been working on over the last year and I am happy to see it open sourced. We are using it within Meta as well…
This is a project my team and I have been working on over the last year and I am happy to see it open sourced. We are using it within Meta as well…
Liked by Vishwa Karia
-
Couldn’t be prouder of my daughter Saanya Bhasin presenting her cancer research at an international conference in Vienna. From the first spark of…
Couldn’t be prouder of my daughter Saanya Bhasin presenting her cancer research at an international conference in Vienna. From the first spark of…
Liked by Vishwa Karia
-
⭐ Employee Spotlight: Breejesh Rathod ⭐ Meet Breejesh, one of our incredible Engineering leaders at Concentric AI! He currently manages 3…
⭐ Employee Spotlight: Breejesh Rathod ⭐ Meet Breejesh, one of our incredible Engineering leaders at Concentric AI! He currently manages 3…
Liked by Vishwa Karia
-
Honored to serve on the program committee for this year’s PyTorch Conference. Exciting innovations are happening in the inference space and vLLM!…
Honored to serve on the program committee for this year’s PyTorch Conference. Exciting innovations are happening in the inference space and vLLM!…
Liked by Vishwa Karia
-
We're expanding to India. We’re announcing plans to open our second Asia-Pacific office in Bengaluru, India in early 2026. India is one of the most…
We're expanding to India. We’re announcing plans to open our second Asia-Pacific office in Bengaluru, India in early 2026. India is one of the most…
Liked by Vishwa Karia
-
I’ve started a new role as a DevOps Engineer at LiveRamp. Grateful for the opportunity and excited to learn and grow in this next chapter.
I’ve started a new role as a DevOps Engineer at LiveRamp. Grateful for the opportunity and excited to learn and grow in this next chapter.
Liked by Vishwa Karia
-
Today is my last day at Databricks. ~2.5 years ago Ali Ghodsi told me his goal was to build a $100B company. Databricks was at a $38B valuation when…
Today is my last day at Databricks. ~2.5 years ago Ali Ghodsi told me his goal was to build a $100B company. Databricks was at a $38B valuation when…
Liked by Vishwa Karia
-
I shut down Playmo earlier this year. Playmo was my attempt to solve a simple, hard problem: reliable, meaningful care for children while parents…
I shut down Playmo earlier this year. Playmo was my attempt to solve a simple, hard problem: reliable, meaningful care for children while parents…
Liked by Vishwa Karia
Explore top content on LinkedIn
Find curated posts and insights for relevant topics all in one place.
View top content