Jiaqi Zhang
Mountain View, California, United States
7K followers
500+ connections
Courses
-
Applied Data Science
95852
-
Client-Side Web Technologies
08724
-
Cloud Computing
15619
-
Coding Boot Camp
11601
-
Data Mining
95791
-
Data Structures and Algorithms
95771
-
Data Structures for Application Programmers
08722
-
Distributed Systems
95702
-
Intermediate Java II
95713
-
Internet Technologies
95733
-
Java Smartphone Development
18641
-
Python for Developers
95880
Projects
-
Cab Matching Service with Dynamic Pricing
{ Java, Kafka, Samza }
• Implemented a Samza application that consumed a driver-locations stream and an events stream and wrote results to a matching stream;
• Incorporated distance, the driver’s rating, the driver’s salary, and the client’s gender preference as factors in the matching system;
• Introduced a surge pricing factor to enable dynamic pricing based on supply and demand in a specific block.
-
Social Graph Analysis
{ Apache Spark, Scala, PageRank }
• Analyzed the Twitter social graph dataset (over 60 GB) using the Apache Spark framework;
• Implemented the PageRank algorithm to find the most influential users on the network.
-
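As an illustration of the PageRank step, here is a minimal single-machine sketch in Python rather than the Spark/Scala code the project used. The function name `pagerank`, the damping factor of 0.85, the iteration count, and the tiny follower graph are all assumptions made for the example and are not taken from the project.

```python
def pagerank(edges, damping=0.85, iterations=50):
    """Power-iteration PageRank over (follower, followee) edges."""
    nodes = sorted({node for edge in edges for node in edge})
    out_links = {node: [] for node in nodes}
    for src, dst in edges:
        out_links[src].append(dst)

    n = len(nodes)
    rank = {node: 1.0 / n for node in nodes}
    for _ in range(iterations):
        # Rank held by nodes with no out-links is spread evenly over all nodes.
        dangling = sum(rank[node] for node in nodes if not out_links[node])
        base = (1.0 - damping) / n + damping * dangling / n
        new_rank = {node: base for node in nodes}
        for src in nodes:
            if out_links[src]:
                share = damping * rank[src] / len(out_links[src])
                for dst in out_links[src]:
                    new_rank[dst] += share
        rank = new_rank
    return rank


# Hypothetical toy graph: an edge (a, b) means "a follows b".
edges = [("alice", "carol"), ("bob", "carol"), ("carol", "alice"), ("dave", "carol")]
ranks = pagerank(edges)
most_influential = max(ranks, key=ranks.get)
```

In this toy graph `carol` is followed by three of the four users, so she ends up with the highest rank. A dataset of the size mentioned above would need the distributed formulation instead of in-memory dictionaries.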
Input Text Prediction Model
{ Java, MapReduce, HBase }
• Worked with a corpus from the Wikipedia dataset and generated a list of n-grams from it;
• Computed a statistical language model from the n-gram counts and stored it in HBase;
• Connected the language model to a web interface that recommended words based on the user's input.
-
Auto Scaling, Load Balancing and Caching
{ Java, Bash, Auto Scaling, Load Balancing, Healthcheck, Multithreading }
• Managed instances automatically with the AWS and Azure cloud APIs; developed custom algorithms to balance load and auto scale, and assigned threads to check connections and replace unhealthy instances;
• Implemented an LRU cache in the front end to exploit both temporal and spatial locality.
-
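The LRU eviction policy mentioned above can be sketched in a few lines of Python. This is only an illustration under assumptions: the class name `LRUCache`, its `get`/`put` methods, and the capacity of 2 are hypothetical, the original project was written in Java, and the sketch covers only the temporal-locality side (it does no prefetching of neighboring keys).

```python
from collections import OrderedDict


class LRUCache:
    """Fixed-capacity cache that evicts the least recently used entry."""

    def __init__(self, capacity):
        if capacity <= 0:
            raise ValueError("capacity must be positive")
        self.capacity = capacity
        self._items = OrderedDict()  # oldest entry first, newest last

    def get(self, key, default=None):
        if key not in self._items:
            return default
        self._items.move_to_end(key)  # mark as most recently used
        return self._items[key]

    def put(self, key, value):
        if key in self._items:
            self._items.move_to_end(key)
        self._items[key] = value
        if len(self._items) > self.capacity:
            self._items.popitem(last=False)  # drop the least recently used


cache = LRUCache(capacity=2)
cache.put("a", 1)
cache.put("b", 2)
cache.get("a")     # touching "a" makes "b" the least recently used
cache.put("c", 3)  # capacity exceeded, so "b" is evicted
```

`OrderedDict` keeps both lookups and recency updates at constant time, which is why it is a common choice for this kind of sketch.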
Contact List Web Application
{ JavaScript, AJAX, AngularJS, Bootstrap }
• Developed a web application that allowed users to manage their contact lists;
• Stored the contact information in local storage, including name (key), addresses, and images, and displayed the locations in Google Maps.
-
Car Configuration Application
{ Java, Socket, J2EE, Apache Tomcat, MySQL, Multithreading }
• Designed and implemented a car configuration application featuring car model upload and configuration;
• Stored and fetched the data using JDBC;
• Implemented Java servlets and JSP to handle client requests.
-
Wikipedia Trending Topics Analytics
{ Java, Bash, AWK, MapReduce }
• Implemented a MapReduce job flow to analyze Wikipedia page-view logs for December 2015;
• Filtered the data with regular expressions, aggregated the view counts, and generated a daily timeline of page views;
• Analyzed the trending topics with AWK and Bash scripts.
-
TripAdvisor Reviews Helpfulness Prediction
{ Python, R, Text Processing, Classification Models, Cross-Validation }
• Built a prediction model for TripAdvisor reviews with three helpfulness levels;
• Ran ten-fold cross-validation for various classification models to choose optimized parameters and reduce over-fitting;
• Chose logistic regression as the final model based on ROC curves, and analyzed patterns for helpful reviews based on the coefficients.
-
Food Delivery Android Application
{ Java, Android, Servlet, MySQL }
• Built an online ordering application that recommended restaurants to customers by location and customized filter settings;
• Designed the UI, entity, and remote layers to support order management and profile management for both kinds of users;
• Implemented the server side to handle HTTP requests and manage transaction data.
-
Twitter Analytics Web Services
{ Java, Bash, MySQL, HBase, MapReduce, ETL, Undertow }
• Processed and loaded over 1 TB of raw Twitter data into both MySQL and HBase databases, and implemented the server with Undertow;
• Designed and optimized the database schemas and settings to scale and improve throughput, applying both replication and sharding;
• Reached 10k QPS for different queries, and ranked third out of 80 teams in the final live test.
-
Airline On-Time Performance Analysis | Course: Data Mining
• Obtained airline, airplane, and weather data, and used greedy search for feature selection;
• Applied machine learning techniques such as logistic regression, SVM, KNN, Naïve Bayes, and random forest to build 2-class and 3-class delay classifiers, and evaluated the classifiers by comparing accuracy and AUC scores;
• Analyzed the feasibility of a personalized flight delay warning system, and recommended focus areas that would help shareholders improve airline on-time performance.
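To make the evaluation step concrete, here is a small Python sketch of how accuracy and AUC can be computed for a 2-class classifier. The helper names `accuracy` and `auc`, the 0.5 decision threshold, and the labels and scores are all made up for illustration; they are not the project's data or code.

```python
def accuracy(labels, predictions):
    """Fraction of predictions that match the true labels."""
    correct = sum(1 for y, p in zip(labels, predictions) if y == p)
    return correct / len(labels)


def auc(labels, scores):
    """AUC as the chance that a random positive outscores a random negative.

    Ties between a positive and a negative score count as half a win.
    """
    positives = [s for y, s in zip(labels, scores) if y == 1]
    negatives = [s for y, s in zip(labels, scores) if y == 0]
    if not positives or not negatives:
        raise ValueError("need at least one positive and one negative label")
    wins = 0.0
    for p in positives:
        for q in negatives:
            if p > q:
                wins += 1.0
            elif p == q:
                wins += 0.5
    return wins / (len(positives) * len(negatives))


# Hypothetical data: 1 = delayed, 0 = on time; scores are predicted delay probabilities.
labels = [1, 0, 1, 0, 1]
scores = [0.9, 0.4, 0.35, 0.2, 0.8]
predictions = [1 if s >= 0.5 else 0 for s in scores]

acc = accuracy(labels, predictions)
area = auc(labels, scores)
```

Accuracy depends on the chosen threshold, while AUC summarizes ranking quality across all thresholds, which is one reason to compare classifiers on both.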
Honors & Awards
-
Outstanding Graduate
Xiamen University
-
National Scholarship
Chinese Ministry of Education
-
Dean's List
Wang Yanan Institute for Studies in Economics, Xiamen University