Gokul Venugopal

Creative and Business-minded Computer Engineer with 4+ years of experience in developing cutting-edge engineering solutions with a wide range of banking and technology features. Skilled in applying data science to extract valuable insights and engineering useful software features helping the businesses grow. In current role, identified several major bottlenecks and improved software performance by 30% for a Inc. 5000 fastest growing private company. _

Skills

  • Programming Languages
  • :
  • Python
  • Java
  • R
  • C/C++
  • CUDA
  • SQL
  • MATLAB
  • HTML
  • CSS
  • JS
  • Verilog
  • Arduino

  • Frameworks
  • :
  • Django
  • Spring
  • Hibernate
  • Bootstrap

  • Machine Learning
  • :
  • Sci-kit Learn
  • NLTK
  • Apache Spark
  • Numpy
  • Pandas
  • Scipy
  • Matplotlib

  • Deep Learning
  • :
  • LSTM
  • CNN
  • RNN
  • Tensorflow
  • ggplot2
  • Keras
  • OpenCV
  • Caret
  • GANs

  • Tools & Utilities
  • :
  • Git
  • SVN
  • Eclipse
  • RStudio
  • Tableau
  • VPN
  • PyCharm
  • Jupyter
  • MySQL
  • LEGO
  • Excel

  • IDE/Editors
  • :
  • PyCharm
  • IntelliJ
  • Vim
  • Visual Studio
  • Eclipse
  • RStudio

  • Software Engineering
  • Data Visualization
  • Natural Language Processing
  • Probability and Statistics

Education

University of Houston
Master of Science, Computer Systems Engineering
AUG 2018 - MAY 2020

Hewlett-Packard Enterprise Data Science Institute
Graduate Certificate in Engineering Data Science
AUG 2018 - MAY 2020

University of Kerala
Bachelor of Technology, Electronics and Communication Engineering
AUG 2011 - MAY 2015

Experience

Software Engineer

Instafuel Houston, TX* June 2020 - Present
  • Designed and implemented an alarm system that detects any anomalies in the daily operations and guides the respective team to take the necessary actions.
  • Configured AWS EC2 instance with CI/CD to host a python Django-based web application and databases.
  • Proposed the use of Elasticsearch instead of PostgreSQL and implemented it resulting in a 100% improvement in data retrieval and aggregation.
  • Devised a Python framework that generates reports by data analyzes and data processing to fit operational needs.

Teaching Assistant

College of Technology University of Houston Houston, TX Aug 2018 - May 2020
  • Administered academic activities Senior Design Lab, Senior Design Lecture and Digital Electronics Lab for four semesters.
  • Mentored 340+ students over the period of two years in their a year-long final project with 99% success rate.
  • Assisted Professor in developing a new course work with MATLAB and LEGO robots.
  • Managed the course website using Cascade CMS.

Data Analyst Intern

Enriched Data LLC Houston, TX May 2018 – Aug 2018
  • Conducted research to find obsolete property types resulting in the removal of 43 types and reclassification of records in 40 types out of 248 property types.
  • Increased handling capacity of the search algorithm to find comparable properties by 10 times and increased overall search efficiency by 45%.
  • Determined 200K outliers from a data set of 30 million records by conducting data analysis to find anomalies.
  • Developed a prototype that displays the nearby properties with proximity as low as 0.1 miles.

Senior Engineer

Mindtree Ltd. Bangalore, India OCT 2015 – JUL 2018
  • Delivered 30+ critical enhancements using Java (Hibernate & Spring Framework) on client's legacy application
  • Owned data migration for the account and upgraded existing SQL procedure to fetch millions of records with an improvement of 20% in time
  • Created Autosys batch jobs to maintain millions of records of client/user activity chronologically. This activity was able to optimise storage by 20%
  • Oversaw customer interaction and development activities for 9 applications.
  • Analysed customer feedback and history of incidents to suggest value-adds.
  • Played a significant role in implementing Agile Methodology in the account.
  • Proposed and developed a web portal for client using JSP & JS as front end, web services and Spring-Hibernate Java classes as backend, reducing the process time from 2 days to 5 minutes.

Portfolio

Guesstimate Accuracy Improvement

This project was done in association with O'Connor & Association. The objective was to build the best model to predict the estimated sale price of commercial properties. Implemntation involved extensive data cleaning and feature engineering followed by modelling. Out of the models tested (Linear Regression, SVR, CNN, Decision Tree, Random Forest, Ensemble Forest, SVR), Random Forest came out to be the best with 24% Coefficient of dispersion compared to 68% of the existing algorithm.

Hand Gesture Recognition using SVM with CUDA

Through this project, I designed a Linear SVM to classify hand language from MNIST data set in GPU and CPU. CUDA implementation of SVM was then compared with CPU implementation using NVIDIA profiler. GPU version took only 22 microseconds comparing to 1298 microseconds of CPU version.

Pollutant Forecasting using Time Series modelling with geographical model

The objective of this project was to detect the pollutant concentration (PM 2.5) and gaseous concentration of SO2, NO, CO2, NO2 for every hour. A geographical model was used to add a feature that described the effect of wind on a location based on the elevation. Although Linear Regression, RNN and Holtz-Winter model was tested, ARIMA model turned out to be best with 94.13% accuracy.

Bike Sharing Prediction Problem using R

The prediction problem based on a famous Kaggle competition dataset, Bike Sharing Data Set, was implemented using Random Forest in R with a R2-square of 0.8435. The model was fined tuned using grid search to obtain a R2-square value of 0.9586.

Certifications

Complete Guide to Elasticsearch

Udemy UC-92775c22-f127-4365-91c3-93649706fbe8 Dec 2020

Spark and Python for Big Data with PySpark

Udemy UC-79a84901-fab8-4b4a-af19-00d9f3aeddd2 Mar 2020

Data Science and Machine Learning Bootcamp with R

Udemy UC-653a8caa-0ad2-416f-80e4-79d62ed8a9e0 Feb 2020

Natural Language Processing

Udemy UC-NQLNQTUC Dec 2019

Tableau Training for Data Science

Udemy UC-Z0L5LETY July 2019

Python for Data Science Expert

Edureka FXR6EBRP Dec 2017

Restful Web Services Professional

Spring People May 2017

Advanced Spring Course

Skillspeed Dec 2016