Data Scientist/Data Analyst using R/Tableau/Excel
Summary
•Currently pursuing interests in Data Science (R). Knowledgeable on variety of Data Science & Machine Learning algorithms both Supervised and Unsupervised Learning. •Keen to develop deeper expertise and gain exposure in the ML space.
•My current ML knowledge includes:
Linear (Simple & Multiple) and Logistic Regression,
Predictive Modelling
Support Vector Machine, Decision trees and Random Forest,
Naïve Bayes
K-Means Clustering, K Nearest Neighbors,
Text Mining,
Forecasting &
Time Series Analysis
•Performed data mining, data cleaning & explored data visualization techniques on a variety of data stored in spreadsheets and text files using R and plotting the same using ggplot2 function
•Sufficient exposure to designing and developing Tableau reports and dashboards for data visualization using R & Tableau
•Sufficient knowledge about the Natural Language Processing using R
•Seeking a career with a progressive organization which will utilize my data science / analytic skills using R Language or other development skills, abilities and education.
•A quick learner, eager to prove myself as a valuable member of a team and achieve the career heights I aim for.
•Exposure to Big Data Hadoop environment and its components HDFS, Apache Pig, Hive, Sqoop, MapReduce, HBase, Zoo Keeper, Flume
Education
2005 – B.E (Computer Engineering) from University of Mumbai
Experience
– Jul’16 to till date – Mindteck India Pvt Ltd placed at Dell Technologies (EMC), Bengaluru, India as Senior Software Engineer
Working on Legacy Modernization in testing batch & online components in the new environment and responsible for migration of data from mainframe to MS SQL Server
POCs/Case Studies for Analytics:
• Currently working on POCs provided by EMC to pull the data using Impala & Kudu using Cloudera Hadoop environment and visualizing the trends on Tableau using R Forecasting Models
• Working on Twitter Sentiment Analysis using Shiny & R
• Extensively worked on Twitter Sentiment Analysis using R packages & Tableau to map the followers & friends as per the geographic locations
• Extensively worked on Titanic, Twitter & Iris data to explore the various Regression & Predictive Models using R Packages & used tableau for visualization
• Extensively worked on data exploration, data cleaning methods using R
– Apr’11 – Apr’16 with Capgemini India Pvt Ltd, Mumbai, India & Detroit and Chicago, USA as Senior Consultant
Associated with Tenneco Automotive, Michigan, US to support their Cullinet Application spanning complete life cycle of orders from Order Management to Billing system and extensively worked on migrating their plants from mainframe
– Feb’06 – Apr’11 with Tata Consultancy Services, Mumbai & Guadalajara, Mexico as IT Analyst
Associated with GE Commercial Finance division of GE, Connecticut, US to work on various Data Conversion & Syndication projects
Skills
- Machine Learning
- Statistics (Hypothesis testing, confidence interval finding, ANOVA)
- R (dplyr, lm, glm, decision tree, n-fold cross validation, random forests, rpart, ggplot, shiny)
- ETL
- Machine Learning
- R
- Tableau
- Informatica
- Mainframe
- Oracle
- MS Excel
- PL/SQL
- MS SQL Server
Specialties
- Application development, Application Maintenance, data scientist, Machine Learning, Project Management, QA, SDLC, Team Management
Spoken Languages
- English (Fluent), Hindi, Marathi, Telugu