Automobile Data exploration
In this session we are doing some basic analysis of automobile data which the learners can further expand on. This analysis can be used to find patterns and resolve quality issues either in the nick of time or prevent them from happening altogether.
Data Cleaning and EDA
IPL data analysis
In this session, we perform analysis to decide which factors led one of the teams winning. This would help plan out strategy for the future sessions.
Data Cleaning and EDA
Big mart regression
The data scientists at BigMart have collected 2013 sales data for 1559 products across 10 stores in different cities. The aim is to build a predictive model and find out the sales of each product at a particular store.
Model Building, Model Evaluation
Election data visualization
The Lok Sabha election is a very complex affair as it involves a lot of factors. There are more than 700 million voters with more than 800,000 polling stations. It is this very fact that makes it a perfect topic to analyze.
Data Cleaning and EDA
ML studio Azure
Azure ML Studio provides you an interactive, visual workspace where your drag and drop data sets and analysis are converted to an interactive canvas. This session will give you an idea on building a low code Machine Learning Solution
Model Building, Model Evaluation
Visualizing cricket performance
We want to know as to what happens during an IPL match which raises several questions in our mind. This analysis is done to know which factors led one of the teams to win and how does it matter.
Data Cleaning and EDA
Feature selection with Breast cancer data
Breast cancer (BC) is one of the most common cancers among women worldwide, representing the majority of new cancer cases. This analysis aims to observe which features are most helpful in predicting malignant or benign cancer and to see general trends that may aid us in model selection and hyperparameter selection.
Model Building, Model Evaluation
Ted Data Analysis
Since the time we have begun watching TED Talks, they have never ceased to amaze us.we are attempting to find insights about the world of TED, its speakers and its viewers and try to answer a few questions.
Data Cleaning and EDA
Low Code Machine Learning Solution
Creating a low code solution using useful libraries like Sweetviz, pycare and pandas profiling
Model Building, Model Evaluation
Intro to Plotly
In this session we will understand how to go through a new library with Plotly. Plotly is a library used to create interactive visualizations.
Data Cleaning and EDA
Startup Data Analysis
There are a lot of innovative startups coming up in the region and a lot of funding for these startups as well. We will analyze the startup eco system in this session
Data Cleaning and EDA
Plotly Dash
Dash is an open-source Python framework used for building analytical web applications. It is a powerful library that simplifies the development of data-driven applications. This session will help us create an interactive web app using Dash.
Reporting
A/B testing
A/B testing is an essential element while developing product in an organization. In this session we will be understanding A/B testing and its impact on business.
Model Building, Model Evaluation
Salary Prediction
With the data given of the income of various individuals, we will study the data and predict in which salary bracket do they come under
Model Building, Model Evaluation
Daily cleaning with exit survey
Are employees who only worked for the institutes for a short period of time resigning due to some kind of dissatisfaction? What about employees who have been there longer? We will try to answer these questions in this session.
Data Cleaning and EDA
Time series - FB Prophet
In this session we will get started with using time series data in a hands on manner. We will understand various ways to use and gain insights from a time series data
Model Building, Model Evaluation
Cleaning Tata with Regex
Natural Language Processing is widely being applied in the world for various purposes at the moment. To make it work the input data has to be created in a specific manner, it requires a lot of data cleaning. In this session we will understand the data cleaning required for Natural Language Processing
Data Cleaning and EDA
Data collection - web scraping
Data extraction from web(Ranging from manual copy paste to complex automations)Websites come in different formats resulting in need of different web scrapers. The core process remains the same though.
Data Collection
Customer Marketing Strategy with Clustering
Segmentation in marketing is a technique used to divide customers or other entities into groups based on attributes such as behaviour or demographics. Here we will be using Credit card data to segment the customers
Model Building, Model Evaluation
Deployment with Flask/Heroku
We just dont create a ML model but there is a need to deploy it as well. In this session we will be creating an interactive web app using flask and deploying a ML model using Heroku
Reporting
Chennai water Management analysis
We will be analyzing the various water bodies of Chennai and try to solve the water crisis the city is facing and suggestions to avoid the same in the future
Data Cleaning and EDA
Credit Delinquency Prediction
Delinquency describes something or someone who fails to accomplish that which is required by law, duty, or contractual agreement, such as the failure to make a required payment or perform a particular action.This use-case requires learners to improve on the state of the art in credit scoring, by predicting the probability that somebody will experience financial delinquency in the next two years
Model Building, Model Evaluation
Medical Insurance code along
The cost of treatment depends on many factors: diagnosis, type of clinic, city of residence, age and so on. We have no data on the diagnosis of patients. But we have other information that can help us to make a conclusion about the health of patients and practice regression analysis.
Model Building, Model Evaluation
Basics of NLP with Twitter Data
In this session we will understand the basics of how to deal with text data to be used for various applications of NLP.
Model Building, Model Evaluation