Program Pipeline
The Data Science with Artificial Intelligence program is a hybrid program that is a mix of self-paced learning, mentor led LIVE group instruction and the career bootcamp
In your self-paced learning, pick up the basics of ML concepts. In the mentor led LIVE instruction, you can learn the problem solving skills from a real data scientist. The mentor led instruction is designed in such a way that irrespective of your progress on GLabs, you should be able to pick up the skills and get value. The more you progress on glabs, the more value you will get. Only pre-requisite - you must be comfortable with what is covered in the basecamp. In the career bootcamp we will build our profile in an accelerated pace.
FOR BEST RESULTS: Plan your study in such a way that you finish the projects on Glabs in 6 months and attend as many LIVE sessions as you can. Do atleast 1-2 hackathons within this time (start the hackathon from month 3). Have 1-2 awesome portfolio projects. Then you are ready to make a transition in your career.
Recommended Program Effort
Non-Programmers - Effort of 8-10 hours per week needed. Programmers - Effort of 5-7 hours per week needed.
Self-paced learning will proceed → Check it out here . As you keep going through the Glabs content in your own pace, attend the LIVE mentor session. The LIVE sessions will cover the following topics. Please note that the order might vary and topics might change based on feedback of industry and mentors. But this should give you a good idea of what to expect. For every session when scheduled in Glabs, we will have the pre-requisites defined in the session description. Try to go through the pre-requisites before the session.
Data Science Pipeline
A typical Data Science pipeline is as follows. In the Data Science Transition Program, you will learn how to execute the different parts of the pipeline.
Identify Business Problem
A problem or pain point for the business is presented to you. As a data scientist, you must be able to formulate a data science problem from the business problem that you would be ready to solve.
Data Collection
In this stage, based on the problem that you have defined, collect the data that is required for solving the problem.
Data Cleaning and EDA
The next stage is to prepare the data by processing the collected data to solve the data science problem. And a data scientist spends about 60-70% of his time at this stage.
Model Building & Evaluation
Then, we train a machine learning model on this data. All the ML algorithms - both supervised and unsupervised learning are used here. The output of this model is then used to figure out the right insights for the business and solve the problem. If the model is found to be unsuitable or not giving satisfactory results, then you go back, collect more data and rebuild the ML model.
Reporting
Finally you would have to cut through the technical jargon and convey the key insights to the business. This is an important step where you showcase the solution to the business problem and convey the recommended data-driven decisions to the stakeholder. If the required pain point is not solved, you go back to reframe the business problem.
The LIVE sessions on Saturdays are as follows :
Self Learning Sprints on Glabs
Career Bootcamp Expectations
The Data Science Pipeline is iterative and non-linear. Hence your journey is also non-linear. Think of it like a Christopher Nolan movie - stay through the initial few scenes and as you go towards the end, all the pieces fall in place. We hope you enjoy the transition journey with us.