Page 27 - REC :: M.E. CSE Curriculum and Syllabus - R2019
P. 27

CP19P08                          DATA SCIENCE                          Category   L  T  P  C
                                                                                           PE      3   0   0  3


               Objectives:
                ⚫   To get understand the basics of data science
                ⚫   To get thorough knowledge in analytical methods
                ⚫   To learn and understand data science packages of python
                ⚫   To learn and apply data science algorithms using R language
                ⚫   To study various visualization techniques


               UNIT-I     INTRODUCTION TO DATA SCIENCE                                                     9
               Big  Data  and  Data  Science  Hype-Characteristics  of  Big  Data-  Data  Science  Life  Cycle-Statistical  Methods-
               Probability-Sampling and Sampling Distributions-Statistical Inference-Prediction and Prediction Error-Resampling

               UNIT-II    ANALYTICAL THEORY AND METHODS                                                    9
               Linear  Regression-Simple  Linear  Regression-  Multiple  Linear  Regression-Logistic  Regression-Linear  Discriminant
               Analysis-Bayesian  Methods  -  Introduction  to  Clustering  Techniques  -K  means-  Gaussian  Mixture  Models  and
               Expectations – Maximization – agglomerative clustering – evaluation of clustering – Rand index – mutual information
               based scores – Fowlkes – Mallows index - Ensemble Techniques – Bagging & Boosting

               UNIT-III   DATA SCIENCE USING PYTHON                                                        9
               Data  science  packages-  Numpy  Basics-Pandas-Data  Loading-Data  Wrangling-Plotting  and  Visualization-Data
               Aggregation and Group Operations – Data Exploration – Visualization using python

               UNIT-IV    INTRODUCING R LANGUAGE                                                           9
               R  Basics-R  Objects-R  Notations-  Packages  –  Indexing  Data-  Loading  Data  -  Exploratory  Data  Analysis  using  R-
               Statistical  Methods  for  Evaluation  using  R  -  Data  Science  applications-Time  Series  Forecasting,  Text  Mining  &
               Sentiment Analysis

               UNIT-V     DATA VISUALIZATION                                                               9
               Data  Visualization:  Basic  Principles,  Categorical  and  Continuous  Variables  –  Exploratory  Graphical  Analysis  –
               Creating Static Graphs, Animated Visualizations – Loops, GIFs and Videos.

                                                                                   Total Contact Hours   :  45


               Course Outcomes:
               Upon completion of the course, students will be able to:
                ⚫   Understand the basics of data science
                ⚫   Acquire knowledge of various analytical methods
                ⚫   Able to work with data science packages of python
                ⚫   Able to apply data science algorithms using R language
                ⚫   Obtain knowledge of various visualization techniques


               Reference Books(s) :
                1   Cathy O'Neil and Rachel Schutt. “Doing Data Science, Straight Talk From The Frontline”, O'Reilly. 2014.
                2   Garrett Grolemund, “Hands on programming with R”,O’Reilly,2014
                    Gareth  James,  Daniela  Witten,  Trevor  Hastie,  Robert Tibshirani,  “An  introduction  to statistical  learning  with
                3
                    application in R”, Springer.
                4   Wes McKinney, “Python for Data Analysis”, O'Reilly Media, 2012
                5   Sebastian Raschka, “Python Machine Learning”,Packpub.com,2015
                6   Michael Berthold, David J. Hand, “Intelligent Data Analysis”, Springer, Second Edition, 2007

                7   Nathan Yau, “Data Points: Visualization That Means Something”, Wiley, 2013.
   22   23   24   25   26   27   28   29   30   31   32