Page 46 - REC :: M.E. CSE Curriculum and Syllabus - R2019
P. 46

CP19213                   BIG DATA ANALYTICS LABORATORY                     Category   L  T  P  C
                                                                                              PE      0  0  4  2


               Objectives:
                ⚫   To implement Map Reduce concept to process big data.
                ⚫   To apply linear models to analyze big data.
                ⚫   To analyze big data using machine learning techniques.
                ⚫   To realize storage of big data using Hbase, MongoDB.
                ⚫   To develop big data applications for streaming data using Apache Spark.


                                                      List of  Experiments
                1   Configure and run Hadoop and HDFS.
                2   Implement word count/frequency programs using MapReduce.
                3   Implement a MapReduce program to process a weather dataset.
                4   Implement Linear and Logistic Regression.
                5   Implement SVM/Decision tree classification techniques.
                6   Implement clustering techniques – Hierarchical and K-Means.
                7   Implement an application that stores big data in Hbase/MongoDB/Pig using Hadoop/R.
                8   Install, Deploy & Configure Apache Spark Cluster.
                9   Run Apache Spark applications using Scala.
               10  Perform Data Visualization using any Online tools like Plot.ly, advanced Excel etc.,
               11  Install and execute Rapid Miner/Tableau and perform data analysis. ( use www.data.gov.in for real-time data)
                                                                                   Total Contact Hours   :   60

               PLATFORM NEEDED
                 Hardware     Systems with Virtualization Enabled, dual core processor with 8 GB RAM
                  Software     Hadoop, R package, Spark, Rapid Miner, Tableau.

               Course Outcomes:
               Upon completion of the course, students will be able to
                ⚫   Process big data using Hadoop framework.
                ⚫   Build and apply linear and logistic regression models.
                ⚫   Perform data analysis with machine learning methods.
                ⚫   Perform graphical data analysis.
                ⚫   Create applications for big data analytics.
   41   42   43   44   45   46   47   48