Providing Technology Training and Mentoring For Modern Technology Adoption
The main purpose of the course is to give students the ability to use Microsoft R Server to create and run an analysis on a large dataset, and show how to utilize it in Big Data environments, such as a Hadoop or Spark cluster, or a SQL Server database.
After completing this course, students will be able to:
In addition to their professional experience, students who attend this course should have:
Module 1: Microsoft R Server and R ClientExplain how Microsoft R Server and Microsoft R Client work.Lessons
Module 2: Exploring Big Data
At the end of this module the student will be able to use R Client with R Server to explore big data held in different data stores.
Module 3: Visualizing Big DataExplain how to visualize data by using graphs and plots.Lessons
Module 4: Processing Big DataExplain how to transform and clean big data sets.Lessons
Module 5: Parallelizing Analysis OperationsExplain how to implement options for splitting analysis jobs into parallel tasks.Lessons
Module 6: Creating and Evaluating Regression ModelsExplain how to build and evaluate regression models generated from big dataLessons
Module 7: Creating and Evaluating Partitioning ModelsExplain how to create and score partitioning models generated from big data.Lessons
Module 8: Processing Big Data in SQL Server and HadoopExplain how to transform and clean big data sets.Lessons