Big Data - Capstone Project
- Offered byCoursera
Big Data - Capstone Project at Coursera Overview
Duration | 21 hours |
Start from | Start Now |
Total fee | Free |
Mode of learning | Online |
Official Website | Explore Free Course |
Credential | Certificate |
Big Data - Capstone Project at Coursera Highlights
- 21%
- started a new career after completing these courses.
- 27%
- got a tangible career benefit from this course.
- 13%
- got a pay increase or promotion.
- Earn a shareable certificate upon completion.
Big Data - Capstone Project at Coursera Course details
- Welcome to the Capstone Project for Big Data! In this culminating project, you will build a big data ecosystem using tools and methods form the earlier courses in this specialization. You will analyze a data set simulating big data generated from a large number of users who are playing our imaginary game "Catch the Pink Flamingo". During the five week Capstone Project, you will walk through the typical big data science steps for acquiring, exploring, preparing, analyzing, and reporting. In the first two weeks, we will introduce you to the data set and guide you through some exploratory analysis using tools such as Splunk and Open Office. Then we will move into more challenging big data problems requiring the more advanced tools you have learned including KNIME, Spark's MLLib and Gephi. Finally, during the fifth and final week, we will show you how to bring it all together to create engaging and compelling reports and slide presentations. As a result of our collaboration with Splunk, a software company focus on analyzing machine-generated big data, learners with the top projects will be eligible to present to Splunk and meet Splunk recruiters and engineering leadership.
Big Data - Capstone Project at Coursera Curriculum
Simulating Big Data for an Online Game
Welcome to the Big Data Capstone Project
Welcome from Splunk: Rob Reed World Education Evangelist
A Summary of Catch the Pink Flamingo
A Conceptual Schema for Catch the Pink Flamingo
Planning, Preparation, and Review
A Game by Eglence Inc. : Catch The Pink Flamingo
Overview of the Catch the Pink Flamingo Data Model
Overview of Final Project Design
Downloading the Game Data and Associated Scripts
Understanding the CSV Files Generated by the Scripts
Optional Review of Splunk
'Catch the Pink Flamingo-Data Exploration with Splunk
Aggregate Calculations Using Splunk
Filtering the Data With Splunk
Data Exploration With Splunk
Data Classification with KNIME
Review: Classification Using Decision Tree in KNIME
Review: Interpreting a Decision Tree in KNIME
Workflow Overview for Building a Decision Tree in KNIME
Description of combined_data.csv
Clustering with Spark
Informing business strategies based on client base
Practice with PySpark MLlib Clustering
Graph Analytics of Simulated Chat Data With Neo4j
Understanding the Simulated Chat Data Generated by the Scripts
Graph Analytics of Catch the Pink Flamingo Chat Data Using Neo4j
Reporting and Presenting Your Work
Week 5: Bringing It All Together
Final project preparation
Final Submission
Congratulations! Some Final Words...
Part 2: Help us connect your video to your LinkedIn profile