Big Data Modeling and Management Systems
- Offered by Coursera
Big Data Modeling and Management Systems at Coursera Overview
Duration | 13 hours |
Start from | Start Now |
Total fee | Free |
Mode of learning | Online |
Official Website | Explore Free Course |
Credential | Certificate |
Big Data Modeling and Management Systems at Coursera Highlights
- Shareable Certificate: Earn a certificate upon completion.
- 100% online: Start instantly and learn on your own schedule.
- Course 2 of 6 in the Big Data Specialization
- Flexible deadlines: Reset deadlines in accordance with your schedule.
- Approx. 13 hours to complete
- Language: English. Subtitles: Arabic, French, Portuguese (European), Italian, Vietnamese, Korean, German, Russian, Turkish, English, Spanish
Big Data Modeling and Management Systems at Coursera Course details
- Once you've identified a big data issue to analyze, how do you collect, store, and organize your data using Big Data solutions? In this course, you will experience various data genres and management tools appropriate for each. You will be able to describe the reasons behind the evolving plethora of new big data platforms from the perspective of big data management systems and analytical tools. Through guided hands-on tutorials, you will become familiar with techniques using real-time and semi-structured data examples. Systems and tools discussed include: AsterixDB, HP Vertica, Impala, Neo4j, Redis, SparkSQL. This course provides techniques to extract value from existing untapped data sources and to discover new data sources.
- At the end of this course, you will be able to:
- * Recognize different data elements in your own work and in everyday life problems
- * Explain why your team needs to design a Big Data Infrastructure Plan and Information System Design
- * Identify the frequent data operations required for various types of data
- * Select a data model to suit the characteristics of your data
- * Apply techniques to handle streaming data
- * Differentiate between a traditional Database Management System and a Big Data Management System
- * Appreciate why there are so many data management systems
- * Design a big data information system for an online game company
- This course is for those new to data science. Completion of Intro to Big Data is recommended. No prior programming experience is needed, although the ability to install applications and utilize a virtual machine is necessary to complete the hands-on assignments. Refer to the specialization technical requirements for complete hardware and software specifications.
- Hardware Requirements:
- (A) Quad Core Processor (VT-x or AMD-V support recommended), 64-bit; (B) 8 GB RAM; (C) 20 GB disk free. How to find your hardware information: (Windows): Open System by clicking the Start button, right-clicking Computer, and then clicking Properties; (Mac): Open Overview by clicking on the Apple menu and clicking "About This Mac." Most computers with 8 GB RAM purchased in the last 3 years will meet the minimum requirements. You will need a high-speed internet connection because you will be downloading files up to 4 GB in size.
- Software Requirements:
- This course relies on several open-source software tools, including Apache Hadoop. All required software can be downloaded and installed free of charge (except for data charges from your internet provider). Software requirements include: Windows 7+, Mac OS X 10.10+, Ubuntu 14.04+, or CentOS 6+; VirtualBox 5+.
Big Data Modeling and Management Systems at Coursera Curriculum
Introduction to Big Data Modeling and Management
Welcome to Big Data Modeling and Management
Why is this a New Course in the Big Data Specialization?
Summary of Introduction to Big Data (Part 1)
Summary of Introduction to Big Data (Part 2)
Summary of Introduction to Big Data (Part 3)
Big Data Management "Must-Ask Questions"
Data Ingestion
Data Storage
Data Quality
Data Operations
Data Scalability and Security
Energy Data Management Challenges at ConEd
Gaming Industry Data Management: Q&A with Apmetrix CTO Mark Caldwell
Flight Data Management at FlightStats: A Lecture by CTO Chad Berkley
Slides: Summary of Introduction to Big Data
Slides: Big Data Management
Reading on Storage Systems
Slides: Energy Data Management Challenges at ConEd
Slides: Flight Data Management at FlightStats
Downloading and Installing the Cloudera VM Instructions (Windows)
Downloading and Installing the Cloudera VM Instructions (Mac)
Instructions for Downloading Hands On Datasets
Big Data Modeling
Introduction to Data Models
Data Model Structures
Data Model Operations
Data Model Constraints
Introduction to CSV Data
What is a Relational Data Model?
What is a Semistructured Data Model?
Exploring the Relational Data Model of CSV Files
Exploring the Semistructured Data Model of JSON data
Exploring the Array Data Model of an Image
Exploring Sensor Data
Slides: What Is A Data Model?
Introduction to CSV Data
Slides: What Is A Relational Data Model?
Slides: What is a Semistructured Data Model?
Exploring the Relational Data Model of Comma Separated Values (CSV)
Exploring the Semistructured Data Model of JSON data
Exploring the Array Data Model of an Image
Exploring Sensor Data
Practical Quiz for Week 2 Hands-On Lectures
Big Data Modeling (Part 2)
Vector Space Model
Graph Data Model
Other Data Models
Exploring the Lucene Search Engine's Vector Data Model
Exploring Graph Data Models with Gephi
Slides: Vector Space Model
Slides: Graph Data Model
Slides: Other Data Models
Exploring Vector Data Models with Lucene
Exploring Graph Data Models with Gephi
Data Models Quiz
Working With Data Models
Data Model vs. Data Format
What is a Data Stream?
Why is Streaming Data different?
Understanding Data Lakes
Exploring Streaming Sensor Data
Exploring Streaming Twitter Data (Optional)
Slides: Data Model vs. Data Format
Slides: What is a Data Stream?
Slides: Why is Streaming Data Different?
Slides: Understanding Data Lakes
Exploring Streaming Sensor Data
Instructions for Creating a Twitter App (Optional)
Exploring Streaming Twitter Data (Optional)
Data Formats and Streaming Data Quiz
Big Data Management: The "M" in DBMS
DBMS-based and non-DBMS-based Approaches to Big Data
From DBMS to BDMS
Redis: An Enhanced Key-Value Store
Aerospike: a New Generation KV Store
Semistructured Data – AsterixDB
Solr: Managing Text
Relational Data – Vertica
Slides: DBMS-based and non-DBMS-based Approaches to Big Data
Slides: From DBMS to BDMS
BDMS Quiz
Designing a Big Data Management System for an Online Game
A Game by Eglence Inc.: Catch The Pink Flamingo