Stanford University
Stanford University Logo

Mining Massive Data Sets by Stanford University 

  • Private University
  • Institute Icon8180 acre campus
  • Estd. 1885

Mining Massive Data Sets by Stanford University
 at 
Stanford University 
Overview

Data mining can lead to the development of new products, services, and solutions that address various challenges

Mode of learning

Online

Official Website

Go to Website External Link Icon

Course Level

UG Certificate

Mining Massive Data Sets by Stanford University
 at 
Stanford University 
Highlights

  • Earn a certificate from Stanford University
  • Learn from industry experts
Details Icon

Mining Massive Data Sets by Stanford University
 at 
Stanford University 
Course details

Who should do this course?
  • For individuals who want to enhance their knowledge & skills in the field
What are the course deliverables?
  • Apply data mining techniques and algorithms to solve real-world problems and case studies across various domains such as e-commerce, social media, healthcare, finance, and cybersecurit
  • Consider ethical and legal considerations in mining massive datasets, including issues related to privacy, data security, bias, fairness, and transparency, and explore strategies for responsible data usage and handling
  • Develop skills in communicating insights derived from large-scale datasets effectively through visualization techniques such as charts, graphs, and dashboards, and in presenting findings to diverse stakeholders in a clear and understandable manner
  • Understand graph mining algorithms and network analysis techniques for extracting insights from large-scale network datasets, including community detection, centrality measures, and link prediction
  • Explore techniques for mining unstructured data sources such as text documents and web content, including natural language processing (NLP) methods, sentiment analysis, topic modeling, and web crawling and scraping
Read more
More about this course
  • This courses introduces modern distributed file systems and MapReduce, including what distinguishes good MapReduce algorithms from good algorithms in general
  • The rest of the course is devoted to algorithms for extracting models and information from large datasets
  • Participants will learn how Google's PageRank algorithm models importance of Web pages and some of the many extensions that have been used for a variety of purposes
  • We'll cover locality-sensitive hashing, a bit of magic that allows you to find similar items in a set of items so large you cannot possibly compare each pair
  • When data is stored as a very large, sparse matrix, dimensionality reduction is often a good way to model the data, but standard approaches do not scale well; we'll talk about efficient approaches
Read more

Mining Massive Data Sets by Stanford University
 at 
Stanford University 
Curriculum

Week 1:

MapReduce

Link Analysis -- PageRank

Week 2:

Locality-Sensitive Hashing -- Basics + Applications

Distance Measures

Nearest Neighbors

Frequent Itemsets

Week 3:

Data Stream Mining

Analysis of Large Graphs

Week 4:

Recommender Systems

Dimensionality Reduction

Week 5:

Clustering

Computational Advertising

Week 6:

Support-Vector Machines

Decision Trees

MapReduce Algorithms

Week 7:

More About Link Analysis -- Topic-specific PageRank, Link Spam.

More About Locality-Sensitive Hashing

Mining Massive Data Sets by Stanford University
 at 
Stanford University 
Entry Requirements

Eligibility criteriaUp Arrow Icon
Conditional OfferUp Arrow Icon
  • Not mentioned

Other courses offered by Stanford University

– / –
    – / –
71.42 L
– / –
    – / –
36.64 L
– / –
    – / –
56.41 L
2 years
A++ Shiksha Grade
#1 QS
– / –
    – / –
71.42 L
View Other 213 CoursesRight Arrow Icon

Mining Massive Data Sets by Stanford University
 at 
Stanford University 
Contact Information

Address

450 Serra Mall, Stanford, CA 94305, USA

Stanford ( California)

Go to College Website ->