Eduonix
Eduonix Logo

Data Science:Data Mining & Natural Language Processing in R 

  • Offered byEduonix

Data Science:Data Mining & Natural Language Processing in R
 at 
Eduonix 
Overview

Duration

13 hours

Mode of learning

Online

Schedule type

Self paced

Difficulty level

Beginner

Credential

Certificate

Data Science:Data Mining & Natural Language Processing in R
 at 
Eduonix 
Highlights

  • Start instantly and learn at your own schedule.
  • Lifetime Access. No Limits!
  • Learn online from any location with practical examples
  • Learn visualization, stats, machine learning, data mining, and neural networks the easiest way
Read more
Details Icon

Data Science:Data Mining & Natural Language Processing in R
 at 
Eduonix 
Course details

What are the course deliverables?
  • Use machine learning in R for data/text mining and natural language processing
  • Learn clustering, classification and regression in R Pull out insights from text data and Twitter Conveniently use packages like caret, dplyr to work with real data in R
  • Learn to use common NLP packages for extracting insights from text data
More about this course
  • This course will help you implement the methods using real data obtained from different sources.After taking this course, you will easily use packages like caret, dplyr to work with real data in R. You will also learn to use the common NLP packages to extract insights from text data.

Data Science:Data Mining & Natural Language Processing in R
 at 
Eduonix 
Curriculum

Section 1 : INTRODUCTION TO THE COURSE: The Key Concepts and Software Tools

Introduction

Data and Scripts For the Course

Introduction to R and RStudio

Start with Rattle

Conclusion to Section 1

Section 2 : Reading in Data from Different Sources in R

Read in Data from CSV and Excel Files

Read Data from a Database

Read Data from JSON

Read in Data from Online CSVs

Read in Data from Online HTML Tables-Part 1

Read in Data from Online HTML Tables-Part 2

Read Data from Other Sources

Conclusions to Section 2

Section 3 : Exploratory Data Analysis and Data Visualization in R

Remove NAs

More Data Cleaning

Exploratory Data Analysis(EDA): Basic Visualizations with R

More Exploratory Data Analysis with xda

Introduction to dplyr for Data Summarizing-Part 1

Introduction to dplyr for Data Summarizing-Part 2

Data Exploration & Visualization With dplyr & ggplot2

Pre-Processing Dates-Part 1

Pre-Processing Dates-Part 2

Plotting Temporal Data in R

Twist in the (Temporal) Data

Associations Between Quantitative Variables- Theory

Testing for Correlation

Evaluate the Relation Between Nominal Variables

Cramer's V for Examining the Strength of Association Between Nominal Variable

Section 4 : Data Mining for Patterns and Relationships

What is Data Mining?

Association Mining with Apriori

Apriori with Real Data

Visualize the Rules

Association Mining with Eclat

Eclat with Real Data

Section 5 : Machine Learning for Data Science

How is Machine Learning Different from Statistical Data Analysis?

What is Machine Learning (ML) About? Some Theoretical Pointers

Section 6 : Unsupervised Classification- R

K-means Clustering

Fuzzy K-Means Clustering

Weighted K-Means Clustering

Hierarchical Clustering in R

Expectation-Maximization (EM) in R

Use Rattle for Unsupervised Clustering

Conclusions to Section 6

Section 7 : Dimension Reduction

Dimensionality Reduction-theory

PCA

Removing Highly Correlated Predictor Variables

Variable Selection Using LASSO Regression

Variable Selection With FSelector

Boruta Analysis for Feature Selection

Conclusions to Section 7

Section 8 : Supervised Learning Theory

Some Basic Supervised Learning Concepts

Pre-processing for Supervised Learning

Section 9 : Supervised Learning: Classification

What are GLMs?

Logistic Regression Models as Binary Classifiers

Linear Discriminant Analysis (LDA)

Binary Classifier with PCA

Our Multi-class Classification Problem

Classification Trees

More on Classification Tree Visualization

Decision Trees

Random Forest (RF) classification

Examine Individual Variable Importance for Random Forests

GBM Classification

Support Vector Machines (SVM) for Classification

More SVM for Classification

Conclusions to Section 9

Section 10 : Supervised Learning: Regression

Ridge Regression in R

LASSO Regression in R

Generalized Additive Models (GAMs) in R

Boosted GAMs

MARS Regression

CART-Regression Trees in R

Random Forest (RF) Regression

GBM Regression

Compare Models

Conclusions to Section 10

Section 11 : Introduction to Artificial Neural Networks (ANN)

What are Artificial Neural Networks?

Neural Network for Binary Classifications

Neural Network with PCA for Binary Classifications

Neural Network for Regression

More on Neural Networks- with neuralnet

Identify Variable Importance in Neural Networks

Section 12 : More Web-scraping and Text Data Mining

Read in Text Data from an HTML Page

Explore Amazon with R

More Webscraping With rvest-IMDB Webpage

Prior to Mining Data from Twitter

Extract Tweets Using R

More Twitter Data Extraction Using R

Get Data from Facebook Using R

Conclusions to Section 12

Section 13 : Gaining Insights from Text Data- Text Mining and Natural Language Processing (NL

Explore Tweet Data

Visualize Tweet Sentiment Wordcloud- India's Demonetization Policy

More Wordclouds: Amazon Review Data

Word Frequency in Text Data

Tweet Sentiments- India's Demonetization Policy

Tweet Sentiments- Mugabe's Ouster

Examine the Polarity of Text

Polarity of Individual Tweets

Topic Modelling a Document

Topic Modelling Multiple Documents

Conclusions to Section 13

Section 14 : Text Data and Machine Learning

EDA With Text Data

Identify Deceptive Reviews With Supervised Classification

Identify Spam Emails with Supervised Classification

Other courses offered by Eduonix

199
10 hours
Intermediate
– / –
4 hours
Intermediate
– / –
4 hours
Intermediate
View Other 119 CoursesRight Arrow Icon

Data Science:Data Mining & Natural Language Processing in R
 at 
Eduonix 
Students Ratings & Reviews

4.5/5
Verified Icon2 Ratings
M
Md Sohail
Data Science:Data Mining & Natural Language Processing in R
Offered by Eduonix
4
Learning Experience: The course is pretty tough to understand but if you have the capability of understanding very well you can do it
Faculty: The faculty is super cool he explains me again and again with a lot of patience and understanding The thing i like about is machine learning which is easy to understood the concept it is quite interesting
Course Support: Because it is a top paying IT job
Reviewed on 23 Dec 2022Read More
Thumbs Up IconThumbs Down Icon
View 1 ReviewRight Arrow Icon
qna

Data Science:Data Mining & Natural Language Processing in R
 at 
Eduonix 

Student Forum

chatAnything you would want to ask experts?
Write here...