Skip to content

Using Spark to analyze a dataset, public or self-collected, and drawing some insights from it

Notifications You must be signed in to change notification settings

janecww/spark-cloud

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

16 Commits
 
 
 
 
 
 
 
 

Repository files navigation

Data Analysis with Spark

This project is to analyze the MovieLens 1M Dataset from Grouplens and draw some insights from it using Apache Spark.

Data File

https://grouplens.org/datasets/movielens/1m/

Techniques Involved

  1. Exploratory Data Analysis & Visualization
  2. Correlation-based Recommendation System
  3. Alternating Least Square (ALS) Matrix Factorization using MLlib

About

Using Spark to analyze a dataset, public or self-collected, and drawing some insights from it

Topics

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published