Skip to content

creationw/SparkMLADS

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

63 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Scalable Machine Learning with Spark and R on HDInsight

Instructors: Robert Horton, Mario Inchiosa, Ali Zaidi, Katherine Zhao

Requirements

  • An Azure subscription

Tutorial Cluster Deployment Instructions

  1. Go to https://github.com/Azure/SparkMLADS/tree/master/azure-templates

  2. Click the “Deploy to Azure” button

  3. Fill in the form and click “Purchase”. IMPORTANT: Set Cluster Login User Name = "admin" and Ssh User Name = "sshuser". Here is an example:

    Image of creating a new cluster

  4. Wait 30-40 minutes for the cluster to deploy

  5. We will run our R scripts using the RStudio IDE. To launch RStudio in your browser, from the cluster overview in the Azure portal, click "R Server dashboards" and then "R Studio server". At the first login screen, enter "admin" and the password you supplied. At the second login screen, enter "sshuser" and the password you supplied.

    Image of the cluster overview

  6. Once in RStudio, go to the Files pane in the lower right-hand corner and click on "SparkMLADS" and then "Code". Here you will find the directories for the hands-on tutorial scripts.

Contributing

This project has adopted the Microsoft Open Source Code of Conduct. For more information see the Code of Conduct FAQ or contact opencode@microsoft.com with any additional questions or comments.

About

MLADS Spark content

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published