Skip to content
This repository has been archived by the owner on Apr 7, 2020. It is now read-only.

Latest commit

 

History

History
22 lines (18 loc) · 1.33 KB

File metadata and controls

22 lines (18 loc) · 1.33 KB

Hadoop Performance Comparison Project

This project is to compare performance between a Hadoop DFS and a Centralized File System. From the lab experiment results, we try to find the inflection point by which Hadoop will obtain better performance than single machine.

Purpose:

This project has already done before. There are even 305 downloads by people who are interested in Hadoop from 2/13/2017 to 10/12/2017. The reason should be the code that I attached on my paper so that people can directly use it, but it's picture code format. For more convenient use, I create this repo to attach my code here so that people can download code from here directly.
alt text

Paper topic:

Performance Comparison of a Hadoop DFS to a Centralized File System of a Single Machine

Link to the project paper:

http://repository.stcloudstate.edu/csit_etds/14/

Notes:

  1. About Hadoop sort code, this is the optimized one. If you need to check the version before optimizing, please check the paper.
  2. If anyone finds any difference between this code version and the paper code version, FOLLOW the paper version.
  3. Number of downloads is 331 (1/15/2018)!

  4. Number of downloads is 532 (4/6/2020)!