Commit

updated READMEs
malitsky authored and malitsky committed Feb 12, 2018
1 parent 8a0ebcb commit 9172a67
Showing 2 changed files with 29 additions and 4 deletions.
25 changes: 24 additions & 1 deletion README.md
@@ -1 +1,24 @@
# spark-mpi-tf
# SPARK-MPI-TF

This project demonstrates the Spark-MPI approach in the context of Spark-based distributed deep learning
applications built on TensorFlow. Several other projects address the same direction, such as
[BigDL](https://github.com/intel-analytics/BigDL) and
[TensorFlowOnSpark](https://github.com/yahoo/TensorFlowOnSpark). In comparison with these alternative
solutions, Spark-MPI aims to derive an application-neutral mechanism, based on the MPI Process Management Interface (PMI),
for the effortless integration of the Big Data and HPC ecosystems.

## Prerequisites

1. [Spark-MPI](https://github.com/SciDriver/spark-mpi): PMI-based approach for integrating the Spark platform with MPI applications

2. [Horovod](https://github.com/uber/horovod): MPI-based training framework for TensorFlow

## Examples

The MNIST Spark-Horovod [IPython notebook](https://github.com/SciDriver/spark-mpi-tf/blob/master/examples/mnist/spark_horovod.ipynb) demonstrates handwritten digit classification (for reference, see the [TensorFlow Tutorial](https://www.tensorflow.org/tutorials/layers)).






8 changes: 5 additions & 3 deletions examples/mnist/README.txt
@@ -1,6 +1,9 @@
This Spark-MPI example demonstrates the integration of Horovod's MPI-based deep learning engine
with the Spark platform within the context of the MNIST application.

Running Spark-MPI with Jupyter
-------------------------------

Running with Jupyter on a single node
-------------------------------------

export HYDRA_PROXY_PORT=55555

@@ -9,4 +12,3 @@ export PYSPARK_DRIVER_PYTHON_OPTS='notebook --no-browser --port=7777'

pyspark --master local[*]

pkill -9 "hydra_pmi_proxy"
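
The single-node steps listed in this README can be collected in a small helper. The following is a minimal sketch, not part of the commit: the function names `jupyter_pyspark_env`, `pyspark_command`, and `cleanup_command` are hypothetical, and the `PYSPARK_DRIVER_PYTHON=jupyter` setting is an assumption (a standard PySpark variable that the visible part of the diff does not show). The ports, the `--master local[*]` option, and the `pkill` target are taken from the lines above.

```python
import os
import shlex


def jupyter_pyspark_env(hydra_port=55555, notebook_port=7777, base_env=None):
    """Return a copy of the environment with the README's variables set."""
    env = dict(os.environ if base_env is None else base_env)
    # From the README: port used by the Hydra PMI proxy.
    env["HYDRA_PROXY_PORT"] = str(hydra_port)
    # Assumption: not visible in the diff; standard PySpark driver setting.
    env["PYSPARK_DRIVER_PYTHON"] = "jupyter"
    # From the README: options passed to the Jupyter driver.
    env["PYSPARK_DRIVER_PYTHON_OPTS"] = (
        f"notebook --no-browser --port={notebook_port}"
    )
    return env


def pyspark_command(master="local[*]"):
    """Argument list for starting PySpark on a single node."""
    return ["pyspark", "--master", master]


def cleanup_command():
    """Argument list for stopping leftover Hydra PMI proxy processes."""
    return ["pkill", "-9", "hydra_pmi_proxy"]


if __name__ == "__main__":
    env = jupyter_pyspark_env()
    for key in ("HYDRA_PROXY_PORT", "PYSPARK_DRIVER_PYTHON",
                "PYSPARK_DRIVER_PYTHON_OPTS"):
        # Print shell-ready export lines rather than launching anything.
        print(f"export {key}={shlex.quote(env[key])}")
    print(shlex.join(pyspark_command()))
    print(shlex.join(cleanup_command()))
```

The sketch only prints the commands, so it can be inspected before anything is launched; passing the returned environment and argument lists to `subprocess.run` would be one way to execute them.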
