Skip to content

Latest commit

 

History

History
53 lines (42 loc) · 2.58 KB

README.md

File metadata and controls

53 lines (42 loc) · 2.58 KB

In and Out: Optimizing Overall Interaction in Probabilistic Graphs under Clustering Constraints

Overview

This project is developed as part of the following research paper:

D. Mandaglio, A. Tagarelli, F. Gullo (2020). In and Out: Optimizing Overall Interaction in Probabilistic Graphs under Clustering Constraints. In Procs. of the 26th ACM SIGKDD Conference on Knowledge Discovery and Data Mining, August 23rd - August 27th, 2020, CA, USA. DOI: https://doi.org/10.1145/3394486.3403190

Please cite the above paper in any research publication you may produce using this code or data/analysis derived from it.

Folders

  • datasets: contains the original data as well as the preprocessed data (as described in the paper). Biggest networks (to be unzipped in datasets folder) can be downloaded at the following link
  • code: it contains this project code and links to the competing methods
  • output: it stores all results produced by the algorithms

Usage

From the folder 'optimize_interactions/code', run the following command:

run_offline_algorithm.py [-h] -d DATASET -a {MIL,D-MIL} [-s SEED]
                                [-r RUNS] [--hill_climbing [HILL_CLIMBING]]
                                [-I HILL_CLIMBING_ITERS]
                                [--save_only_avg [SAVE_ONLY_AVG]]

Dependencies

  • numpy
  • python-igraph

Positional arguments

  -d DATASET, --dataset DATASET
                        Input dataset, whose name identifies a particular subfolder in 'datasets/'
  -a {MIL,D-MIL}, --alg {MIL,D-MIL}
                        Selected algorithm 

Optional arguments

  -s SEED, --seed SEED  Random generation seed -- for reproducibility (default value 100)
  -r RUNS, --runs RUNS  Number of runs of the selected randomized algorithm (default value 50) 
  --hill_climbing [HILL_CLIMBING]
                        It enables the hill climbing postprocessing step
  -I HILL_CLIMBING_ITERS, --hill_climbing_iters HILL_CLIMBING_ITERS
                        Maximum number of iterations for the hill climbing
                        algorithm (default value 8)
  --save_only_avg [SAVE_ONLY_AVG]
                        if True (default value), only average results across several runs are
                        stored, otherwise the results for each run is saved in a specific sub-
                        folder in 'output/'
                    

Default values for -s, -r and -I correspond to the settings relating to the results presented in the paper.