This repository was archived by the owner on Jul 21, 2022. It is now read-only.

Memo_ru

Parser and data for the lists.memo.ru website (a database of victims of Soviet repressions). This project is incomplete and under heavy development.

Contact - ibegtin (at) gmail.com

Setting Up

Install the required dependencies:

pip install pyparsing
pip install lxml
pip install pymongo
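
To confirm the three packages are importable before running the scripts, a small check like the one below may help. It is only a sketch: the helper name `missing_modules` is hypothetical and not part of this repository.

```python
import importlib.util

# The three packages listed in the install step above.
REQUIRED = ("pyparsing", "lxml", "pymongo")


def missing_modules(names=REQUIRED):
    """Return the names from `names` that cannot be found on this interpreter."""
    return [name for name in names if importlib.util.find_spec(name) is None]


if __name__ == "__main__":
    missing = missing_modules()
    if missing:
        print("Missing packages: " + ", ".join(missing))
    else:
        print("All dependencies are installed.")
```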

Script description

  • analyze.py - data analysis functions using pyparsing
  • parse_memo.py - loads data into the MongoDB database

Folders description:

  • data - collection of JSON files from lists.memo.ru (needs to be downloaded and unpacked from - )
  • refined - data extracted using analyze.py
  • refined2 - temp data
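
As a rough illustration of how the JSON files in data could be read and loaded into MongoDB, here is a minimal sketch. It is not the code from parse_memo.py. It assumes each file holds either a single JSON object or a list of objects, and the function names, the connection URI, and the database and collection names (`memo`, `persons`) are all hypothetical.

```python
import json
from pathlib import Path


def iter_records(data_dir):
    """Yield one dict per record found in the *.json files directly under data_dir."""
    for path in sorted(Path(data_dir).glob("*.json")):
        with path.open(encoding="utf-8") as fh:
            payload = json.load(fh)
        # Assumption: a file holds either one record or a list of records.
        items = payload if isinstance(payload, list) else [payload]
        for item in items:
            if isinstance(item, dict):
                yield item


def load_into_mongo(records, uri="mongodb://localhost:27017",
                    db_name="memo", collection_name="persons",
                    batch_size=1000):
    """Insert records in batches and return how many were inserted.

    The URI, database name, and collection name are placeholders.
    """
    # Imported here so iter_records can be used without pymongo installed.
    from pymongo import MongoClient

    client = MongoClient(uri)
    collection = client[db_name][collection_name]
    total = 0
    batch = []
    for record in records:
        batch.append(record)
        if len(batch) >= batch_size:
            collection.insert_many(batch)
            total += len(batch)
            batch = []
    if batch:  # insert_many rejects an empty list, so only flush leftovers
        collection.insert_many(batch)
        total += len(batch)
    client.close()
    return total
```

With a MongoDB server running locally, `load_into_mongo(iter_records("data"))` would load every record in one pass. Batching keeps memory use bounded when the data folder is large.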

Data:

Terms of use

Apache License 2.0