datacollect

A collection of tools to collect and download various data.

I often write simple scripts and tools to collect data for various "data science" tasks. I am gathering them in one central repository because they might be useful to others!

Contents


Important Note
Please note that I developed and tested these tools in Python 3.x. The scripts may not work flawlessly in Python 2.7.x because of its more challenging Unicode handling.



[back to top]

A command line tool to download song lyrics given artist names and song titles.



[back to top]

A command line tool that downloads your personal Twitter timeline in CSV format, with an optional keyword filter.
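To illustrate what a keyword filter over timeline data in CSV form could look like, here is a minimal sketch. It is not the tool's actual implementation: the column names (`created_at`, `text`), the helper name `filter_rows`, and the sample rows are all assumptions made up for this example.

```python
import csv
import io


def filter_rows(csv_text, keyword, column="text"):
    """Return the rows whose `column` contains `keyword` (case-insensitive).

    The column name "text" is an assumption for this sketch, not
    necessarily the header the real tool writes.
    """
    reader = csv.DictReader(io.StringIO(csv_text))
    needle = keyword.lower()
    return [row for row in reader if needle in row[column].lower()]


# Made-up sample data standing in for a downloaded timeline.
sample = (
    "created_at,text\n"
    "2015-01-01,Learning Python today\n"
    "2015-01-02,Coffee time\n"
)

matches = filter_rows(sample, "python")
for row in matches:
    print(row["created_at"], row["text"])
```

Matching on a lowercased copy keeps the filter case-insensitive without altering the stored text.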

A tutorial for turning your Twitter timeline into a word cloud.



[back to top]

A command line tool to download popular tags for a list of songs from last.fm, e.g., for various data mining projects.



[back to top]

A command line tool that creates an info table from a list of PDB files.
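As a rough idea of the kind of information such a table might hold, the sketch below reads the fixed-width `HEADER` record of a PDB file (classification in columns 11-50, deposition date in columns 51-59, ID code in columns 63-66). Which fields the actual tool extracts is not stated here, so the chosen fields, the helper name `header_info`, and the sample record are assumptions for illustration only.

```python
def header_info(pdb_text):
    """Extract ID, classification, and deposition date from a HEADER record.

    Returns None if the text contains no HEADER line. The selection of
    fields is an assumption for this sketch.
    """
    for line in pdb_text.splitlines():
        if line.startswith("HEADER"):
            return {
                "pdb_id": line[62:66].strip(),          # columns 63-66
                "classification": line[10:50].strip(),  # columns 11-50
                "deposited": line[50:59].strip(),       # columns 51-59
            }
    return None


# Illustrative record built with explicit padding so the columns line up.
sample_pdb = (
    "HEADER".ljust(10) + "HYDROLASE".ljust(40) + "01-JAN-00" + "   " + "1ABC\n"
    "TITLE     EXAMPLE ENTRY\n"
)

info = header_info(sample_pdb)
print(info)
```

Calling `header_info` on each file in a list and collecting the resulting dictionaries would give one table row per structure.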

[back to top]

A command line tool for downloading 3D structures of small chemical molecules from http://zinc.docking.org.



[back to top]

A command line tool to collect fantasy soccer data from the Premier League.