Skip to content

Latest commit

 

History

History
39 lines (24 loc) · 1.1 KB

README.md

File metadata and controls

39 lines (24 loc) · 1.1 KB

SoFIFA Scraper

This is the custom SoFIFA scraper built for our project of Big Data Computing 2021-22 at Sapienza University of Rome.

By Daniele Solombrino and Davide Quaranta.

Requirements

Only Scrapy 2.6.1

Usage

cd src
scrapy crawl sofifa

To save the output as CSV:

scrapy crawl sofifa -o out.csv -t csv -a

To set a specific year to scrape:

scrapy crawl sofifa -o out.csv -t csv -a year=13

Extending the scraper with custom years

To scape other years than the currently supported ones, it is needed to:

  1. Find the "year key" that SoFIFA uses to identity a date.
  2. Add the key to the YEAR_KEYS dictionary in the utils file.

Finding the year key is simple: just go to the players page on SoFIFA and select an year/date, then note in the ?r=x value in the URL. For example, ?r=220019 refers to Dec 9 2021 (FIFA 22).

Note that across different FIFA versions, players' fields may be different, hence it may be needed to manually change the fields to scrape.