This repository has been archived by the owner on Aug 4, 2023. It is now read-only.
v1.5.2
github-actions
released this
28 Mar 18:04
·
23 commits
to refs/heads/main
since this release
New Features
- Log last query_params hit before AirflowTaskTimeout (#1058) @stacimc
- Update README.md with documentation reference (#1052) @itemrarity
- Add DAG for terminating long-running queries (#1050) @stacimc
Improvements
- Update Freesound to quarterly, extend timeout (#1068) @stacimc
- Update Flickr large batch handling (#1047) @stacimc
- Add SuggestedSubProvider type (#1040) @stacimc
- Add option to skip specific ingestion errors (#1011) @stacimc
- Add a DAG for backfilling license_url when meta_data is null (#1005) @obulat
- Improve license URL validation (#1028) @obulat
- Add flickr sub provider auditing dag (#1034) @stacimc
- Add Airflow variable used to configure overrides for task timeouts (#976) @stacimc
- Add logging to iNaturalist date check (#1035) @rwidom
- Update
Dockerfile
s with small improvements (#1016) @dhruvkb - Update Flickr to use new time delineated ingester class (#995) @stacimc
Internal Improvements
- Add isort configuration file (#1054) @raiyaj
- Update pgcli version to 3.5.0 (#1070) @AetherUnbound
- Bump apache-airflow[amazon,http,postgres] from 2.5.1 to 2.5.2 (#1064) @dependabot
- Bump pre-commit from 3.1.1 to 3.2.0 (#1065) @dependabot
- Add required stack label to dependabot PRs (#1063) @AetherUnbound
- Remove Implementation section from issue templates (#992) @miikkuu
- Bump pytest-socket from 0.5.1 to 0.6.0 (#1029) @dependabot
- Bump pre-commit from 3.0.2 to 3.1.1 (#1030) @dependabot
- Speed up some tests (#1021) @AetherUnbound
- Add an "Airflow Alert" issue template (#994) @AetherUnbound
- 🔄 synced file(s) with WordPress/openverse (#993) @openverse-bot
- Remove unnecessary dev dependencies (#990) @miikkuu
Bug Fixes
- Add required stack label to dependabot PRs (#1063) @AetherUnbound
- Handle the upper case licenses in the add_license_dag (#1049) @obulat
- Remove watermarked setting for SMK (#1048) @AetherUnbound
- Adjust schedule for long running queries termination (#1051) @obulat
- Use Python to group items by license to speed up the query (#1045) @obulat
- Remove alternate image extraction from SMK, fix foreign landing URL (#1003) @AetherUnbound
- Update
LICENSE
to match main repo (#1042) @dhruvkb - Tweak Flickr time division settings, add logs (#1041) @stacimc
- Add trailing slash to Jamendo thumbnail URLs (#1038) @AetherUnbound
- Adjust Flickr max records to account for incorrect reporting (#1031) @stacimc
- Temporarily turn off scheduled image data refreshes, increase matview refresh timeout (#1036) @stacimc
- Wikimedia: re-attempt large batches with reduced parameter selection (#1008) @AetherUnbound
- Increase image matview refresh timeout, remove retries, better timeouts (#1014) @AetherUnbound
- Terminate PG query when task is killed via Airflow (#717) @rwidom
- Ensure uniqueness of load table names (#1009) @stacimc
- Preserve trailing slashes for WordPress URLs (#1006) @AetherUnbound
- Replaced
execution_date
withlogical_date
(#1001) @sora-san45 - Remove API & Frontend repos from PR reminder check (#1010) @AetherUnbound
- Add dayshift to tsv filenames for reingestion workflows (#969) @stacimc
- Update Europeana endpoint (#974) @stacimc
Credits
Thanks to @AetherUnbound, @dependabot, @dependabot[bot], @dhruvkb, @itemrarity, @miikkuu, @obulat, @openverse-bot, @raiyaj, @rwidom, @sora-san45 and @stacimc for their contributions!