Releases: Living-with-machines/alto2txt
Releases · Living-with-machines/alto2txt
v0.3.4
alto2txt: Extract plain text from newspapers
Converts XML
(in METS 1.8
/ALTO 1.4
, METS 1.3
/ALTO 1.4
, BLN
or UKP
format) publications to plaintext articles and generates minimal metadata.
Full documentation and demo instructions.
Added
- Added
PyPI
version andMIT
license badges toREADME.md
- Added
pytest-cov
with default options to assess documentation - Added
isort
to.pre-commit-config.yaml
to sort import consistency - Added
pycln
to.pre-commit-config.yaml
to check unused imports - Added
pycln
configuration topyproject.toml
- Added
alto2txt
as a command line script inpyproject.toml
Changed
- Switch from
Apache v2.0
license toMIT
license, inline with project recommendations. - Updated
mypy
in.pre-commit-config.yaml
Deprecated
- Replace
extract_publications_text.py
with thealto2txt
command line interface
script specified inpyproject.toml
Removed
setup.py
requirements.txt
Fixed
- Fixed
python = ">3.6.0"
inpyproject.toml
rather than>3.7
for consistency with documentation - Fixed licensing ambiguity (now all should be
MIT
) - Fixed typos in
README.md
- Fixed surperflous imports via
pycln
inpre-commit