This is the code and data used to produce the results from the EMNLP 2023 paper Addressing Linguistic Bias through a Contrastive Analysis of Academic Writing in the NLP Domain.
The results from the paper are produced with python 3.6. We suggest creating your own environment. Packages required are as follows:
– spacy
– claucy
– seaborn
– pandas
– nltk
– scipy
– matplotlib
To run the code in this project:
– git clone https://github.com/robert1ridley/linguisticBias.git
– cd linguisticBias
– python {analysis file}
(Replace analysis file
with any of the following: complete_cohesion_analyses.py
, complete_lexical_complexity_analyses.py
, complete_morphological_analyses.py
, complete_syntactic_analyses.py
)