Skip to content

Code velds encapsulating creation of statistical summary on conllu data.

License

Notifications You must be signed in to change notification settings

veldhub/veld_code__analyse_conllu

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

29 Commits
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

veld chain veld_code__analyse_conllu

*work in progress*

This repo contains code velds encapsulating creation of statistical summary on conllu data. For now, the observed features are:

  • count of total tokens
  • count of total lemma
  • count of lemma normalized by token (to put the lemma in relation with token)
  • count of occurrence of each (Universal Dependencies) part of speech tag

The code can be adapted to other use cases but is primarily used in this chain veld: https://github.com/veldhub/veld_chain__eltec_udpipe_inference

requirements

  • git
  • docker compose (note: older docker compose versions require running docker-compose instead of docker compose)

how to use

A code veld may be integrated into a chain veld, or used directly by adapting the configuration within its yaml file and using the template folders provided in this repo. Open the respective veld yaml file for more information.

./veld.yaml : Launches a jupyter notebook providing statistical analysis

docker compose -f veld.yaml up

About

Code velds encapsulating creation of statistical summary on conllu data.

Topics

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published