This notebook uses the SpaCy Chinese model to tokenize and analyze parts of speech for a short Buddhist text. Mistakes in the tags were corrected by hand to train an improved model for Pre-Modern Literary Chinese Texts. It was built with Pandas, SpaCy, and Xiaoquan Kong's Chinese SpaCy model. It implements a Part of Speech tagger and visualizations using diSplaCy.
-
Notifications
You must be signed in to change notification settings - Fork 0
This notebook explores using the SpaCy Chinese model to tokenize and analyze parts of speech for a short Buddhist text. Mistakes in the tags were corrected by hand to train an improved model for Pre-Modern Literary Chinese Texts. It was built with Pandas, SpaCy, and Xiaoquan Kong's Chinese SpaCy model. It implements a Part of Speech tagger and v…
kelseymour/Chinese-NLP_Buddhist-Texts
Folders and files
Name | Name | Last commit message | Last commit date | |
---|---|---|---|---|
Repository files navigation
About
This notebook explores using the SpaCy Chinese model to tokenize and analyze parts of speech for a short Buddhist text. Mistakes in the tags were corrected by hand to train an improved model for Pre-Modern Literary Chinese Texts. It was built with Pandas, SpaCy, and Xiaoquan Kong's Chinese SpaCy model. It implements a Part of Speech tagger and v…
Topics
Resources
Stars
Watchers
Forks
Releases
No releases published
Packages 0
No packages published