Skip to content

Takes a text and parses it into sentences. Then uses nltk to determine the most common phonemes in the text, and chooses sentences that are both short and contain high numbers of common phonemes.

Notifications You must be signed in to change notification settings

hudsonmiears/Automatic-Text-Selector

Repository files navigation

Automatic-Text-Selector

Reads a text and selects sentences from it in order to create a body of text that is both naturalistic and has a roughly equal distributions of English diphones.

The main script is in cleanup.py.

About

Takes a text and parses it into sentences. Then uses nltk to determine the most common phonemes in the text, and chooses sentences that are both short and contain high numbers of common phonemes.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages