Fixes and tests for Unit Arabic (Emirates) and Kurdish languages.
Pre-release
Pre-release
Donatello-za
released this
22 Mar 16:17
·
32 commits
to master
since this release
- This should fix special unicode symbols for Kurdish and Arabic (Emirates) being included with keyword and phrase results.
- Added LangParseOptions class with which to set custom language orientated punctuation symbols/characters. This will help remove unicode symbols from keywords and phrases.
- Punctuation symbols is automatically pulled in (if there is one available) for the specified language code.
- Added additional Kurdish and Arabic unit tests to test the punctuation.
- Made sure 100% unit test coverage is performed.