Multi-language code character detection tool

This is a Python tool for detecting invalid characters in code files. It supports detecting English, Chinese, Japanese, Korean and Russian characters and can help you find special or invisible characters in your code that may cause problems, and you can use it to detect invalid characters in your code.

Features

Supports multiple programming language files (default: .py, .java, .c, .cpp, .js, .html, .css, .txt)
Support for detecting characters in multiple languages:
- English letters and numbers
- Chinese (CJK Unified Kanji)
- Japanese (Hiragana and Katakana)
- Korean (Hiragana and Katakana)
- Russian (Cyrillic alphabet)
- Vietnamese (Latin alphabet)
Recursive checking of the entire project catalog
Precisely locate the row and column numbers of invalid characters
Displays the Unicode encoding value of invalid characters
Command line parameter support
Detailed error output

Usage

Basic usage:

python invalid_char_checker.py /path/to/your/project

Specify the file type:

python invalid_char_checker.py /path/to/your/project -e .py, .java, .c, .cpp, .js, .html, .css, .txt

Supported characters

ASCII characters:
- English letters (a-z, A-Z)
- Numbers (0-9)
- Common punctuation and operators
- Blank characters (spaces, tabs, line breaks, etc.)
Unicode characters:
- Chinese characters (CJK Unified Kanji)
- Japanese hiragana and katakana
- Korean characters
- Russian characters (Cyrillic alphabet)

Caution

Files must be encoded in UTF-8
If an encoding error is encountered, the program will display an appropriate error message
It is recommended that you test the program on a small scale before working on a large project.

Error handling

If the specified directory does not exist, the program will display an error message and exit.
If the file is not UTF-8 encoded, the program will display an encoding error message.
If the file is not UTF-8 encoded, an encoding error will be displayed. Other errors while processing the file will be caught and detailed information will be displayed.

License

MIT License

Name		Name	Last commit message	Last commit date
Latest commit History 11 Commits
LICENSE		LICENSE
README.md		README.md
README_ja.md		README_ja.md
README_ko.md		README_ko.md
README_ru.md		README_ru.md
README_vi.md		README_vi.md
README_zh.md		README_zh.md
README_zh_TW.md		README_zh_TW.md
invalid_char_checker.py		invalid_char_checker.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Multi-language code character detection tool

Features

Usage

Supported characters

Caution

Error handling

License

About

Releases

Packages

Contributors 2

Languages

License

oslook/char-checker

Folders and files

Latest commit

History

Repository files navigation

Multi-language code character detection tool

Features

Usage

Supported characters

Caution

Error handling

License

About

Topics

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Contributors 2

Languages

Packages