Pre-process Arabic text (remove diacritics, punctuation, and repeated characters)
Usage:

    clean_arabic_text.py [-h] -i INFILE -o OUTFILE

    Pre-process arabic text (remove diacritics, punctuations, and repeating characters).

    optional arguments:
      -h, --help            show this help message and exit
      -i INFILE, --infile INFILE
                            input file.
      -o OUTFILE, --outfile OUTFILE
                            out file.

Example:

    python clean_arabic_text.py -i infile.txt -o outfile.txt

Contributions to improve the code are welcome. Please follow the steps below.
- Fork the project.
- Modify the code and test it to make sure it works.
- Make a pull request.
If you need help with any of these steps, please consult the GitHub Help documentation.
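
When testing a change, it may help to have a small reference for the three cleaning steps named in the description (diacritics, punctuation, repeated characters). The sketch below is an illustration only and is not the code in clean_arabic_text.py: the function name `clean_arabic_text`, the diacritic range, the punctuation set, and the rule of collapsing only runs of three or more identical characters are all assumptions, and the actual script may make different choices.

```python
import re
import string

# Assumption: "diacritics" means the tashkeel marks U+064B..U+0652,
# plus the tatweel (elongation) character U+0640.
DIACRITICS = re.compile("[\u064B-\u0652\u0640]")

# Assumption: ASCII punctuation plus the Arabic comma, semicolon,
# and question mark.
PUNCTUATION = string.punctuation + "\u060C\u061B\u061F"
PUNCT_TABLE = str.maketrans("", "", PUNCTUATION)

# Assumption: a run of three or more identical characters is collapsed
# to a single character; legitimate doubled letters are left alone.
REPEATS = re.compile(r"(.)\1{2,}")


def clean_arabic_text(text):
    """Hypothetical helper: strip diacritics, punctuation, and long repeats."""
    text = DIACRITICS.sub("", text)      # remove tashkeel and tatweel
    text = text.translate(PUNCT_TABLE)   # delete punctuation characters
    text = REPEATS.sub(r"\1", text)      # collapse runs of 3+ to one
    return text
```

A function like this can be applied line by line to the input file and its result compared with the script's output file, keeping in mind that differences may simply reflect the assumptions above.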