ViCorrection: Vietnamese Spelling Correction

Overview

A project for correcting spelling errors in Vietnamese text using character-level Seq2Seq and Seq2Seq-Attention models.

For more details, see training.ipynb or testing.ipynb.
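To illustrate what "character-level" means in practice, here is a minimal sketch of turning Vietnamese text into the integer sequences a Seq2Seq model consumes. It is not taken from the repo: the special tokens and the function names `normalize`, `build_vocab`, `encode`, and `decode` are hypothetical, and the NFC normalization step is an assumption (it keeps each accented letter as a single code point where a precomposed form exists).

```python
import unicodedata

# Hypothetical special tokens; the repo may use different ones.
PAD, SOS, EOS, UNK = "<pad>", "<sos>", "<eos>", "<unk>"


def normalize(text):
    # NFC composes a base letter and its diacritics into one code point
    # where possible, so "ế" is one character rather than three.
    return unicodedata.normalize("NFC", text)


def build_vocab(sentences):
    # Collect every distinct character seen in the (normalized) corpus.
    chars = sorted({ch for s in sentences for ch in normalize(s)})
    itos = [PAD, SOS, EOS, UNK] + chars
    stoi = {tok: i for i, tok in enumerate(itos)}
    return stoi, itos


def encode(text, stoi):
    # Map each character to its id; unseen characters fall back to UNK.
    ids = [stoi.get(ch, stoi[UNK]) for ch in normalize(text)]
    return [stoi[SOS]] + ids + [stoi[EOS]]


def decode(ids, itos):
    # Drop padding and sequence markers, keep everything else.
    specials = {PAD, SOS, EOS}
    return "".join(itos[i] for i in ids if itos[i] not in specials)


if __name__ == "__main__":
    stoi, itos = build_vocab(["tiếng Việt"])
    ids = encode("tiếng Việt", stoi)
    print(ids)
    print(decode(ids, itos))
```

Working at the character level keeps the vocabulary small and lets the model repair errors inside a word, at the cost of longer sequences than a word-level model would see.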

Setup

pip install -r requirements.txt

Create dataset

Set the .csv path and save_path in dataset/create_dataset.py, then run:

cd dataset
python create_dataset.py

Model (character-level)

Neural models in the repo: Seq2Seq and Seq2Seq-Attention, both character-level.

Training

Edit the parameters in params.py, then start training:

python training.py

Performance

| Spell Checker | CER (%) | Full Sequence Acc (%) |
| --- | --- | --- |
| Seq2Seq | 1.34 | 82.3 |
| Seq2Seq-Attention | 1.12 | 85.7 |
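
The two metrics reported above can be sketched as follows. This is an illustration under stated assumptions, not the repo's evaluation code: it assumes CER is the total character-level Levenshtein distance divided by the total number of reference characters, and that full sequence accuracy is the share of predictions that match their reference exactly. The function names are hypothetical.

```python
def edit_distance(a, b):
    # Levenshtein distance via dynamic programming, keeping one row at a time.
    prev = list(range(len(b) + 1))
    for i, ca in enumerate(a, start=1):
        curr = [i]
        for j, cb in enumerate(b, start=1):
            cost = 0 if ca == cb else 1
            curr.append(min(prev[j] + 1,          # delete ca
                            curr[j - 1] + 1,      # insert cb
                            prev[j - 1] + cost))  # substitute or match
        prev = curr
    return prev[-1]


def cer(predictions, references):
    # Assumed definition: total edits / total reference characters, in percent.
    total_edits = sum(edit_distance(p, r) for p, r in zip(predictions, references))
    total_chars = sum(len(r) for r in references)
    return 100.0 * total_edits / total_chars


def full_sequence_accuracy(predictions, references):
    # Percentage of predictions identical to their reference.
    exact = sum(p == r for p, r in zip(predictions, references))
    return 100.0 * exact / len(references)


if __name__ == "__main__":
    refs = ["tôi đi học", "xin chào"]
    preds = ["tôi đi học", "xin chao"]
    print(f"CER: {cer(preds, refs):.2f}%")
    print(f"Full sequence accuracy: {full_sequence_accuracy(preds, refs):.1f}%")
```

The two numbers answer different questions: CER gives partial credit for nearly correct output, while full sequence accuracy counts a sentence as wrong if even one character differs.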