- Put the English corpus (train.txt, dev.txt, test.txt and, optionally, long_test.txt) into the data/ folder.
- Edit config.json.
- cd into src/ and run the following command to preprocess the corpus and generate the training data.
python make_dataset.py ../data/
- Run the following commands to prepare the model folder. Feel free to open config.json and tune the hyperparameters.
mkdir ../models/your_model_folder
cp ../models/seq2seq/config.json ../models/your_model_folder
- Run the following command to start training.
python train.py ../models/your_model_folder/
- The trained model will be saved in the folder 'your_model_folder/'.
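
Before running the preprocessing step, it can help to confirm that the corpus files are in place. The following is a minimal sketch, not part of this repository: the helper name `check_corpus` is hypothetical, and it assumes only the file names listed in the first step (train.txt, dev.txt, test.txt, and the optional long_test.txt).

```python
from pathlib import Path

# File names taken from the first step above; long_test.txt is optional.
REQUIRED = ("train.txt", "dev.txt", "test.txt")
OPTIONAL = ("long_test.txt",)


def check_corpus(data_dir):
    """Hypothetical helper: report which corpus files are missing.

    Returns a pair (missing_required, missing_optional) of file-name lists.
    """
    data_dir = Path(data_dir)
    missing_required = [name for name in REQUIRED
                        if not (data_dir / name).is_file()]
    missing_optional = [name for name in OPTIONAL
                        if not (data_dir / name).is_file()]
    return missing_required, missing_optional
```

Called from src/ as `check_corpus("../data/")`, an empty first list would suggest the required files are present and `python make_dataset.py ../data/` can be run; the second list only reports whether the optional long_test.txt is absent.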