What changed:
- Fix some bug in source.
- Add word embedding support, test passed on 64 size word2vec.
- Add predict function, already tested on test set.
Description:
- model.py -> NLP model with encoder and decoder.
- preprocess.py -> preprocessing raw data. including word segment, stop words removing.
- data.py -> utility class used to load training / test data from batch.
- train.py -> train data.
- service.py -> websocket service.
- main.py -> entrypoint for training / test / configuration
Things todo:
- Replace LSTM with GRU.
- Add F1 score and confusion matrix for accuracy.