Case embedding #27

aquemy · 2019-09-04T14:02:28Z

To go beyond naive BoW and TF-IDF, we should investigate case embedding.

It could be done at several level since we have the tree of documents per paragraph, per subsection, section and obviously the whole document.

We should also create two separate embedding, one for the descriptive features and one for the documents because the later are available after the judgement (by definition) such that it makes no sense to build a system to predict the outcome.

Not sure what method would be the most appropriate. Maybe starting by a word2vec or sent2vec?

aquemy added the enhancement Enchancement of existing features label Sep 4, 2019

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Case embedding #27

Case embedding #27

aquemy commented Sep 4, 2019

Case embedding #27

Case embedding #27

Comments

aquemy commented Sep 4, 2019