
This is the code for the canonical segmenter presented in: Katharina Kann, Ryan Cotterell and Hinrich Schütze. Neural Morphological Analysis: Encoding-Decoding Canonical Segments. In Proceedings of the 2016 Conference on Empirical Methods in Natural Language Processing, Austin, USA, November 2016.

Usage:

  • Train the encoder-decoder on the segmentation data.
  • Run the trained model on the test data several times; each run draws samples from the distribution the model defines.
  • Store the samples in a pickle file.
  • Run the reranker using the provided script.
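
As a rough illustration of the sampling and pickling steps, the sketch below collects candidate segmentations over several runs and writes them to a pickle file. It is not the repository's code: `sample_segmentation` is a hypothetical stand-in for one sampled decoding of the trained encoder-decoder, and the dictionary layout (word mapped to a list of candidates) is an assumption, so check the provided reranker script for the format it actually expects.

```python
import os
import pickle
import random
import tempfile
from collections import defaultdict


def sample_segmentation(word, rng):
    """Hypothetical stand-in for one sampled decoding of the trained model.

    It closes a segment after each character with probability 0.3; the real
    encoder-decoder would be called here instead.
    """
    pieces, current = [], ""
    for ch in word:
        current += ch
        if rng.random() < 0.3:
            pieces.append(current)
            current = ""
    if current:
        pieces.append(current)
    return " ".join(pieces)


def collect_samples(words, n_runs, seed=0):
    """Decode every word n_runs times and keep the distinct candidates."""
    rng = random.Random(seed)
    samples = defaultdict(set)
    for _ in range(n_runs):
        for word in words:
            samples[word].add(sample_segmentation(word, rng))
    # Plain dict of sorted lists, so the pickle contains only built-in types.
    return {word: sorted(cands) for word, cands in samples.items()}


def save_samples(samples, path):
    """Write the collected samples to a pickle file (assumed layout)."""
    with open(path, "wb") as f:
        pickle.dump(samples, f)


def load_samples(path):
    """Read the samples back, e.g. before handing them to a reranker."""
    with open(path, "rb") as f:
        return pickle.load(f)


if __name__ == "__main__":
    samples = collect_samples(["untestable", "cats"], n_runs=10)
    path = os.path.join(tempfile.mkdtemp(), "samples.pkl")
    save_samples(samples, path)
    print(load_samples(path))
```

Keeping only distinct candidates per word is one possible choice; if the reranker needs sample frequencies, a `collections.Counter` per word would preserve them.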