Sentence embedding


Sentence embedding is the collective name for a set of techniques in natural language processing where sentences are mapped to vectors of real numbers

Application

Sentence embedding is used by the deep learning software libraries PyTorch and TensorFlow

Evaluation

A way of testing sentence encodings is to apply them on Sentences Involving Compositional Knowledge corpus
for both entailment and relatedness.
In the best results are obtained using a BiLSTM network trained on the . The Pearson correlation coefficient for SICK-R is 0.885 and the result for SICK-E is 86.3. A slight improvement over previous scores is presented in : SICK-R: 0.888 and SICK-E: 87.8 using a concatenation of bidirectional Gated recurrent unit.