Published March 14, 2017
| Version v1
Dataset
Restricted
Word embeddings obtained from decow14ax - 300 dimensional
Description
Word vectors trained using GloVe for the most frequent 1000000 tokens in the decow14ax corpus. See associated paper (Dima, 2015) for description of the training parameters.
word embeddings - distributional representations - word vectors
GloVe is an unsupervised learning algorithm for obtaining vector representations for words. Training is performed on aggregated global word-word co-occurrence statistics from a corpus, and the resulting representations showcase interesting linear substructures of the word vector space.
Other (English)
Research carried out in work package A03 of the SFB 833.
Files
Additional details
Related works
- Is described by
- Text: https://aclweb.org/anthology/D/D15/D15-1188.pdf (URL)