Early Slavic language models

Nilo Pedrazzini

doi:10.5281/zenodo.8414137

Word embeddings trained on the lemmatised TOROT Treebank, using Word2Vec and the following parameters:<o:p></o:p>sg = True min_count = <1,3,5> window = <3,5> vector_size = <100,200,300> epochs = 5<o:p></o:p>One model was trained for each combination of the parameters enclosed in angled brackets (< >). <o:p></o:p>The release contains both the full models (.model) and the plain vector files (_vectors.txt). The models are named according to the parameters they were trained with.<o:p></o:p>Note that these are the result of very preliminary experiments and no systematic evaluation of their quality was carried out, so use with caution.<o:p></o:p>

Early Slavic language models

Abstract

Files and links (1)

Metrics

Details

Early Slavic language models

Abstract

Files and links (1)

Metrics

Details

The Alan Turing Institute Social media