Skip to content

AcademiaSinicaNLPLab/word2vec

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

10 Commits
 
 
 
 
 
 
 
 

Repository files navigation

Transform google-pretrained (C format) to gensim format (python)

  1. Download pretrained file (Reference)
  2. Run frombin2gensim.py
./frombin2gensim.py # will generate google_word2vec_pretrained

Train word embedding from raw corpus

  1. Run train_from_corpus.py
./train_from_corpus.py corpus model_output 

Use word embedding in your project

from gensim.models import Word2Vec as W2V
model_path = 'path/to/embedding_model'
w2v = W2V.load(model_path)

About

No description, website, or topics provided.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages