Questions tagged [gensim]
Gensim is a Python library for topic modelling. Its word2vec and doc2vec models compute multi-dimensional vector representations of words or sentences that preserve semantic meaning.
97 questions
1 vote
1 answer
41 views
Predict next career suggestion
I have a dataset of jobs and their descriptions. I want to build a model that can predict what a user needs to improve when the user inputs his skills. For example, if he has skills - ...
3 votes
2 answers
1k views
Gensim doc2vec error: KeyError: "word 'senseless' not in vocabulary"
I am new to machine learning and tried doc2vec on the Quora duplicate dataset. new_dfx has columns 'question1' and 'question2', which hold preprocessed questions in each row. Following is the tagged ...
1 vote
0 answers
558 views
How can I decide the threshold value for relevance score in a search problem?
I am using LSA/TF-IDF/BM25/ensemble models for text search and finally calculating a similarity score to rank my search results. I would like to decide a threshold value for the score, below which I would not ...
1 vote
1 answer
23 views
Recommend products based on historical queries of other users
Given the user data as in the following: ...
2 votes
2 answers
7k views
How to calculate the mean of word embeddings and then compare strings using sklearn.metrics.pairwise
I am totally new to this topic, so I have been confused and stuck on this code for a while, and I am not sure how to solve it correctly. My goal is to write a short text embedding using vector ...
0 votes
1 answer
321 views
Training fasttext on your own corpus
I want to train fasttext on my own corpus. However, I have a small question before continuing. Do I need each sentence as a separate item in the corpus, or can I have many sentences as one item? For ...
1 vote
0 answers
66 views
Default estimation method of gensim's word2vec skipgram?
I am now trying to use word2vec by estimating skipgram embeddings via NCE (noise contrastive estimation) rather than the conventional negative sampling method, as a recent paper did (https://asistdl....
2 votes
1 answer
4k views
Fine-tuning pre-trained Word2Vec model with Gensim 4.0
With Gensim < 4.0, we can retrain a word2vec model using the following code: ...