Skip to main content

Questions tagged [gensim]

gensim is the python library for topic modelling. multi-dimensional vector representation of words or sentences which preserves semantic meaning is computed through word2vec and doc2vec models.

1 vote
1 answer
41 views

I have a dataset having job and description. i want to make model which can predict what are the thing that user needs to improve when the user inputs his skills. For an example, If he has skills - ...
pycoder's user avatar
  • 11
3 votes
2 answers
1k views

I am new to machine learning and tried doc2vec on quora duplicate dataset. new_dfx has columns 'question1' and 'question2' which has preprocessed questions in each row. Following is the tagged ...
Ankit Rohilla's user avatar
1 vote
0 answers
558 views

I am using a LSA/TF-IDF/BM25/Ensemble models for text search and finally calculating similarity score to rank my search. I would like to decide a threshold value for the score, below which I would not ...
Prateek Coder's user avatar
1 vote
1 answer
23 views

Given the user data as in the following: ...
william007's user avatar
2 votes
2 answers
7k views

I am totally new to this topic, that's why I am so confused or stuck in this code for a while, but I am not sure how to solve it correctly. My goal is to write a short text embedding using vector ...
test's user avatar
  • 21
0 votes
1 answer
321 views

I want to train fasttext on my own corpus. However, I have a small question before continuing. Do I need each sentences as a different item in corpus or can I have many sentences as one item? For ...
BlueMango's user avatar
  • 113
1 vote
0 answers
66 views

I am now trying to use word2vec by estimating skipgram embeddings via NCE (noise contrastive estimation) rather than conventional negative sampling method, as a recent paper did (https://asistdl....
krcoder's user avatar
  • 11
2 votes
1 answer
4k views

With Gensim < 4.0, we can retrain a word2vec model using the following code: ...
NST's user avatar
  • 51

15 30 50 per page
1
2 3 4 5
7