$\begingroup$

I want to make a classification model for 3 classes. I have 2 sentences for each observation: first I apply a CNN layer to each sentence, then I add a dense layer.

inputs = Input(shape=(2, n_timesteps))
embedding_inputs = embedding_layer(inputs)
# split the two sentences
sentence1 = Lambda(lambda x: x[:, 0, :, :])(embedding_inputs)
sentence2 = Lambda(lambda x: x[:, 1, :, :])(embedding_inputs)
# one CNN branch per sentence
conv_sentence1 = Conv1D(filters=64, kernel_size=3, activation='relu')(sentence1)
conv_sentence2 = Conv1D(filters=64, kernel_size=3, activation='relu')(sentence2)
pooling_sentence1 = MaxPooling1D(pool_size=2)(conv_sentence1)
pooling_sentence2 = MaxPooling1D(pool_size=2)(conv_sentence2)
flat_sentence1 = Flatten()(pooling_sentence1)
flat_sentence2 = Flatten()(pooling_sentence2)
concat_sentences = concatenate([flat_sentence1, flat_sentence2])
dense_layer = Dense(50)(concat_sentences)
dense_prediction = Dense(3, activation='softmax')(dense_layer)

But I get early overfitting, so I think the problem comes from "sentence 2". Each observation has a unique "sentence 1", whereas the same "sentence 2" can appear in several observations, in which case the neural network relies too strongly on it. So I want to combine the two sentences and apply a single CNN layer, which is why I asked how to obtain a similarity vector.
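One way to get such a similarity vector, rather than a scalar score, is an InferSent-style matching vector built from element-wise operations. A minimal numpy sketch of the idea (the function name is illustrative; the same operations exist as Keras layers such as Subtract, Multiply and concatenate):

```python
import numpy as np

# Sketch: build a "matching" vector [u; v; |u - v|; u * v] from two
# sentence vectors, as done in InferSent-style models. A Dense layer
# (or a further CNN) can then consume this single combined vector.
def matching_vector(u, v):
    return np.concatenate([u, v, np.abs(u - v), u * v])

u = np.array([1.0, 2.0, 3.0])
v = np.array([1.0, 0.0, 3.0])
m = matching_vector(u, v)
# m has length 4 * len(u); its |u - v| block is [0., 2., 0.]
```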

Thanks!

$\endgroup$
  • $\begingroup$ If it is indeed a vector that you want, then why not take the difference between the two vectors? Summing the difference vector might not be a good idea, though, since you do not know what each dimension in the vector stands for. $\endgroup$ Commented Dec 19, 2019 at 14:16
  • $\begingroup$ Thanks @AtifHassan, but I want a vector resulting from the 2 vectors, in order to use it as input to a neural network (CNN). $\endgroup$ Commented Dec 19, 2019 at 14:39
  • $\begingroup$ @joe_mind What you're asking is very uncommon, so you should probably explain what task you are trying to achieve and why you want to do it this way. That way, people can tell you how the task itself is usually done. $\endgroup$ Commented Dec 19, 2019 at 17:02

2 Answers

$\begingroup$

If I understand your question correctly, you have 2 sentences, you converted those sentences into 2 vectors, and you want to know how similar those sentences are. If this is the case, use cosine similarity between the 2 sentences. Note that cosine similarity is a scalar, not a vector: it is the normalized dot product of your 2 embedding vectors.

from numpy import dot
from numpy.linalg import norm

cos_sim = dot(a, b) / (norm(a) * norm(b))

where a and b are your vectors.

There is a ready-made function as well:

from sklearn.metrics.pairwise import cosine_similarity

cosine_similarity(vector1, vector2)
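Note that sklearn's `cosine_similarity` works on 2-D arrays of shape (n_samples, n_features) and returns a matrix of pairwise similarities, so a single pair of 1-D vectors needs reshaping first. A small sketch with made-up vectors:

```python
import numpy as np
from sklearn.metrics.pairwise import cosine_similarity

a = np.array([1.0, 2.0, 3.0])
b = np.array([2.0, 4.0, 6.0])

# cosine_similarity expects 2-D arrays of shape (n_samples, n_features),
# so 1-D vectors must be reshaped; the result here is a 1x1 matrix
sim = cosine_similarity(a.reshape(1, -1), b.reshape(1, -1))[0, 0]
# b is a scaled copy of a, so sim is (numerically) 1.0
```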
$\endgroup$
  • $\begingroup$ Thank you for your reply. In fact I want to get a new vector that represents the similarity, not a scalar value like cosine. $\endgroup$ Commented Dec 19, 2019 at 12:17
  • $\begingroup$ A similarity score explains how similar two words are. For example, if you have "software engineer" as one word and "software architect" as another, the similarity lies in the fact that these two are close to each other in the vector space. It's like the correlation coefficient between 2 columns. So similarity can only be a scalar. $\endgroup$ Commented Dec 19, 2019 at 12:36
  • $\begingroup$ Thanks, that is clear, but I want a vector resulting from the two vectors in order to use it in a neural network. $\endgroup$ Commented Dec 19, 2019 at 14:37
  • $\begingroup$ Can you tell me how you want to use it in a neural network? Any sample code, please? $\endgroup$ Commented Dec 19, 2019 at 15:39
$\begingroup$

If you have two vectors, you can get a third, similarity vector by subtracting the corresponding values, e.g. VectorC = VectorA - VectorB.

If every entry of VectorC is zero (equivalently, norm(VectorC) == 0), then VectorA and VectorB are identical. (Checking sum(VectorC) == 0 is not reliable, since positive and negative entries can cancel out.) This is a way to get a similarity vector; otherwise you will have similarity as a single float value.
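To make the identity check concrete, a small numpy sketch (the vectors are made up):

```python
import numpy as np

a = np.array([0.2, -0.1, 0.7])
b = np.array([0.2, -0.1, 0.7])

c = a - b                       # the element-wise "similarity vector"
# Check identity via the norm rather than a plain sum, since positive
# and negative entries of c could cancel each other out in a sum.
identical = bool(np.linalg.norm(c) == 0)
```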

$\endgroup$
  • $\begingroup$ Thanks shaid hamdam, maybe it's a solution, but I'm still looking for a more credible solution. $\endgroup$ Commented Dec 20, 2019 at 13:54
