Questions tagged [similarity]
17 questions
14
votes
3 answers
How do I compute the structural similarity between sentences?
I am working on a problem where I need to determine whether two sentences are similar or not. I implemented a solution using BM25 algorithm and wordnet synsets for determining syntactic & semantic similarity. The solution is working adequately,…

Shubham Tiwari
- 243
- 1
- 2
- 6
4
votes
1 answer
Is music/sound similarity comparison feasible on neural networks?
I wonder on the following concept:
A given neural network gets two audio input (preferably music) and gives a real number between 0 and 1 which describes "similarity" between the second and the first track.
As far as my understanding of neural…

Zoltán Schmidt
- 623
- 7
- 14
3
votes
3 answers
Why are Siamese Neural Networks used instead of a single neural network?
Siamese Neural Networks are a type of neural network used to compare two instances and infer if they belong to the same object. They are composed by two parallel identical neural networks, whose output is a vector of features. This vector of…

IgnacioGaBo
- 63
- 6
3
votes
2 answers
Levenshtein Distance between each word in a given string
From Calculate Levenshtein distance between two strings in Python it is possible to calculate distance and similarity between two given strings(sentences).
And from Levenshtein Distance and Text Similarity in Python to return the matrix for each…

Pluviophile
- 1,223
- 5
- 17
- 37
2
votes
2 answers
How can we compare, in terms of similarity, two pieces of text?
How can we compare, in terms of similarity (and/or meaning), two pieces of text (or documents)?
For example, let's say that I want to determine whether a document is a plagiarized version of another document. Which approach should I use? Could I use…

cuong tran
- 33
- 1
- 5
2
votes
1 answer
Can CNNs detect image similarity?
I have been running some experiments to see whether a CNN can detect whether two images are the same. However, I can't seem to make it work. I am wondering whether CNNs are not able to do what I am trying, or whether I am making mistakes.
This is…

Bart
- 21
- 1
2
votes
1 answer
Similarity score between 2 words using Pre-trained BERT using Pytorch
I'm trying to compare Glove, Fasttext, Bert on the basis of similarity between 2 words using Pre-trained Models. Glove and Fasttext had pre-trained models that could easily be used with gensim word2vec in python.
Does BERT have any such models?
Is…

Pluviophile
- 1,223
- 5
- 17
- 37
1
vote
1 answer
Is there any metric for calculating how natural a single image is given a dataset of the same class images?
Suppose there is a dataset $D$ of images. We have enough number $n$ of images in the dataset and all the images are of a single class.
Suppose I generated a new image $I$, which is not present in the given dataset, of the same class using a…

hanugm
- 3,571
- 3
- 18
- 50
1
vote
0 answers
On learning to rank tasks. Could it be that the input of the Siamese network is a vector, or should it be exclusively raw text?
I'm developing a method to document and query representation as concept vectors (bag-of-concepts). I want to train a machine learning model on ranking (learning to rank a task). So I have document vector V1 and query vector V2. How should I use…

mike sam
- 11
- 2
1
vote
0 answers
How to estimate the convolutional representation of a graph from its similarity to other graph convolutional representation?
Suppose we have two graphs A and B disconnected to each other (let's say 2-hops each), within a larger graph. If the convolutional representation of graph A is known, is it possible to estimate the definitive convolutional representation of graph B…

sujeto1
- 129
- 2
1
vote
0 answers
Threshold selection for Siamese network hyper-parameter tuning
I'm interested in modeling a Siamese network for facial verification. I've already written a simple working model that inputs feature vectors generated from two CNNs with shared weights then outputs a similarity score (euclidean distance.)
Here is a…

Niccolo
- 11
- 2
1
vote
0 answers
String matching algorithm for product recognition
I'm building a web application that collects schema.org data from different webshops as Amazon, Shopify, etc. It collects data every 6h and shows the current and lowest price. It is used for monitoring products and buying at the lowest price.
My…

Mr.Code
- 111
- 2
0
votes
0 answers
How to evaluate output of text generation models?
Evaluation of a wide variety of natural language generation (NLG) tasks is difficult. For instance, for a question answering model, it is hard for a human to quantify how well the model has answered a particular question. Doing this at scale is even…

Greggs
- 101
0
votes
0 answers
How do you specify the dimension to search for similarity in CLIP image embeddings?
I have a question about CLIP semantic image search. When you have an image of a person e.g. a skinny person wearing red shirt, clip will search for you similarity in all dimensions including body shape, gender, shirt color, etc yielding me more…

off99555
- 325
- 3
- 12
0
votes
0 answers
How can you add data to BERT? Will 10-20 books added affect the word embeddings?
I will be using BERT to get word embeddings before performing cosine similarity analysis on my data. According to this paper the accuracy of word embeddings can be improved by updating the model with domain specific textbooks. They do not provide…

learner
- 13
- 3