I have come across a strange behavior in the similarity search.
I was looking at compounds similar to necrostatin and found that some of the results with quite high similarities (0.8) are not similar at all.
I compared the results with ChEMBL and these compounds were not in their results at all.
I took some of the compounds for which Open PHACTS had a Tanimoto similarity of 0.8 and compared them with a couple of different fingerprints. In all cases, the first three had similar values to each other, whereas the 4th ("strange") compound had a much lower value.
1st "strange" compound on slide:
2nd "strange" compound on slide:
Compounds in the table, top to bottom:
I understand the similarity search is done by the RSC engine, however, I am not sure this explains why these compounds should be similar. There are very few things similar about them and especially when you look at the other compounds with 0.8 similarity, it raises questions.