Word Similarity/Distance

Measuring the similarity or difference between two words based on their definitions (senses).
Useful in Information Retrieval (IR), Question Answering, Machine Translation, etc.

Measure	Description
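Path Similarity	-𝑙𝑜𝑔(𝑝𝑎𝑡ℎ𝑙𝑒𝑛(𝑐₁,𝑐₂)) # 𝑝𝑎𝑡ℎ𝑙𝑒𝑛(𝑐₁,𝑐₂) is the number of edges in the shortest path in the thesaurus graph between synsets 𝑐₁ and 𝑐₂
Resnik Similarity	-𝑙𝑜𝑔𝐏(𝐿𝐶𝑆(𝑐₁,𝑐₂)) # 𝐏(𝑐) is the probability of encountering an instance of concept 𝑐 in a corpus; 𝐿𝐶𝑆(𝑐₁,𝑐₂) is the lowest common subsumer, i.e. the most specific concept that is an ancestor of both 𝑐₁ and 𝑐₂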
Lin Similarity	[2·𝑙𝑜𝑔𝐏(𝐿𝐶𝑆(𝑐₁,𝑐₂))] / [𝑙𝑜𝑔𝐏(𝑐₁) + 𝑙𝑜𝑔𝐏(𝑐₂)]
Jiang-Conrath Similarity	1 / [2·𝑙𝑜𝑔𝐏(𝐿𝐶𝑆(𝑐₁,𝑐₂)) - (𝑙𝑜𝑔𝐏(𝑐₁) + 𝑙𝑜𝑔𝐏(𝑐₂))]
Lesk Similarity	𝛴_{𝑟,𝑞∊𝑅𝐸𝐿𝑆} [𝑜𝑣𝑒𝑟𝑙𝑎𝑝(𝑔𝑙𝑜𝑠𝑠(𝑟(𝑐₁)), 𝑔𝑙𝑜𝑠𝑠(𝑞(𝑐₂)))]
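
The path-based and information-content formulas in the table can be tried out on a small example. The following is a minimal sketch, not a reference implementation: the taxonomy `PARENT`, the corpus counts `COUNTS`, and all function names are hypothetical and made up for illustration. It assumes the thesaurus is a tree, so the shortest path between two concepts runs through their LCS, and it returns infinity where a formula would otherwise take the log of zero or divide by zero.

```python
import math

# Hypothetical toy taxonomy: concept -> parent (the root has parent None).
PARENT = {
    "entity": None,
    "animal": "entity",
    "plant": "entity",
    "dog": "animal",
    "cat": "animal",
    "tree": "plant",
}

# Made-up corpus counts of each concept's own occurrences.
COUNTS = {"entity": 0, "animal": 10, "plant": 5, "dog": 40, "cat": 30, "tree": 15}


def ancestors(c):
    """Return the chain [c, parent(c), ..., root]."""
    chain = []
    while c is not None:
        chain.append(c)
        c = PARENT[c]
    return chain


def lcs(c1, c2):
    """Lowest common subsumer: the most specific shared ancestor."""
    anc2 = set(ancestors(c2))
    return next(a for a in ancestors(c1) if a in anc2)


def pathlen(c1, c2):
    """Number of edges on the shortest path (through the LCS in a tree)."""
    common = lcs(c1, c2)
    return ancestors(c1).index(common) + ancestors(c2).index(common)


def prob(c):
    """P(c): occurrences of c or any concept it subsumes, over all occurrences."""
    total = sum(COUNTS.values())
    subsumed = sum(n for w, n in COUNTS.items() if c in ancestors(w))
    return subsumed / total


def path_sim(c1, c2):
    edges = pathlen(c1, c2)
    # -log(0) is undefined; treat identical concepts as maximally similar.
    return math.inf if edges == 0 else -math.log(edges)


def resnik_sim(c1, c2):
    return -math.log(prob(lcs(c1, c2)))


def lin_sim(c1, c2):
    # Undefined (0/0) if both concepts are the root; not handled in this sketch.
    return 2 * math.log(prob(lcs(c1, c2))) / (math.log(prob(c1)) + math.log(prob(c2)))


def jiang_conrath_sim(c1, c2):
    denom = 2 * math.log(prob(lcs(c1, c2))) - (math.log(prob(c1)) + math.log(prob(c2)))
    # The distance in the denominator is 0 for identical concepts.
    return math.inf if denom == 0 else 1 / denom


if __name__ == "__main__":
    for pair in [("dog", "cat"), ("dog", "tree")]:
        print(pair, path_sim(*pair), resnik_sim(*pair),
              lin_sim(*pair), jiang_conrath_sim(*pair))
```

In this toy data, "dog" and "cat" share the fairly specific LCS "animal", while "dog" and "tree" only meet at the root, whose probability is 1 and whose information content is therefore 0. All four measures rank the first pair as more similar than the second.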

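The Lesk row sums gloss overlaps over every pair of relations (𝑟, 𝑞) in 𝑅𝐸𝐿𝑆. Below is a rough sketch of that double sum under loud simplifications: the glosses, the `HYPERNYM` map, the stopword list, and the two relations in `RELS` are all invented for illustration, and `overlap` simply counts shared non-stopwords rather than scoring multi-word phrase matches.

```python
# Hypothetical stopword list; a real one would be much longer.
STOPWORDS = {"a", "an", "the", "of", "or", "and", "that", "is", "in", "with", "to"}

# Made-up glosses (dictionary definitions) for a handful of concepts.
GLOSS = {
    "dog": "a domesticated animal that barks",
    "cat": "a domesticated animal that purrs",
    "animal": "a living organism that moves and feeds",
    "tree": "a tall plant with a woody trunk",
    "plant": "a living organism that grows in soil",
}

# Made-up hypernym links between those concepts.
HYPERNYM = {"dog": "animal", "cat": "animal", "tree": "plant"}

# RELS: each relation maps a concept to a related concept (or None).
RELS = {
    "self": lambda c: c,
    "hypernym": lambda c: HYPERNYM.get(c),
}


def gloss_words(c):
    """Content words of the gloss of c; empty if the relation has no target."""
    if c is None:
        return set()
    return set(GLOSS[c].split()) - STOPWORDS


def overlap(words1, words2):
    """Simplified overlap: number of shared content words."""
    return len(words1 & words2)


def lesk_sim(c1, c2):
    """Sum the gloss overlap over every pair of relations (r, q) in RELS."""
    return sum(
        overlap(gloss_words(r(c1)), gloss_words(q(c2)))
        for r in RELS.values()
        for q in RELS.values()
    )


if __name__ == "__main__":
    print(lesk_sim("dog", "cat"))
    print(lesk_sim("dog", "tree"))
```

With these invented glosses, "dog" and "cat" overlap both in their own glosses and through their shared hypernym, so they score higher than "dog" and "tree", which overlap only through the glosses of their hypernyms.
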
/var/log marcus chiu