A Knowledge Based Word Sense Disambiguation in Telugu Language
Suneetha Eluri1, Vishala Siddu2
1Suneetha Eluri*, Professor, CSE Department, JNTUK, Andhra Pradesh, India.
2Vishala Siddu, Academics, CSE department, JNTUK, Andhra Pradesh, India.
Manuscript received on October 05, 2020. | Revised Manuscript received on October 26, 2020. | Manuscript published on October 30, 2020. | PP: 440-445 | Volume-10 Issue-1, October 2020. | Retrieval Number: 100.1/ijeat.A19110109119 | DOI: 10.35940/ijeat.A1911.1010120
Open Access | Ethics and Policies | Cite | Mendeley
© The Authors. Blue Eyes Intelligence Engineering and Sciences Publication (BEIESP). This is an open access article under the CC BY-NC-ND license (http://creativecommons.org/licenses/by-nc-nd/4.0/)
Abstract: Telugu ( ( ) is one of the Dravidian languages which are morphologically rich. As within the other languages, it too consists of ambiguous words/phrases which have one-of-a-kind meanings in special contexts. Such words are referred as polysemous words i.e. words having a couple of experiences. A Knowledge based approach is proposed for disambiguating Telugu polysemous phrases using the computational linguistics tool, IndoWordNet. The task of WSD (Word sense disambiguation) requires finding out the similarity among the target phrase and the nearby phrase. In this approach, the similarity is calculated either by means of locating out the range of similar phrases (intersection) between the glosses (definition) of the target and nearby words or by way of finding out the exact occurrence of the nearby phrase’s sense in the hierarchy (hypernyms/hyponyms) of the target phrase’s senses. The above parameters are changed by using the intersection use of not simplest the glosses but also by using which include the related words. Additionally, it is a third parameter ‘distance’ which measures the distance among the target and nearby phrases. The proposed method makes use of greater parameters for calculating similarity. It scores the senses based on the general impact of parameters i.e. intersection, hierarchy and distance, after which chooses the sense with the best score. The correct meaning of Telugu polysemous phrase could be identified with this technique.
Keywords: Natural Language Processing (NLP), Polysemous, IndoWordNet, Word Sense Disambiguation (WSD), Intersection, Hierarchy, Senses, Distance measure.