Root Based Stemmer for Telugu Script
Dr. Narla Swapna, Department of CSE, CMR college of Engineering & Technology Hyderabad, India.
Manuscript received on July 20, 2019. | Revised Manuscript received on August 10, 2019. | Manuscript published on August 30, 2019. | PP: 2565-2569 | Volume-8 Issue-6, August 2019. | Retrieval Number: F8734088619/2019©BEIESP | DOI: 10.35940/ijeat.F8734.088619
Open Access | Ethics and Policies | Cite | Mendeley
© The Authors. Blue Eyes Intelligence Engineering and Sciences Publication (BEIESP). This is an open access article under the CC BY-NC-ND license (http://creativecommons.org/licenses/by-nc-nd/4.0/)
Abstract: In this paper, a new stemmer has been proposed named as “Root based stemmer”. This stemmer is strictly based on Dravidian script. Stemming can be used to pick up the effectiveness of information retrieval. In proposed Root based stemming technique, each and every token is compared against with all the words of a valid root words dictionary until a match is found. Then extract the matched string or substring from a token and identified as valid root. The present work is aimed to build dictionary based stemmer to extract valid root words for Indian languages especially for Telugu and compare the results with existing stemmers.
Keywords: Stemming, Information Retrieval, Telugu.