An Efficient Clustering Based CBIR System using Hadoop to Analyze Massive MRI Images Dataset for Early Disease Diagnosis
Harinder Singh1, Kulvinder Singh Mann2

1Harinder Singh*, Research Scholar, IKGPTU, Kapurthala, Punjab, India.
2Kulvinder Singh Mann, Professor, Department of IT, GNDEC, Ludhiana, Punjab, India.
Manuscript received on September 17, 2019. | Revised Manuscript received on October 20, 2019. | Manuscript published on October 30, 2019. | PP: 270-278 | Volume-9 Issue-1, October 2019 | Retrieval Number: A1139109119/2019©BEIESP | DOI: 10.35940/ijeat.A1139.109119
Open Access | Ethics and Policies | Cite | Mendeley
© The Authors. Blue Eyes Intelligence Engineering and Sciences Publication (BEIESP). This is an open access article under the CC BY-NC-ND license (http://creativecommons.org/licenses/by-nc-nd/4.0/)

Abstract: In past decade, the use of medical imaging for disease diagnosis is increased rapidly. The medical images provide useful information about the anatomy of patients. The medical images are used not only to assist the doctors for diagnosis purpose, but also used in Research & Development for deeper insights and better understanding into cause and cure of numerous diseases. To retrieve medical images from large scale repositories in real time, there is urgent need of an efficient medical image retrieval system. For this purpose, an efficient clustering based content based image retrieval (CBIR) system using Hadoop is proposed to analyze massive magnetic resonance imaging (MRI) images dataset for early disease diagnosis. The proposed CBIR system uses Hadoop platform, local mesh peak valley edge patterns (LMePVEP) for feature extraction, MapReduce based parallel k-means algorithm for clustering and euclidean distance to measure similarity. The method proposed is tested and compared with state-of-the-art CBIR methods on massive MRI images dataset. The experimental results obtained show that the method proposed in our work outperforms traditional CBIR methods in terms of average retrieval time and mean average precision for massive MRI images dataset.
Keywords: Content based image retrieval (CBIR), Hadoop, magnetic resonance imaging (MRI) images dataset, local mesh peak valley edge patterns (LMePVEP), MapReduce based parallel k-means algorithm.