Frequent Subgraph Mining by Giraph Distributed System
Sadhana Priyadarshini1, Sireesha Rodda2

1 Ms. Sadhana Priyadarshini*, PhD Scholar, Department of Computer Science and Engineering, GITAM (Deemed to be University), Visakhapatnam, Andhra Pradesh, India.
2 Dr. Sireesha Rodda, Professor, Department of Computer Science & Engineering, GITAM (Deemed to be University), Visakhapatnam, Andhra Pradesh, India.

Manuscript received on June 08, 2020. | Revised Manuscript received on June 25, 2020. | Manuscript published on June 30, 2020. | PP: 1267-1275 | Volume-9 Issue-5, June 2020. | Retrieval Number: E1128069520/2020©BEIESP | DOI: 10.35940/ijeat.E1128.069520
© The Authors. Blue Eyes Intelligence Engineering and Sciences Publication (BEIESP). This is an open access article under the CC BY-NC-ND license.

Abstract: To manage the rapid growth of social graphs, several large-scale distributed graph mining systems have been developed, such as Pregel, Giraph, Hama, GraphLab, and PowerGraph. The approach common to all of these systems is to divide the entire graph dataset into smaller partitions and to apply the “think like a vertex” programming model, which supports iterative graph computation. In this paper, we implement an optimized frequent subgraph mining algorithm in the Giraph framework and compare it with other existing distributed systems. To improve the flexibility and performance of the proposed method, we apply several optimization techniques and adjust different run-time limits. We also investigate how performance can be improved by the Giraph distributed system, which plays a vital role in processing social graphs such as those of LinkedIn, Twitter, and Facebook. Graph input, graph output, cluster setup, and hardware configuration all play vital roles in optimizing the performance of the proposed algorithm.
Keywords: Frequent subgraph mining, Graph Distribution, minimum support, ZooKeeper.
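
As a rough illustration of the “think like a vertex” model and the minimum support threshold named in the abstract, the following Python sketch counts single-edge patterns in a labeled graph using two simulated supersteps. It is not the paper's algorithm and does not use the Giraph API: the function name `frequent_edge_patterns`, the in-memory message inbox, and the definition of support as the number of edge occurrences in one graph are all assumptions made for this example.

```python
from collections import Counter, defaultdict


def frequent_edge_patterns(vertex_labels, edges, min_support):
    """Return single-edge label patterns whose count is >= min_support.

    Hypothetical sketch: support is assumed to be the number of edges
    in the graph that match the (unordered) pair of endpoint labels.
    """
    # Build an undirected adjacency structure from the edge list.
    adjacency = defaultdict(set)
    for u, v in edges:
        adjacency[u].add(v)
        adjacency[v].add(u)

    # Superstep 0: every vertex sends its label to each neighbour with a
    # larger id, so each undirected edge is reported exactly once.
    inbox = defaultdict(list)
    for vertex, label in vertex_labels.items():
        for neighbour in adjacency[vertex]:
            if neighbour > vertex:
                inbox[neighbour].append(label)

    # Superstep 1: every vertex pairs each received label with its own
    # label and contributes the resulting pattern to a global aggregator.
    aggregator = Counter()
    for vertex, messages in inbox.items():
        for sender_label in messages:
            pattern = tuple(sorted((sender_label, vertex_labels[vertex])))
            aggregator[pattern] += 1

    # Keep only the patterns that meet the minimum support threshold.
    return {p: c for p, c in aggregator.items() if c >= min_support}


labels = {1: "A", 2: "B", 3: "A", 4: "B", 5: "C"}
path_edges = [(1, 2), (2, 3), (3, 4), (4, 5)]
result = frequent_edge_patterns(labels, path_edges, min_support=2)
```

In a full frequent subgraph mining run, the surviving single-edge patterns would typically serve as seeds that later supersteps extend into larger candidate subgraphs; that extension step is omitted here.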