Coupling Factor and Cost Based Task Clustering Method to Optimize Task Clustering For Scientific Workflows in Cloud Environment
J. Jabanjalin Hilda¹, C. Srimathi²

¹ J. Jabanjalin Hilda, School of Computer Science and Engineering, Vellore Institute of Technology, Vellore – 632014, India
² C. Srimathi, School of Computer Science and Engineering, Vellore Institute of Technology, Vellore – 632014, India
Manuscript received on July 30, 2019. | Revised Manuscript received on August 25, 2019. | Manuscript published on August 30, 2019. | PP: 4136-4143 | Volume-8 Issue-6, August 2019. | Retrieval Number: F9288088619/2019©BEIESP | DOI: 10.35940/ijeat.F9288.088619
© The Authors. Blue Eyes Intelligence Engineering and Sciences Publication (BEIESP). This is an open access article under the CC BY-NC-ND license (http://creativecommons.org/licenses/by-nc-nd/4.0/)

Abstract: Scientific workflows are large-scale, loosely coupled applications used by computational scientists. They consist of many fine-grained tasks with dependencies between them. Task clustering is an optimization method that combines multiple tasks into a single job so that task execution time and system overhead are reduced, improving overall performance in a cloud environment. Although existing task clustering algorithms have significantly reduced system overhead, they do not adequately account for the dependencies among tasks. This work examines the task features by which tasks can be clustered and develops an efficient task clustering algorithm. Two task clustering strategies are proposed: Horizontal Coupling Factor (HCF) based clustering and Horizontal Processing Cost (HPC) based clustering. The proposed algorithms were evaluated on several real-world applications, and the experimental results show that the approach is best suited to data-intensive and compute-intensive applications. The results also show that the HCF and HPC task clustering strategies can significantly improve performance by reducing task execution time and inter-task communication delay.
Keywords: Coupling Factor, Clustering, Execution Time, Processing Cost, Scientific Workflows