Bi-objective CSO for Big Data Scientific Workflows scheduling in the Cloud: case of LIGO workflow

ICSOFT_2020_31_CR.pdf (240.1Kb)
Date
2020
Dewey
Programming, software, data organization
Subject
Scientific Workflow; Data intensive; Cat Swarm Optimization; Multi-objective Scheduling; LIGO
Conference name
15th International Conference on Software Technologies (ICSOFT 2020)
Conference date
07-2020
Conference city
Paris
Conference country
France
Book title
Proceedings of the 15th International Conference on Software Technologies - Volume 1: ICSOFT
Publisher
SciTePress
ISBN
978-989-758-443-5
URI
https://basepub.dauphine.fr/handle/123456789/20969
Collections
  • LAMSADE : Publications
Author
Bousselmin, K.
Ben Hamida, Sana
989 Laboratoire d'analyse et modélisation de systèmes pour l'aide à la décision [LAMSADE]
Rukoz, Marta
989 Laboratoire d'analyse et modélisation de systèmes pour l'aide à la décision [LAMSADE]
Type
Conference paper
Pages
615-624
Abstract (EN)
Scientific workflows are used to model scalable, portable, and reproducible big data analyses and scientific experiments with low development costs. To optimize their performance and use data resources efficiently, scientific workflows that handle large volumes of data need to be executed on scalable distributed environments such as Cloud infrastructure services. The problem of scheduling such workflows is known to be NP-complete. It aims to find an optimal mapping of tasks to computing resources and of data to storage resources that meets the end user's quality of service objectives, especially minimizing the overall makespan or the financial cost of the workflow. In this paper, we formulate the problem of scheduling big data scientific workflows as a bi-objective optimization problem that aims to minimize both the makespan and the cost of the workflow. The formulated problem is then solved using our proposed Bi-Objective Cat Swarm Optimization algorithm (BiO-CSO), which is an extension of the bio-inspired algorithm CSO. The extension consists of adapting the algorithm to solve multi-objective discrete optimization problems. Our application case is the LIGO Inspiral workflow, which is a CPU- and data-intensive workflow used to generate and analyze gravitational waveforms from data collected during the coalescing of compact binary systems. The performance of the proposed method is then compared to that of the multi-objective Particle Swarm Optimization (PSO), which has proven to be effective for scientific workflow scheduling. The experimental results show that our algorithm BiO-CSO performs better than the multi-objective PSO since it provides more and better final scheduling solutions.
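
To illustrate what minimizing both makespan and cost can mean in practice, here is a minimal sketch of Pareto dominance between candidate schedules. It is not taken from the paper: the abstract does not state how BiO-CSO compares solutions, so the use of Pareto dominance is an assumption, and the names `dominates` and `pareto_front` and the sample (makespan, cost) values are hypothetical.

```python
from typing import List, Tuple

# Each candidate schedule is summarized by (makespan, cost); both are minimized.
# These tuples are made-up illustration values, not results from the paper.
Objectives = Tuple[float, float]


def dominates(a: Objectives, b: Objectives) -> bool:
    """Return True if a is no worse than b on both objectives
    and strictly better on at least one."""
    no_worse = a[0] <= b[0] and a[1] <= b[1]
    strictly_better = a[0] < b[0] or a[1] < b[1]
    return no_worse and strictly_better


def pareto_front(points: List[Objectives]) -> List[Objectives]:
    """Keep only the points that no other point dominates (input order preserved)."""
    return [
        p for p in points
        if not any(dominates(q, p) for q in points if q != p)
    ]


candidates = [(100.0, 8.0), (120.0, 5.0), (130.0, 9.0), (90.0, 12.0), (120.0, 6.0)]
front = pareto_front(candidates)
print(front)
```

Under this assumption, a scheduler returns a set of non-dominated trade-offs rather than a single schedule, which is one way to read the abstract's phrase "more and better final scheduling solutions".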


 Content on this site is licensed under a Creative Commons 2.0 France (CC BY-NC-ND 2.0) license.