WORKSHOPS

Day 3: Wednesday 04 September 2024

Parallel Schedule 1

Workshop 1: Data Science on HPC platforms

Abstract:

Abstract: Methods in Machine and Deep Learning are being investigated to either augment or replace traditional simulation. Contemporary trends in hardware and software have enabled convergence of HPC and AI. In this workshop we survey these trends and explore what is available on system level and in software to ease the transition to developing data science workloads. The talks include a walk through the current hardware and system resources available to accelerate the Data Science workflows. It is followed by the cataloging of software tools and frameworks available to develop ML/DL models, accelerate their training process and handle the associated data requirements at scale. Attendees are assumed to have no prior experience with ML/DL workloads.

“Data Science on HPC platforms “

Time: 09:00 – 9:40 am

Dr. Saber Feki

Senior Computational Scientist Lead – KAUST

Biography:

Dr. Saber Feki leads the computational and data science and engineering at the KAUST Supercomputing Core Laboratory, providing support, training, advanced services and research collaborations with users of the leadership supercomputer Shaheen III and a heterogeneous cluster “Ibex” with over 600 GPUs. Saber is passionate about technology, and enjoys working with users and technology vendors to plan and execute refreshes to KAUST HPC and AI infrastructure with the latest hardware and software technologies. He is leveraging his expertise to support and consult for several similar deployments for local and regional. Saber received his MSc and Ph.D. degrees in computer science from the University of Houston in 2008 and 2010, respectively. He then joined the oil and gas company TOTAL Energies in 2011 as an HPC Research Scientist and has been working at KAUST since 2012.

Topic:

“Trends in HPC architecture enabling accelerated Data Science”

Abstract:

KAUST Supercomputing Lab supports a significant group of users using our machines for AI workloads. Based on our experience, we catalog the current and future trends of hardware that primarily hosts Data Science workloads in the present day. We survey the computer architectures designed to accelerate both HPC and ML/DL workloads and the interconnect requirements and available options to scale out these workloads. We also discuss the workload and data management characteristics for AI-related workloads and available options to run them with fairness in a multi-user environment. We then survey the solutions, either bespoke or turnkey, that are efficiently addressing the needs of AI workloads.

Dr. Mohsin Ahmed Shaikh

Computational Scientist – KAUST

Biography:

Dr. Mohsin Ahmed Shaikh is a Computational Scientist at KAUST Supercomputing Lab (KSL). He has over 10 years of experience in designing, developing and supporting large scale HPC applications. He holds a PhD in Computational Bioengineering and a Post Doc, both from University of Canterbury, New Zealand. He worked previously at the Pawsey Supercomputing Centre as Supercomputing Applications Specialist before joining KAUST. As part of KSL’s Application Support team, he provides support to users of Shaheen Supercomputer and GPGPUs in Ibex cluster. His recent interests include compassable HPC infrastructure and workflows to accelerate ML/DL workloads on KAUST Supercomputing Lab resource and in cloud

Time: 09:40 – 10:10 am

Topic:

“Parallel and distributed ML/DL on HPC platforms”

Abstract:

Training ML and DL models can be a computationally expensive task to perform. If done repeatedly, it can slow down the overall scientific investigation. There exist a range of libraries today which can enable parallelism in training of ML/DL models. In this talk, we present a walkthrough of the life cycle of a typical ML/DL model development and survey the trends and techniques used to accelerating the model pre-training/finetuning on HPC resources.

Dr. Mohsin Ahmed Shaikh

Computational Scientist – KAUST

Time: 10:20 – 11:50 am

Topic:

“Hands on workshop on training a DL model in parallel”

Abstract:

In this hands-on session we will train a model on a given dataset and find good enough hyperparameters to improve its performance. As a next step participants will accelerate the pre-training of the model on a larger dataset to reduce the time to a good-enough fit.

Workshop 2:

“Hands-on AI Tools and Techniques”

Time: 12:50 – 02:50 pm

Dr. Abdelghafour HALIMI

Facilities Director of Research Computing Core Labs – KAUST

Dr. Didier Barradas BauCsta

Visualization Scientist– KAUST

Dr. Sohaib Ghani

Lead Research Scientist – KAUST

Topic:

“Hands-on AI Tools and Techniques”

Abstract:

This workshop is designed to provide an integrated introduction to the realms of Machine Learning (ML) and Deep Learning (DL), two critical subsets of Artificial Intelligence (AI). Without delving into overwhelming technical complexities, attendees will gain a foundational understanding of machine learning and deep learning models. Utilizing the top-tier resources of the KAUST Visualization Core Lab, attendees will experience a mix of indepth lectures and hands-on sessions, ensuring a foundational grasp of the subject. This workshop is ideal for those looking to embark on a journey into AI, as well as for professionals seeking to enhance their knowledge in this rapidly evolving domain.

Parallel Schedule 2

Workshop 5:

Workshop 3: HPC in the Cloud: Unleash Exascale Power & Future-Proof Your Work

Time: 09:00 – 9:40 am

Mr. Abdulrahman Alkhamees

Principal Customer Engineer - CNTXT

Biography:

Abdulrahman Alkhamees currently holds the position of Principal Customer Engineer at CNTXT, where he brings over a decade of experience gained from his previous roles at Salam, STC Solutions, DDN Storage, and Udacity. Throughout his career, Abdulrahman has worked in a variety of roles including HPC, support, consultancy, marketing, engineering, and presales. With his extensive background, he has developed a wealth of knowledge and expertise in the tech industry. Looking towards the future, Abdulrahman has recently shifted his focus to assisting customers with HPC on cloud migration. Drawing upon his years of experience, he is well-equipped to guide and support organizations as they navigate this critical aspect of their digital transformation.

topic:

Topic: HPC in the Cloud: Unleash Exascale Power & Future-Proof Your Work.

Abstract:

Is your current infrastructure bottlenecking your groundbreaking High-Performance Computing (HPC) projects? This talk empowers you to break free with a strategic migration plan to the cloud, unlocking the potential of exascale computing. We’ll guide you through the essential steps to take your HPC workloads to the cloud, including Cloud Migration Strategies: Explore proven methods for a smooth transition of your HPC environment. Building for the Cloud: Discover best practices for optimizing your HPC applications for maximum performance in the cloud. The Future of HPC: Learn about the multi-cloud landscape and the exciting career and professional opportunities that await in this rapidly evolving field.

Time: 09:40 – 10:20 am

Mr. Amr Mohamed Nasr Eldin,

Principal HPC Software Engineer, BrightSkies

Biography:

Amr Elsayed is the team Lead for the Advanced Computing and AI team under the HPC department at Brightskies. With a solid foundation in computer engineering from Alexandria University, Amr has cultivated extensive expertise in high-performance computing and artificial intelligence over 6 years. Throughout his career, Amr has led his team in pioneering HPC and AI projects, achieving significant milestones in fields such as healthcare and industrial applications. His leadership has driven the successful deployment of advanced AI solutions, including general solutions, segmentation solutions, chatbot and generative AI systems tailored to various industries. Amr’s work and experience is deeply rooted in software consultancy, where he has provided critical support across diverse hardware platforms for various HPC applications, particularly in sectors like oil and gas. His contributions have consistently enhanced the efficiency and performance of complex workflows, underscoring his commitment to innovation and excellence in HPC and AI.

topic:

Workshop Title to be given”

Abstract: