llview - RISC2 Project https://www.risc2-project.eu Tue, 31 Jan 2023 13:40:39 +0000 en-US hourly 1 https://wordpress.org/?v=6.6.2 RISC2 Webinar Series | HPC System & Tools https://www.risc2-project.eu/2022/07/26/1st-webinar-series-hpc-system-tools/ Tue, 26 Jul 2022 12:54:46 +0000 https://www.risc2-project.eu/?p=2251 The RISC2 project is organizing a webinar series about “HPC System & Tools”. The webinars will be happening until the end of the first semester of 2023, starting on August 24, 2022. In each webinar, it will be presented the state-of-the-art in methods and tools for setting-up and maintaining HPC hardware and software infrastructures. The duration […]

The post RISC2 Webinar Series | HPC System & Tools first appeared on RISC2 Project.

]]>
The RISC2 project is organizing a webinar series about “HPC System & Tools”. The webinars will be happening until the end of the first semester of 2023, starting on August 24, 2022.

In each webinar, it will be presented the state-of-the-art in methods and tools for setting-up and maintaining HPC hardware and software infrastructures. The duration of each talk will be around 30-40 minutes, followed by a 10–15-minute moderated discussion with the audience.

The first 4 webinars are already scheduled:

 

 

 

The post RISC2 Webinar Series | HPC System & Tools first appeared on RISC2 Project.

]]>
Webinar: HPC system and job monitoring with LLview https://www.risc2-project.eu/events/webinar-4-hpc-system-and-job-monitoring-with-llview/ Tue, 26 Jul 2022 12:39:25 +0000 https://www.risc2-project.eu/?post_type=mec-events&p=2245 Date: December 7, 2022 | 4 p.m. (UTC) Speakers: Vitor Silva and Filipe Guimarães, Jülich Supercomputer Centre Moderator: Esteban Mocskos, Universidad de Buenos Aires Check the speakers’ presentation slides here.  LLview is a monitoring infrastructure developed by the Jülich Supercomputing Centre with the objective to provide an easy to use and adaptable software suite for monitoring High Performance […]

The post Webinar: HPC system and job monitoring with LLview first appeared on RISC2 Project.

]]>

Date: December 7, 2022 | 4 p.m. (UTC)

Speakers: Vitor Silva and Filipe Guimarães, Jülich Supercomputer Centre

Moderator: Esteban Mocskos, Universidad de Buenos Aires

Check the speakers’ presentation slides here. 

LLview is a monitoring infrastructure developed by the Jülich Supercomputing Centre with the objective to provide an easy to use and adaptable software suite for monitoring High Performance Computing systems. With the emergence of large heterogeneous machines, in the range of Exascale, the challenges of monitoring such huge systems increase significantly. To address that, LLview is under continuous development in order to work for a wide range of hardware systems and software interfaces with negligible overhead and at the same time providing fast, reliable access to job reports, system-wide monitoring data, and real-time system information. That information is provided to system users, project advisors, support teams and system administrators, helping the managing of jobs, identification of performance issues at many levels and also helping the system administrators to find failures and system malfunctions. This webinar gives an overview of the different LLview components and their interaction with each other and the system. Moreover, particular attention is drawn to the system monitoring views and the job reporting features, as they allow to trace the entire life cycle of a job and can help identify problems and bottlenecks at a very early stage.

 

About the Speakers:

Vitor Silva received his Computer Science degree from Universiade Federal de Minas Gerais. His M.Sc was earned in Systems and Computer Engineering from Universidade Federal do Rio de Janeiro and later received his Ph.D from Universidade Federal de Minas Gerais, this time in Nuclear Engineering. He worked as software developer in the digital image processing field, but most of his career was in the Nuclear Engineering field, mainly working with computer modeling and solving Neutronics and Thermal-hydraulics problems related to nuclear reactors. He was also the main admin of a small cluster system installed from scratch. Since 2021 he has been working at the Jülich Supercomputing Centre with monitoring tools and simulation.

Filipe Guimarães is a computational physicist. Graduated in Physics, M.Sc in Physics and Ph.D in Physics from the Universidade Federal Fluminense. He has been working with High Performance Computing since 2014 – initially from a user’s side, but moved to the support side in 2020. Since then, one of his focuses was to improve monitoring tools used and developed at the Jülich Supercomputing Centre.

About the Moderator: Esteban Mocskos is a full-time professor at Universidad de Buenos Aires (UBA) and researcher at the Center for Computer Simulation (CSC-CONICET). He received his Ph.D. in Computer Science from UBA in 2008 and was postdoc at the Protein Modelling group at UBA. His research interests include distributed systems & blockchain, computer networks, processor architecture, and parallel programming. He is part of the steering committee of the Latin-American HPC CARLA conference and onE of the committee members of Argentina’s National HPC system.

The post Webinar: HPC system and job monitoring with LLview first appeared on RISC2 Project.

]]>