Lustre User Group 2024: Difference between revisions

From Lustre Wiki
Jump to navigation Jump to search
(→‎Tuesday, May 7 – Day 1: add links to videos)
(→‎LUG 2023 Agenda: link to final few videos)
 
(One intermediate revision by the same user not shown)
Line 25: Line 25:
|-
|-
|''' 11:00-11:30'''
|''' 11:00-11:30'''
|'''[[Media:LUG2024-Scalable_Auto_Tiering-Jabas.pdf|Scalable Auto-Tiering]]''' ([https://www.youtube.com/watch?v=S-Gz5nGmjZ8&list=PLA5dHg1_l3V-W9mhbSGC9cx7KzJW_YCMF&index=3&pp=iAQB video])
|'''[[Media:LUG2024-Scalable_Auto_Tiering-Jabas.pdf|Scalable Auto-Tiering]]''' ([https://www.youtube.com/watch?v=S-Gz5nGmjZ8&list=PLA5dHg1_l3V-W9mhbSGC9cx7KzJW_YCMF&index=3&pp=iAQB video])
:  Tom Jabas, Hewlett Packard Enterprise (HPE)
:  Tom Jabas, Hewlett Packard Enterprise (HPE)


|-
|-
|''' 11:30-12:00'''
|''' 11:30-12:00'''
|'''[[Media:LUG2024-Leveraging_Lustre_for_US_and_Illinois_Researchers-Maloney.pdf|Leveraging Lustre as a Global File System for US and Illinois Researchers]]''' ([https://www.youtube.com/watch?v=eAxFxJ35u80&list=PLA5dHg1_l3V-W9mhbSGC9cx7KzJW_YCMF&index=4&pp=iAQB video])
|'''[[Media:LUG2024-Leveraging_Lustre_for_US_and_Illinois_Researchers-Maloney.pdf|Leveraging Lustre as a Global File System for US and Illinois Researchers]]''' ([https://www.youtube.com/watch?v=eAxFxJ35u80&list=PLA5dHg1_l3V-W9mhbSGC9cx7KzJW_YCMF&index=4&pp=iAQB video])
:  JD Maloney, National Center for Supercomputing Applications
:  JD Maloney, National Center for Supercomputing Applications


|-
|-
|''' 13:15-13:30'''
|''' 13:15-13:30'''
|'''Sponsor Talk: [[Media:LUG2024-Optimizations_and_Strategies_for_Managing_Exabyte_AI_Data_Environments-Skupinsky.pdf|Optimizations and Strategies for Managing Exabyte AI Data Environments and Accelerated Computing Demands]]''' ([https://www.youtube.com/watch?v=S1Von0NpAJs&list=PLA5dHg1_l3V-W9mhbSGC9cx7KzJW_YCMF&index=5&pp=iAQB video])
|'''Sponsor Talk: [[Media:LUG2024-Optimizations_and_Strategies_for_Managing_Exabyte_AI_Data_Environments-Skupinsky.pdf|Optimizations and Strategies for Managing Exabyte AI Data Environments and Accelerated Computing Demands]]''' ([https://www.youtube.com/watch?v=S1Von0NpAJs&list=PLA5dHg1_l3V-W9mhbSGC9cx7KzJW_YCMF&index=5&pp=iAQB video])
:  Morris Skupinsky, DDN
:  Morris Skupinsky, DDN


|-
|-
|''' 13:30-14:30'''
|''' 13:30-14:30'''
|'''[[Media:LUG2024-Lustre_2.17_and_Beyond-Dilger.pdf|Lustre 2.17 and Beyond]]''' ([https://www.youtube.com/watch?v=PxfFN4cfsiM&list=PLA5dHg1_l3V-W9mhbSGC9cx7KzJW_YCMF&index=16&pp=iAQB video])
|'''[[Media:LUG2024-Lustre_2.17_and_Beyond-Dilger.pdf|Lustre 2.17 and Beyond]]''' ([https://www.youtube.com/watch?v=PxfFN4cfsiM&list=PLA5dHg1_l3V-W9mhbSGC9cx7KzJW_YCMF&index=16&pp=iAQB video])
:  Andreas Dilger, Whamcloud
:  Andreas Dilger, Whamcloud


|-
|-
|''' 14:30-15:00'''
|''' 14:30-15:00'''
|'''[[Media:LUG2024-Performance_Monitoring_Lustre_MSFT_Cloud-Wilson.pdf|Design of Performance and Health Monitoring, Alerting, and Logging Infrastructure for Lustre in the Cloud]]''' ([https://www.youtube.com/watch?v=-x6_dB8MPrU&list=PLA5dHg1_l3V-W9mhbSGC9cx7KzJW_YCMF&index=15&pp=iAQB video])
|'''[[Media:LUG2024-Performance_Monitoring_Lustre_MSFT_Cloud-Wilson.pdf|Design of Performance and Health Monitoring, Alerting, and Logging Infrastructure for Lustre in the Cloud]]''' ([https://www.youtube.com/watch?v=-x6_dB8MPrU&list=PLA5dHg1_l3V-W9mhbSGC9cx7KzJW_YCMF&index=15&pp=iAQB video])
:  Ellis Wilson, Microsoft
:  Ellis Wilson, Microsoft


|-
|-
|''' 15:30-16:00'''
|''' 15:30-16:00'''
|'''[[Media:LUG2024-Utilization_Trends_Patterns_Orion_Filesystem-Mohr.pdf|Utilization Trends and I/O Patterns in the Orion-Lustre Filesystem]]''' ([TBD video])
|'''[[Media:LUG2024-Utilization_Trends_Patterns_Orion_Filesystem-Mohr.pdf|Utilization Trends and I/O Patterns in the Orion-Lustre Filesystem]]''' ([https://www.youtube.com/watch?v=FqDBOSWiSQ0&list=PLA5dHg1_l3V-W9mhbSGC9cx7KzJW_YCMF&index=18&pp=iAQB video])
:  Rick Mohr, Oak Ridge National Laboratory
:  Rick Mohr, Oak Ridge National Laboratory
Line 73: Line 73:
|-
|-
|''' 09:00-09:30'''
|''' 09:00-09:30'''
|'''Keynote: [[Media:LUG2024-Perspectives_Considerations_Academic_Research_Computing-Sill.pdf|Perspectives in Storage Considerations for Academic and Research Computing
|'''Keynote: [[Media:LUG2024-Perspectives_Considerations_Academic_Research_Computing-Sill.pdf|Perspectives in Storage Considerations for Academic and Research Computing]]''' ([https://www.youtube.com/watch?v=9ks_unXfzW4&list=PLA5dHg1_l3V-W9mhbSGC9cx7KzJW_YCMF&index=6&pp=iAQB video])
]]'''  
:  Alan Sill, High Performance Computing Center, Texas Tech University  
:  Alan Sill, High Performance Computing Center, Texas Tech University  


|-
|-
|''' 09:30-10:00'''
|''' 09:30-10:00'''
|'''[[Media:LUG2024-TASSI-Bent.pdf|Automated AI-Analysis of the Lustre-Development Mailing List]]'''
|'''[[Media:LUG2024-TASSI-Bent.pdf|Automated AI-Analysis of the Lustre-Development Mailing List]]''' ([https://www.youtube.com/watch?v=FhgPQYrTIfw&list=PLA5dHg1_l3V-W9mhbSGC9cx7KzJW_YCMF&index=7&pp=iAQB video])
:  John Bent, Los Alamos National Lab
:  John Bent, Los Alamos National Lab


|-
|-
|''' 10:30-11:00'''
|''' 10:30-11:00'''
|'''[[Media:LUG2024-PoliMOR_In_Action-Brumgard.pdf|PoliMOR In Action]]'''
|'''[[Media:LUG2024-PoliMOR_In_Action-Brumgard.pdf|PoliMOR In Action]]''' ([https://www.youtube.com/watch?v=Ae6DArC9kJ4&list=PLA5dHg1_l3V-W9mhbSGC9cx7KzJW_YCMF&index=8&pp=iAQB video])
:  Christopher Brumgard, Oak Ridge National Laboratory
:  Christopher Brumgard, Oak Ridge National Laboratory


|-
|-
|''' 11:00-11:30'''
|''' 11:00-11:30'''
|'''[[Media:LUG2024-Asynchronous_IO_A_Practical_Guide_for_Optimizing_HPC_Workflows-Platonov.pdf|Asynchronous I/O: A Practical Guide for Optimizing HPC Workflows]]'''
|'''[[Media:LUG2024-Asynchronous_IO_A_Practical_Guide_for_Optimizing_HPC_Workflows-Platonov.pdf|Asynchronous I/O: A Practical Guide for Optimizing HPC Workflows]]''' ([https://www.youtube.com/watch?v=wIW_JOLjLDw&list=PLA5dHg1_l3V-W9mhbSGC9cx7KzJW_YCMF&index=9&pp=iAQB video])
:  Sergei Platonov, Xinnor
:  Sergei Platonov, Xinnor


|-
|-
|''' 11:30-12:00'''
|''' 11:30-12:00'''
|'''[[Media:LUG2024-Hybrid_IO_Path_Update-Farrell.pdf|Hybrid IO Path Update]]'''
|'''[[Media:LUG2024-Hybrid_IO_Path_Update-Farrell.pdf|Hybrid IO Path Update]]''' ([https://www.youtube.com/watch?v=wGSKu5IVh5c&list=PLA5dHg1_l3V-W9mhbSGC9cx7KzJW_YCMF&index=10&pp=iAQB video])
:  Patrick Farrell, Oracle
:  Patrick Farrell, Oracle


|-
|-
|''' 13:00-13:30'''
|''' 13:00-13:30'''
|'''[[Media:LUG2024-Sunfish_Management_of_Lustre_On-Demand_FAM-based_Filesystem-Aguilar.pdf|Sunfish Management of Lustre On-Demand FAM-based Filesystem]]'''
|'''[[Media:LUG2024-Sunfish_Management_of_Lustre_On-Demand_FAM-based_Filesystem-Aguilar.pdf|Sunfish Management of Lustre On-Demand FAM-based Filesystem]]''' ([https://www.youtube.com/watch?v=QkUukmA2Anw&list=PLA5dHg1_l3V-W9mhbSGC9cx7KzJW_YCMF&index=11&pp=iAQB video])
:  Michael Aguilar, Sandia National Laboratories
:  Michael Aguilar, Sandia National Laboratories


|-
|-
|''' 13:30-14:00'''
|''' 13:30-14:00'''
|'''[[Media:LUG2024-AI_ML_Benchmarking_and_Lustre-Samar.pdf|AI/ML Benchmarking and Lustre]]'''
|'''[[Media:LUG2024-AI_ML_Benchmarking_and_Lustre-Samar.pdf|AI/ML Benchmarking and Lustre]]''' ([https://www.youtube.com/watch?v=51W7xI4anrE&list=PLA5dHg1_l3V-W9mhbSGC9cx7KzJW_YCMF&index=12&pp=iAQB video])
:  Sakib Samar, Hewlett Packard Enterprise
:  Sakib Samar, Hewlett Packard Enterprise


|-
|-
|''' 14:00-14:30'''
|''' 14:00-14:30'''
|'''[[Media:LUG2024-AI_Workload_Optimization_with_Lustre-Dauchy-Degremont.pdf|AI workloads and Lustre: Diving into an LLM in a Kerberized Production Environment]]'''
|'''[[Media:LUG2024-AI_Workload_Optimization_with_Lustre-Dauchy-Degremont.pdf|AI workloads and Lustre: Diving into an LLM in a Kerberized Production Environment]]''' ([https://www.youtube.com/watch?v=3ZEjfvLtito&list=PLA5dHg1_l3V-W9mhbSGC9cx7KzJW_YCMF&index=13&pp=iAQB video])
:  Nathan Dauchy, Aurélien Degrémont, NVIDIA
:  Nathan Dauchy, Aurélien Degrémont, NVIDIA


|-
|-
|''' 14:30-15:00'''
|''' 14:30-15:00'''
|'''[[Media:LUG2024-Lustre_IPv6_Support-Simmons.pdf|Lustre IPv6 support]]'''
|'''[[Media:LUG2024-Lustre_IPv6_Support-Simmons.pdf|Lustre IPv6 support]]''' ([https://www.youtube.com/watch?v=QUQ_lD3Vm7A&list=PLA5dHg1_l3V-W9mhbSGC9cx7KzJW_YCMF&index=14&pp=iAQB video])
:  James Simmons, Oak Ridge National Laboratory
:  James Simmons, Oak Ridge National Laboratory


|-
|-
|''' 15:30-16:00'''
|''' 15:30-16:00'''
|'''[[Media:LUG2024-Managing_High_Availability_Lustre_Using_Multiple_Namespaces-Gipson.pdf|Managing a High Availability Lustre Environment Using Multiple Namespaces]]'''
|'''[[Media:LUG2024-Managing_High_Availability_Lustre_Using_Multiple_Namespaces-Gipson.pdf|Managing a High Availability Lustre Environment Using Multiple Namespaces]]''' ([https://www.youtube.com/watch?v=riqFfWaZpAM&list=PLA5dHg1_l3V-W9mhbSGC9cx7KzJW_YCMF&index=19&pp=iAQB video])
:  Bradley Gipson, Oak Ridge National Labratory
:  Bradley Gipson, Oak Ridge National Labratory


|-
|-
|''' 16:00-16:30'''
|''' 16:00-16:30'''
|'''[[Media:LUG2024-Continous_Testing_Integration_Keeps_Lustre_Code_Great-Drokin.pdf|Continuous Testing and Integration to Keep Lustre Code Great]]'''
|'''[[Media:LUG2024-Continous_Testing_Integration_Keeps_Lustre_Code_Great-Drokin.pdf|Continuous Testing and Integration to Keep Lustre Code Great]]''' ([https://www.youtube.com/watch?v=7N7kIk7X8A4&list=PLA5dHg1_l3V-W9mhbSGC9cx7KzJW_YCMF&index=20&pp=iAQB video])
:  Oleg Drokin, DDN-Whamcloud
:  Oleg Drokin, DDN-Whamcloud



Latest revision as of 13:00, 26 June 2024

LUG 2024 was held at Texas Tech University in Lubbock, Texas on May 6-9 2024

LUG 2023 Agenda

Monday, May 6 – Day 0

Developer Day

Tuesday, May 7 – Day 1

09:15-09:30 Welcome Talk
Megan Larko, Kevin Harms, OpenSFS
09:30-10:00 Lustre Community Update (video)
Peter Jones, Whamcloud
10:00-10:30 Native Linux client status (video)
James Simmons, Oak Ridge National Laboratory
11:00-11:30 Scalable Auto-Tiering (video)
Tom Jabas, Hewlett Packard Enterprise (HPE)
11:30-12:00 Leveraging Lustre as a Global File System for US and Illinois Researchers (video)
JD Maloney, National Center for Supercomputing Applications
13:15-13:30 Sponsor Talk: Optimizations and Strategies for Managing Exabyte AI Data Environments and Accelerated Computing Demands (video)
Morris Skupinsky, DDN
13:30-14:30 Lustre 2.17 and Beyond (video)
Andreas Dilger, Whamcloud
14:30-15:00 Design of Performance and Health Monitoring, Alerting, and Logging Infrastructure for Lustre in the Cloud (video)
Ellis Wilson, Microsoft
15:30-16:00 Utilization Trends and I/O Patterns in the Orion-Lustre Filesystem (video)
Rick Mohr, Oak Ridge National Laboratory
16:00-17:00 OpenSFS Update
OpenSFS Board
17:00-17:40 Student Mixer Event
17:00-19:00 Networking and Social Event
National Ranching Heritage Center

Wednesday, May 8 – Day 2

09:00-09:30 Keynote: Perspectives in Storage Considerations for Academic and Research Computing (video)
Alan Sill, High Performance Computing Center, Texas Tech University
09:30-10:00 Automated AI-Analysis of the Lustre-Development Mailing List (video)
John Bent, Los Alamos National Lab
10:30-11:00 PoliMOR In Action (video)
Christopher Brumgard, Oak Ridge National Laboratory
11:00-11:30 Asynchronous I/O: A Practical Guide for Optimizing HPC Workflows (video)
Sergei Platonov, Xinnor
11:30-12:00 Hybrid IO Path Update (video)
Patrick Farrell, Oracle
13:00-13:30 Sunfish Management of Lustre On-Demand FAM-based Filesystem (video)
Michael Aguilar, Sandia National Laboratories
13:30-14:00 AI/ML Benchmarking and Lustre (video)
Sakib Samar, Hewlett Packard Enterprise
14:00-14:30 AI workloads and Lustre: Diving into an LLM in a Kerberized Production Environment (video)
Nathan Dauchy, Aurélien Degrémont, NVIDIA
14:30-15:00 Lustre IPv6 support (video)
James Simmons, Oak Ridge National Laboratory
15:30-16:00 Managing a High Availability Lustre Environment Using Multiple Namespaces (video)
Bradley Gipson, Oak Ridge National Labratory
16:00-16:30 Continuous Testing and Integration to Keep Lustre Code Great (video)
Oleg Drokin, DDN-Whamcloud
16:30-17:00 Closing Remarks
OpenSFS