The Journey of Building a Computing Service in Our Department

Time
August 10, 2025, 14:10 ~ 14:40
Speaker
Ray Huang
Room
TR210
Collaborative Notes
https://hackmd.io/SyRzJiZuxx
English, Elementary
Open Source DevOps / SRE, Monitoring & Observability

Abstract

As computing demands grow rapidly, effectively managing and maximizing the utilization of computing resources has become a major challenge. As the number of nodes increases, resource scheduling becomes more complex. How can we ensure that jobs from different users run smoothly while maximizing overall computational efficiency?

In this talk, the speaker will share the challenges encountered while setting up computing services at the CS department of NYCU, such as resource allocation and quality of service. The speaker will also discuss solutions to these issues and introduce the role of Slurm in high-performance computing (HPC) environments, covering both fundamental concepts and practical applications.

Whether you’re interested in managing several compute nodes or want to understand how Slurm operates, this session will cover the key concepts of managing computing services, along with lessons from hands-on experience.

About the Speaker

Ray Huang

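Ray Huang (real name 黃柏竣, GitHub ID ExplorerRay) is a fourth-year computer science student at NYCU and a teaching assistant at the department's computer center. He is working toward research related to data center infrastructure.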