Building a new GGML backend: How, Challenges and Opportunities with Novel Accelerators

Time
August 9, 2025, 13:20 ~ 13:50
Speaker
Martin Chang
Room
TR412-2
Collaborative Notes
https://hackmd.io/H1y-tc-ulg
Mandarin, Advanced
Open Source AI and Machine Learning

Abstract

llama.cpp/GGML is a popular piece of software for running (mostly) large language models. It supports common consumer and enterprise hardware such as NVIDIA, AMD and Intel GPUs. But what if you want to onboard a new accelerator, say a new architecture that promises to cut power consumption severalfold? This talk shares the experience and knowledge gained from building a (work in progress) GGML backend for Tenstorrent’s Grayskull and Wormhole AI processors, and what it is like to work with a brand new software stack.

About the Speaker

Martin Chang

Builds open source projects from scratch in his spare time. C++, HPC, and AI engineer.