Whisper Workshop 2025, AKL, NZ

September 5, 2025, 9:00 AM

We look forward to seeing you there.

Whisper Workshop Practice

About the Event

A Deep Dive into Whisper

Whisper is one of the most advanced speech processing models, enabling state-of-the-art transcription and translation capabilities. This workshop will introduce the fundamentals of speech processing, explore Whisper and Transformer architectures, and provide a hands-on interactive notebook session to help you customize Whisper for your own projects.

 

📍 Venue: Room 405-328, 20 Symonds Street, Auckland CBD

💲 Cost: Free (refreshments and lunch provided)

🎟  Tickets: Limited to 30 participants – register early

Learning Objectives

By participating in this workshop, you’ll: 

    • Understand core concepts in speech processing.
    • Explore Whisper’s architecture and why it leads the field.
    • Practice using Whisper’s built-in functions and tuning its settings.
    • Apply fine-tuning for specialized vocabularies, accents, and audio types.

Workshop details

Prerequisites: Basic programming knowledge

Suggested materials to satisfy prerequisites: Python Beginner’s Guide.

Technologies: PyTorch, Jupyter Notebook

Hardware Requirements: Please bring your own laptop

Featured Sessions

Speech Processing

Associate Professor Waleed Abdulla will open the workshop with the foundations of speech processing, giving participants a solid base for the rest of the day.

Fri, Sep 5, 2025

9:20 AM to 10:00 AM

Foundation of Whisper

This session unpacks Whisper’s core architecture and training strategy, explaining the mechanisms that make it a state-of-the-art speech model. By understanding these foundations, you will be fully prepared for the practical exercises that follow.

Fri, Sep 5, 2025

10:00 AM to 12:00 PM

Lunch Time

Take a break, grab a plate, and enjoy lunch with us!

Fri, Sep 5, 2025

12:00 PM to 1:00 PM

Hands-On Whisper Workflows

After the break, we will dive into hands-on labs using Whisper's built-in functions. You will experiment with transcription, translation, and automatic timestamping, capabilities that can streamline everyday tasks such as turning interview recordings into text.
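As a small taste of the timestamping idea, here is a minimal sketch (not the workshop's actual notebook) that turns timestamped segments into subtitle text. It assumes the segments are dictionaries with `start`, `end` (in seconds), and `text` keys, which is the shape the open-source `openai-whisper` package returns under `result["segments"]`. The helper names `to_timestamp` and `segments_to_srt` and the sample data are hypothetical and used only for illustration.

```python
def to_timestamp(seconds: float) -> str:
    """Format a time in seconds as an SRT timestamp (HH:MM:SS,mmm)."""
    total_ms = round(seconds * 1000)
    hours, rem = divmod(total_ms, 3_600_000)
    minutes, rem = divmod(rem, 60_000)
    secs, ms = divmod(rem, 1000)
    return f"{hours:02d}:{minutes:02d}:{secs:02d},{ms:03d}"


def segments_to_srt(segments) -> str:
    """Build SRT subtitle text from a list of Whisper-style segment dicts."""
    blocks = []
    for index, seg in enumerate(segments, start=1):
        time_range = f"{to_timestamp(seg['start'])} --> {to_timestamp(seg['end'])}"
        blocks.append(f"{index}\n{time_range}\n{seg['text'].strip()}")
    return "\n\n".join(blocks) + "\n"


# Hypothetical sample data standing in for real model output. With the
# openai-whisper package installed, similar segments would come from e.g.:
#   result = whisper.load_model("base").transcribe("interview.mp3")
#   segments = result["segments"]
sample_segments = [
    {"start": 0.0, "end": 2.5, "text": " Thanks for joining us today."},
    {"start": 2.5, "end": 5.75, "text": " Let's get started."},
]

srt_text = segments_to_srt(sample_segments)
print(srt_text)
```

Keeping the formatting separate from the transcription call means the same helper can be reused whichever model size or audio file you try during the labs.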

Fri, Sep 5, 2025

1:00 PM to 2:00 PM

Whisper Customization

In the last session, we will provide a step-by-step guide on how to fine-tune Whisper with your own custom dataset. You will leave with the skills to build a customized model tailored to your project’s unique needs.

Fri, Sep 5, 2025

2:00 PM to 4:00 PM

Registration

The DeepNet Discovery Network cordially invites you to our upcoming workshop, exclusively for academics and students in the Department of Electrical, Computer and Software Engineering. Please register using your university email address to secure your place.

Keynote speakers

Satwinder Singh

Dr

Waleed Abdulla

Associate Professor

Zihan Zhong

PhD Candidate

Ben Wang

PhD Candidate