Skip to content

This material contains content on how to profile and optimize simple Pytorch mnist code using NVIDIA Nsight Systems and Pytorch Profiler

License

Notifications You must be signed in to change notification settings

openhackathons-org/Profiling-AI-Software-Bootcamp

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

13 Commits
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

License

Profiling AI Software Bootcamp

This lab discusses profiling using NVIDIA® Nsight™ Systems, focusing on steps to optimize a deep neural network (DNN) training program that detects handwritten digits using a PyTorch Modified National Institute of Standards and Technology (MNIST) dataset. The techniques and strategies discussed in this lab will translate to optimizing any application that uses NVIDIA's graphic processing units (GPUs).

Bootcamp Content

This content contains 4 Labs:

  • Lab 1: Start the NVIDIA Nsight Systems lab
  • Lab 2: PyTorch MNIST and Optimization Workflow
  • Lab 3: Data Transfers between Host and GPU
  • Lab 4: Tensor Core
  • Lab 5: Summary

Bootcamp Duration

The duration of the tutorial is 2 hours.

Tools and Frameworks

The tools and frameworks used in this bootcamp are as follows

Deploying the Bootcamp Material

To deploy the Labs, please refer to the deployment guide presented here

Attribution

This material originates from the OpenHackathons GitHub repository. Check out additional materials here.

Don't forget to check out additional Open Hackathons Resources and join our OpenACC and Hackathons Slack Channel to share your experience and get more help from the community.

Licensing

Copyright © 2026 OpenACC-Standard.org. This material is released by OpenACC-Standard.org, in collaboration with NVIDIA Corporation, under the Creative Commons Attribution 4.0 International (CC BY 4.0). These materials may include references to hardware and software developed by other entities; all applicable licensing and copyrights apply.

About

This material contains content on how to profile and optimize simple Pytorch mnist code using NVIDIA Nsight Systems and Pytorch Profiler

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Contributors 2

  •  
  •