IBM at PyTorch 2023

  • San Francisco, CA, United States
This event has ended.

About

IBM is a proud sponsor of the 2023 PyTorch Conference in San Francisco on October 16-17. Join us to learn more about watsonx and its capabilities, and how open-source frameworks like PyTorch play a critical role in scaling enterprise AI. Here’s how:   

  • Visit the IBM booth Monday evening and Tuesday to view interactive demos of our conversational AI platform watsonx Assistant, and IBM’s geospatial AI foundation model developed in collaboration with NASA.   
  • Gather at the Women and Non-Binary in PyTorch Breakfast hosted by Quiana Berry, Red Hat on Tuesday morning to discuss navigating the AI revolution responsibly in the open-source community.   
  • Attend keynotes by IBM researchers Raghu Ganti on how to leverage PyTorch to scale AI training and inference, and Priya Nagpurkar on the value of open-source for the enterprise.   
  • Hear from IBM in poster presentations and lightning talks that discuss topics such as enabling generative AI on new hardware platforms, using PyTorch for geospatial AI model training, and scaling cloud-native PyTorch FSDP to 20B parameters for watsonx. 

We look forward to seeing you in San Francisco!

For presentation times of workshops, demos, and papers see the agenda section below.
Note: All times are displayed in your local time.


Agenda

  • Description:

    I am a middleware developer / architect, who has worked in open source a maintainer for many years. Prior to that I was a middle ware developer. I had a lot of success in bring in architectural ideals and middleware developer thought processes to my previous open source project Egeria. I have just joined PyTorch full time. This session details my thoughts on how PyTorch as a project, can be enhanced and made more accessible by using architectural / middleware development approaches. The idea is not to replace existing working process, but to add additional enrichment. I hope that the audience and community will buy into this approach and help me realise it. I think this approach is key to moving PyTorch from being research centric towards being more widely adopted. The session will cover: - my initial impressions of a noob to PyTorch. - the need for a big picture conceptual overview of PyTorch at a higher level than the code in the documentation. With the need for many more overview pictures. - bringing in the need for a Glossary for users to have terms defined once and referred to.

    Speaker: David Radley (IBM)

  • Description:

    Visit the IBM booth Monday evening and Tuesday to view interactive demos of our conversational AI platform watsonx Assistant, and IBM’s geospatial AI foundation model developed in collaboration with NASA.

  • Description:

    The computer hardware evolution has accelerated. In just the last few years we have seen new ideas on combining heterogeneous CPU cores, multiple types of memory, a renewed focus on power saving, expectations for dynamically changing resource properties, as well as enhancements for security and confidentiality for cloud workloads and data in traditional applications and new AI model processing. This evolution forces us to revisit the resource management domain of Kubernetes. This domain is one of the most fundamental and complex areas in Kubernetes. Under the hood there are several components that Kubernetes relies on: Kubelet, Runtimes, Monitoring Agents, the OS kernel and hardware. With a focus on specifics of the AI workloads and use cases, this presentation will touch on UX changes for end users, interfaces between k8s stack components and how different pluggable algorithms, implemented via code and new policies, can help achieve evolved performance, power and resource utilization and prioritization goals.

    Speakers: Mike Brown (IBM) & Alexander Kanevskiy (Intel)

  • Description:

    This talk explains our journey in enabling generative AI applications on a new hardware (HW) platform. We are working on running generative AI applications on IBM z from correctness and runtime performance perspectives. We share experiences for developers to write PyTorch and its ecosystem for a new HW. IBM z has a unique feature that uses big-endian byte order. While most HW platforms use little-endian, big-endian was not supported well in PyTorch and its pip packages. We supported both endians, for example, to exchange pre-trained models among any platform by fixing test and application failures. Our 32 PRs make no test failure on IBM z. The ecosystem, like the Hugging Face (HF) transformer framework, now works well. We will share our experience to enable CI for a new HW to keep the main branch healthy for a new HW. We enable HW acceleration features in PyTorch runtime and TorchInductor, such as SIMD. We will also briefly explain exploiting an in-core AI accelerator. Here are the takeaways: - Enabling a new HW without test failures in PyTorch and its ecosystem, like HF transformers - Adding CI for a HW new platform in the upstream - Enabling performance features for a new HW.

    Speaker: Kazuaki Ishizaki (IBM)

More events