GTC Europe 2017 Keynote

A NEW COMPUTING ERA J ensen Huang, Founder & CEO, GTC Europe 2017

TWO FORCES DRIVING THE FUTURE OF COMPUTING Transistors (thousands) 107 1.1X per year 105 103 1.5X per year Single-threaded perf 1980 1990 40 Years of CPU Trend Data 2000 2010 2020 Original data up to the year 2010 collected and plotted by M. Horowitz, F. Labonte, O. Shacham, K. Olukotun, L. Hammond, and C. Batten New plot and data collected for 2010-2015 by K. Rupp 2

TWO FORCES DRIVING THE FUTURE OF COMPUTING Transistors (thousands) 107 1.1X per year 105 103 1.5X per year Single-threaded perf 1980 1990 40 Years of CPU Trend Data 2000 2010 2020 The Big Bang of Deep Learning Original data up to the year 2010 collected and plotted by M. Horowitz, F. Labonte, O. Shacham, K. Olukotun, L. Hammond, and C. Batten New plot and data collected for 2010-2015 by K. Rupp 3

RISE OF NVIDIA GPU COMPUTING GPU-Computing perf 1.5X per year 107 1.1X per year 105 103 1.5X per year Single-threaded perf 1980 1990 40 Years of CPU Trend Data 2000 2010 2020 NVIDIA Volta GPU | 21B xtors | 120 TFLOPS Original data up to the year 2010 collected and plotted by M. Horowitz, F. Labonte, O. Shacham, K. Olukotun, L. Hammond, and C. Batten New plot and data collected for 2010-2015 by K. Rupp 4

RISE OF NVIDIA GPU COMPUTING 22,000 645,000 1.8M 2012 2012 2017 2012 2017 2017 Global GTC Attendees 10X in 5 Years GPU Developers 15X in 5 Years CUDA Downloads 5X in 5 Years 5

NVIDIA GPU-ACCELERATED APPLICATIONS Computer Graphics Scientific Computing Deep Learning Data Science SIMULIA Abaqus Finite-Element Analysis AMBER Molecular Dynamics K-Means Clustering Gradient Boosting COSMO Climate Weather ChaNGa Astrophysics Support Vector Machine Generalized Linear Model ANALYTICS Gaussian Quantum Chemistry Schlumberger WG Seismic Processing ANSYS Fluent Computational Fluid Dynamics DATABASES PowerGrid Medical Imaging NVIDIA CUDA 6

NVIDIA GPU-ACCELERATED INDUSTRIES Weather Photo Album Ride Sharing Radiology AV Truck Molecular Dynamics Volumetric Rendering Route Management Recommendations Self-Driving Cars Design & Simulation Warehouse Automation Materials Science Social Media 3D CT HPC INTERNET SERVICES TRANSPORTATION MEDICAL IMAGING LOGISTICS 7

NVIDIA GPU ACCELERATES 2017 NOBEL PRIZES IN CHEMISTRY AND PHYSICS Resolution before 2013 Resolution at present Cryogenic Electron Microscopy J acques Dubochet, J oachim Frank, Richard Henderson Detection of Gravitational Waves Rainer Weiss, Barry Barish, Kip Thorne 8

ANNOUNCING NVIDIA HOLODECK THE DESIGN LAB OF THE FUTURE Photorealistic Models Physically Simulated Interaction Virtual Team Collaboration GPU-accelerated AI 9

CREATE AND COLLABORATE IN NVIDIA HOLODECK CATIA / Siemens NX Creo / Alias Maya / 3dsMAX 10

ANNOUNCING NVIDIA HOLODECK THE DESIGN LAB OF THE FUTURE Photorealistic Models Physically Simulated Interaction Virtual Team Collaboration GPU-accelerated AI Early Access NOW nvidia.com/holodeck 12

THE ERA OF AI 2017 6000 4500 $6.6B 3,000 2016 3000 1500 2002 2012 2017 2014 2017 -50 -25 0 25 AI Startup Funding 12X in 5 Years Deep Learning Papers Published 13X in 3 Years NIPS Registration 13

SOLVING THE UNSOLVABLE NVIDIA Interactive Ray Tracing 14

SOLVING THE UNSOLVABLE NVIDIA NVIDIA / Remedy Audio-driven Facial Animation Interactive Ray Tracing 15

SOLVING THE UNSOLVABLE NVIDIA NVIDIA / Remedy Audio-driven Facial Animation WRNCH Interactive Ray Tracing Pose Estimation 16

SOLVING THE UNSOLVABLE NVIDIA NVIDIA / Remedy Audio-driven Facial Animation WRNCH Interactive Ray Tracing Pose Estimation University of Edinburgh Character Animation 17

SOLVING THE UNSOLVABLE NVIDIA NVIDIA / Remedy Audio-driven Facial Animation WRNCH Interactive Ray Tracing Pose Estimation University of Edinburgh Character Animation UC Berkeley / OpenAI One-shot Imitation Learning 18

THE WORLD’S AI PLATFORM Every Framework NVIDIA Inception: 1,900 DL Startups Every Cloud and Data Center Other IT Services Fin. Services Drones Manufacturing Automotive Smart City Healthcare NVIDIA AI PLATFORM 19

AI INFERENCE IS THE NEXT GREAT CHALLENGE Training DNN Model Inferencing 20

EXPLOSION OF INTELLIGENT MACHINES 20M Inference Servers 100s of Millions of Autonomous Machines Trillions of IoT Devices 21

EXPLOSION OF NETWORK DESIGN Convolution Networks Recurrent Networks Generative Adversarial Networks Reinforcement Learning 3D-GAN PRelu ReLu BatchNorm LSTM GRU Highway Rank GAN Conditional GAN DQN Dueling DQN Coupled GAN Concat Dropout Pooling Projection Embedding BiDirectional Speech Enhancement GAN Latent space GAN A3C 22

EXPLOSION OF NETWORK COMPLEXITY 30X 10X 350X MoE DeepSpeech 3 Inception-v4 GNMT ResNet-50 DeepSpeech 2 OpenNMT DeepSpeech AlexNet GoogLeNet Inception-v2 2014 2015 2016 2017 2018 2011 2013 2015 2017 2013 2014 2015 2016 2017 2018 2012 2014 2016 Image Network Complexity GOPS * Bandwidth Speech Network Complexity GOPS * Bandwidth Translation Network Complexity GOPS * Bandwidth 23

NEW NVIDIA TENSORRT 3 Programmable Inference Accelerator TESLA P4 J ETSON TX2 TensorRT DRIVE PX 2 NVIDIA DLA TESLA V100 Compile and Optimize Neural Networks | Support for Every Framework Optimize for Each Target Platform 24

NEW NVIDIA TENSORRT 3 Programmable Inference Accelerator next input concat next input relu relu relu relu batch nm batch nm batch nm batch nm copy 3x3 CR 5x5 CR 1x1 CR 1x1 conv 3x3 conv 5x5 conv 1x1 conv 1x1 CR max pool relu relu max pool batch nm batch nm input 1x1 conv 1x1 conv input Weight & Activation Precision Calibration | Layer & Tensor Fusion Kernel Auto-Tuning | Multi-Stream Execution 25

NEW NVIDIA TENSORRT 3 Programmable Inference Accelerator Images/Sec (ResNet-50) Sentences/Sec (OpenNMT) 5,700 600 550 6,000 450 4,500 300 3,000 150 1,500 300 25 140 4 - CPU + TensorFlow CPU + TensorFlow 0 V100 + TensorFlow V100 + TensorFlow V100 + TensorRT V100 + TensorRT CPU + Torch V100 + Torch V100 + TensorRT 40x Speed-up on ResNet-50 | 140x Speed-up on OpenNMT 26

NEW NVIDIA TENSORRT 3 Programmable Inference Accelerator Images/Sec (ResNet-50) Sentences/Sec (OpenNMT) 600 6,000 450 4,500 14ms 280ms 300 3,000 153ms 7ms 7ms 150 1,500 117ms - CPU + TensorFlow CPU + TensorFlow 0 V100 + TensorFlow V100 + TensorFlow V100 + TensorRT V100 + TensorRT CPU + Torch V100 + Torch V100 + TensorRT 40x Speed-up on ResNet-50 | 140x Speed-up on OpenNMT | Half the Latency 27

NVIDIA TENSORRT 10X BETTER DATA CENTER TCO 160 CPU servers 45,000 images / second 65 KWatts 28

NVIDIA TENSORRT 10X BETTER DATA CENTER TCO 1 NVIDIA HGX with 8 Tesla V100 GPUs 45,000 images / second 3 KWatts 1/6 the Cost | 1/20 the Power 4 Racks in a Box 29

INFERENCE ON IMAGES 30

INFERENCE ON SPEECH 32

VINCENT AI 34

THE AUTONOMOUS VEHICLE REVOLUTION 36

NVIDIA DRIVE AV COMPUTING PLATFORM LIDAR Camera Deep Learning (Detection) Camera Deep Learning (Freespace) HD Map Camera Computer Vision (SLAM) RADAR Fusion Path Planning Point-Cloud Processing (Localizing to Map) DRIVE AV DRIVEWORKS SDK DRIVE OS DRIVE PX —AI CAR COMPUTER Sensor Fusion: RADAR, LIDAR, Camera | Deep Learning, CV, Parallel Computing Diversity of Algorithms | ASIL-D Functional Safety | Fully Integrated into NVIDIA BB8 37

“ RIDE-HAILING INDUSTRY EXPECTED TO GROW EIGHTFOLD TO $285B BY 2030” —Goldman Sachs Uber Lyft Waymo GM / Cruise Ford / Argo.ai Baidu Zoox nuTonomy Yandex 39

ROBOTAXI DEMANDS EXTREME COMPUTING + 10X camera resolution + Surround LIDAR point-cloud processing + Camera & LIDAR localization to HD map + Tracking all surrounding objects + New map generation + Sophisticated path planning & control + Algorithm diversity + Sensor & computing fail-operate + Excess computing capacity Level 2 Level 5 40

STATE-OF-THE-ART DRIVERLESS VEHICLES 41

STATE-OF-THE-ART DRIVERLESS VEHICLES 44

ANNOUNCING NVIDIA DRIVE IX SDK Intelligent Experience Toolkit Exterior Driver Recognition Automatic Personalization Inattentive Driver Alert Cyclist Alert Distracted Driver Alert Driver/Passenger Recognition Customer Application DRIVE AV DRIVE IX Object, Path, Wait Perception Gaze, Head Pose, Gestures, Recognize Face, Voice Recognition & Lip Reading DRIVE OS Sense Inside & Outside the Vehicle | Deep Learning Powered | Early Access Q4 Your Car is an AI 46

NVIDIA DRIVE AI CAR PLATFORM SIL HIL Data Factory Training on DGX 3D-SIM on DGX DRIVE AV on DPX RE-SIM on DGX From Training to Driving | From Processor to AI to AV Stack | From AV to IX From 3D Simulation to Super-Real-Time Testing | Open Platform for Partners 47

SUPER REAL-TIME SIMULATION WITH DGX AND NEW TENSORRT 3 With 8 NVIDIA DGX Re-Sim 300,000 miles in 5 hours Virtually Drive Every Paved Road in U.S. in 2 Days 48

145 AV STARTUPS ON NVIDIA DRIVE 50

GTC Europe 2017 Keynote

GTC Europe 2017 Keynote

Presentation Transcript

PDF Days Europe 2017

Georgia Transmission Corporation GTC

KeyNote

Keynote

Regional Perspective to Keynote Challenges - Europe

Keynote

GTC instrumentation plan

KEYNOTE

News from GTC

sxsw keynote

2017 Europe Boutique River Cruises

Complexitools - Keynote at Agile Consortium Conference 2017 (Brussels/BE)

2017 MTV Europe Music Awards

Keynote

1390 ±190 GTC