570 likes | 741 Views
NVIDIA founder and CEO Jensen Huang took the stage in Munich — one of the hubs of the global auto industry — to introduce a powerful new AI computer for fully autonomous vehicles and a new VR application for those who design them.
E N D
A NEW COMPUTING ERA J ensen Huang, Founder & CEO, GTC Europe 2017
TWO FORCES DRIVING THE FUTURE OF COMPUTING Transistors (thousands) 107 1.1X per year 105 103 1.5X per year Single-threaded perf 1980 1990 40 Years of CPU Trend Data 2000 2010 2020 Original data up to the year 2010 collected and plotted by M. Horowitz, F. Labonte, O. Shacham, K. Olukotun, L. Hammond, and C. Batten New plot and data collected for 2010-2015 by K. Rupp 2
TWO FORCES DRIVING THE FUTURE OF COMPUTING Transistors (thousands) 107 1.1X per year 105 103 1.5X per year Single-threaded perf 1980 1990 40 Years of CPU Trend Data 2000 2010 2020 The Big Bang of Deep Learning Original data up to the year 2010 collected and plotted by M. Horowitz, F. Labonte, O. Shacham, K. Olukotun, L. Hammond, and C. Batten New plot and data collected for 2010-2015 by K. Rupp 3
RISE OF NVIDIA GPU COMPUTING GPU-Computing perf 1.5X per year 107 1.1X per year 105 103 1.5X per year Single-threaded perf 1980 1990 40 Years of CPU Trend Data 2000 2010 2020 NVIDIA Volta GPU | 21B xtors | 120 TFLOPS Original data up to the year 2010 collected and plotted by M. Horowitz, F. Labonte, O. Shacham, K. Olukotun, L. Hammond, and C. Batten New plot and data collected for 2010-2015 by K. Rupp 4
RISE OF NVIDIA GPU COMPUTING 22,000 645,000 1.8M 2012 2012 2017 2012 2017 2017 Global GTC Attendees 10X in 5 Years GPU Developers 15X in 5 Years CUDA Downloads 5X in 5 Years 5
NVIDIA GPU-ACCELERATED APPLICATIONS Computer Graphics Scientific Computing Deep Learning Data Science SIMULIA Abaqus Finite-Element Analysis AMBER Molecular Dynamics K-Means Clustering Gradient Boosting COSMO Climate Weather ChaNGa Astrophysics Support Vector Machine Generalized Linear Model ANALYTICS Gaussian Quantum Chemistry Schlumberger WG Seismic Processing ANSYS Fluent Computational Fluid Dynamics DATABASES PowerGrid Medical Imaging NVIDIA CUDA 6
NVIDIA GPU-ACCELERATED INDUSTRIES Weather Photo Album Ride Sharing Radiology AV Truck Molecular Dynamics Volumetric Rendering Route Management Recommendations Self-Driving Cars Design & Simulation Warehouse Automation Materials Science Social Media 3D CT HPC INTERNET SERVICES TRANSPORTATION MEDICAL IMAGING LOGISTICS 7
NVIDIA GPU ACCELERATES 2017 NOBEL PRIZES IN CHEMISTRY AND PHYSICS Resolution before 2013 Resolution at present Cryogenic Electron Microscopy J acques Dubochet, J oachim Frank, Richard Henderson Detection of Gravitational Waves Rainer Weiss, Barry Barish, Kip Thorne 8
ANNOUNCING NVIDIA HOLODECK THE DESIGN LAB OF THE FUTURE Photorealistic Models Physically Simulated Interaction Virtual Team Collaboration GPU-accelerated AI 9
CREATE AND COLLABORATE IN NVIDIA HOLODECK CATIA / Siemens NX Creo / Alias Maya / 3dsMAX 10
ANNOUNCING NVIDIA HOLODECK THE DESIGN LAB OF THE FUTURE Photorealistic Models Physically Simulated Interaction Virtual Team Collaboration GPU-accelerated AI Early Access NOW nvidia.com/holodeck 12
THE ERA OF AI 2017 6000 4500 $6.6B 3,000 2016 3000 1500 2002 2012 2017 2014 2017 -50 -25 0 25 AI Startup Funding 12X in 5 Years Deep Learning Papers Published 13X in 3 Years NIPS Registration 13
SOLVING THE UNSOLVABLE NVIDIA Interactive Ray Tracing 14
SOLVING THE UNSOLVABLE NVIDIA NVIDIA / Remedy Audio-driven Facial Animation Interactive Ray Tracing 15
SOLVING THE UNSOLVABLE NVIDIA NVIDIA / Remedy Audio-driven Facial Animation WRNCH Interactive Ray Tracing Pose Estimation 16
SOLVING THE UNSOLVABLE NVIDIA NVIDIA / Remedy Audio-driven Facial Animation WRNCH Interactive Ray Tracing Pose Estimation University of Edinburgh Character Animation 17
SOLVING THE UNSOLVABLE NVIDIA NVIDIA / Remedy Audio-driven Facial Animation WRNCH Interactive Ray Tracing Pose Estimation University of Edinburgh Character Animation UC Berkeley / OpenAI One-shot Imitation Learning 18
THE WORLD’S AI PLATFORM Every Framework NVIDIA Inception: 1,900 DL Startups Every Cloud and Data Center Other IT Services Fin. Services Drones Manufacturing Automotive Smart City Healthcare NVIDIA AI PLATFORM 19
AI INFERENCE IS THE NEXT GREAT CHALLENGE Training DNN Model Inferencing 20
EXPLOSION OF INTELLIGENT MACHINES 20M Inference Servers 100s of Millions of Autonomous Machines Trillions of IoT Devices 21
EXPLOSION OF NETWORK DESIGN Convolution Networks Recurrent Networks Generative Adversarial Networks Reinforcement Learning 3D-GAN PRelu ReLu BatchNorm LSTM GRU Highway Rank GAN Conditional GAN DQN Dueling DQN Coupled GAN Concat Dropout Pooling Projection Embedding BiDirectional Speech Enhancement GAN Latent space GAN A3C 22
EXPLOSION OF NETWORK COMPLEXITY 30X 10X 350X MoE DeepSpeech 3 Inception-v4 GNMT ResNet-50 DeepSpeech 2 OpenNMT DeepSpeech AlexNet GoogLeNet Inception-v2 2014 2015 2016 2017 2018 2011 2013 2015 2017 2013 2014 2015 2016 2017 2018 2012 2014 2016 Image Network Complexity GOPS * Bandwidth Speech Network Complexity GOPS * Bandwidth Translation Network Complexity GOPS * Bandwidth 23
NEW NVIDIA TENSORRT 3 Programmable Inference Accelerator TESLA P4 J ETSON TX2 TensorRT DRIVE PX 2 NVIDIA DLA TESLA V100 Compile and Optimize Neural Networks | Support for Every Framework Optimize for Each Target Platform 24
NEW NVIDIA TENSORRT 3 Programmable Inference Accelerator next input concat next input relu relu relu relu batch nm batch nm batch nm batch nm copy 3x3 CR 5x5 CR 1x1 CR 1x1 conv 3x3 conv 5x5 conv 1x1 conv 1x1 CR max pool relu relu max pool batch nm batch nm input 1x1 conv 1x1 conv input Weight & Activation Precision Calibration | Layer & Tensor Fusion Kernel Auto-Tuning | Multi-Stream Execution 25
NEW NVIDIA TENSORRT 3 Programmable Inference Accelerator Images/Sec (ResNet-50) Sentences/Sec (OpenNMT) 5,700 600 550 6,000 450 4,500 300 3,000 150 1,500 300 25 140 4 - CPU + TensorFlow CPU + TensorFlow 0 V100 + TensorFlow V100 + TensorFlow V100 + TensorRT V100 + TensorRT CPU + Torch V100 + Torch V100 + TensorRT 40x Speed-up on ResNet-50 | 140x Speed-up on OpenNMT 26
NEW NVIDIA TENSORRT 3 Programmable Inference Accelerator Images/Sec (ResNet-50) Sentences/Sec (OpenNMT) 600 6,000 450 4,500 14ms 280ms 300 3,000 153ms 7ms 7ms 150 1,500 117ms - CPU + TensorFlow CPU + TensorFlow 0 V100 + TensorFlow V100 + TensorFlow V100 + TensorRT V100 + TensorRT CPU + Torch V100 + Torch V100 + TensorRT 40x Speed-up on ResNet-50 | 140x Speed-up on OpenNMT | Half the Latency 27
NVIDIA TENSORRT 10X BETTER DATA CENTER TCO 160 CPU servers 45,000 images / second 65 KWatts 28
NVIDIA TENSORRT 10X BETTER DATA CENTER TCO 1 NVIDIA HGX with 8 Tesla V100 GPUs 45,000 images / second 3 KWatts 1/6 the Cost | 1/20 the Power 4 Racks in a Box 29
INFERENCE ON IMAGES 30
INFERENCE ON SPEECH 32
VINCENT AI 34
THE AUTONOMOUS VEHICLE REVOLUTION 36
NVIDIA DRIVE AV COMPUTING PLATFORM LIDAR Camera Deep Learning (Detection) Camera Deep Learning (Freespace) HD Map Camera Computer Vision (SLAM) RADAR Fusion Path Planning Point-Cloud Processing (Localizing to Map) DRIVE AV DRIVEWORKS SDK DRIVE OS DRIVE PX —AI CAR COMPUTER Sensor Fusion: RADAR, LIDAR, Camera | Deep Learning, CV, Parallel Computing Diversity of Algorithms | ASIL-D Functional Safety | Fully Integrated into NVIDIA BB8 37
“ RIDE-HAILING INDUSTRY EXPECTED TO GROW EIGHTFOLD TO $285B BY 2030” —Goldman Sachs Uber Lyft Waymo GM / Cruise Ford / Argo.ai Baidu Zoox nuTonomy Yandex 39
ROBOTAXI DEMANDS EXTREME COMPUTING + 10X camera resolution + Surround LIDAR point-cloud processing + Camera & LIDAR localization to HD map + Tracking all surrounding objects + New map generation + Sophisticated path planning & control + Algorithm diversity + Sensor & computing fail-operate + Excess computing capacity Level 2 Level 5 40
STATE-OF-THE-ART DRIVERLESS VEHICLES 41
ANNOUNCING “ PEGASUS” ROBOTAXI DRIVE PX 320 TOPS CUDA TensorCore | 16x GMSL | 4x 10G | 8x 1G | 16x 100M | Auto-grade | ASIL D 500W | Late Q1 Early Access Partners Supercomputing Data Center in your Trunk 42
ANNOUNCING “ PEGASUS” ROBOTAXI DRIVE PX 320 TOPS CUDA TensorCore | 16x GMSL | 4x 10G | 8x 1G | 16x 100M | Auto-grade | ASIL D 500W | Late Q1 Early Access Partners Supercomputing Data Center in your Trunk 43
ANNOUNCING “ PEGASUS” ROBOTAXI DRIVE PX 320 TOPS CUDA TensorCore | 16x GMSL | 4x 10G | 8x 1G | 16x 100M | Auto-grade | ASIL D 500W | Late Q1 Early Access Partners Supercomputing Data Center in your Trunk 45
ANNOUNCING NVIDIA DRIVE IX SDK Intelligent Experience Toolkit Exterior Driver Recognition Automatic Personalization Inattentive Driver Alert Cyclist Alert Distracted Driver Alert Driver/Passenger Recognition Customer Application DRIVE AV DRIVE IX Object, Path, Wait Perception Gaze, Head Pose, Gestures, Recognize Face, Voice Recognition & Lip Reading DRIVE OS Sense Inside & Outside the Vehicle | Deep Learning Powered | Early Access Q4 Your Car is an AI 46
NVIDIA DRIVE AI CAR PLATFORM SIL HIL Data Factory Training on DGX 3D-SIM on DGX DRIVE AV on DPX RE-SIM on DGX From Training to Driving | From Processor to AI to AV Stack | From AV to IX From 3D Simulation to Super-Real-Time Testing | Open Platform for Partners 47
SUPER REAL-TIME SIMULATION WITH DGX AND NEW TENSORRT 3 With 8 NVIDIA DGX Re-Sim 300,000 miles in 5 hours Virtually Drive Every Paved Road in U.S. in 2 Days 48
145 AV STARTUPS ON NVIDIA DRIVE 50