1 / 17

Infinite Data & Instant Finite Solutions: AI Compute, 5G Convergence

Explore the intersection of AI compute and 5G convergence, unlocking the potential of infinite data solutions. Discover the role of distributed AI engines and the dynamic real-time AI data flows across the network. Learn how AI requires 5G architecture and functionality for efficient execution.

lmargie
Download Presentation

Infinite Data & Instant Finite Solutions: AI Compute, 5G Convergence

An Image/Link below is provided (as is) to download presentation Download Policy: Content on the Website is provided to you AS IS for your information and personal use and may not be sold / licensed / shared on other websites without getting consent from its author. Content is provided to you AS IS for your information and personal use only. Download presentation by click this link. While downloading, if for some reason you are not able to download a presentation, the publisher may have deleted the file from their server. During download, if you can't get a presentation, the file might be deleted by the publisher.

E N D

Presentation Transcript


  1. Infinite Data & Instant Finite Solutions: AI Compute, 5G Convergence • Princeton University • May 17, 2019 • Reed Hundt • rehund@gmail.com

  2. Drones Reed’s Conjecture: A I x 5 G = Yin/Yang • Decentralization: end-points equal nodes; unstructured, coded world analyzed at maximum efficiency; AI gets network it needs, uses network as delivery platform; training centralized; inference decentralized • Network Platform: In sufficiently dense network, comms & AI indistinguishable. = Processing • Each end-point-node code world (data approaches infinity). • Communications, intelligence dynamically located

  3. G’s drive change 5G AI operates near data collection points; runs on network itself Generation: inflection Points 4G Broadband 5G connects computing w/o fiber, nearly eliminates latency Social Demands Aggregated Computing 3G Big Bandwidth Wireless Scales UGC Up, Out Networks as platforms Inflection points Threaten old leaders Digital takes mobile global

  4. 156_85 157_85 158_85 161_85 134_88 60_84 58_84 149_88 125_84 Compute, Comms evolve, approach convergence 1960-80’s 1990-2000’s 2010’s 2020+ Connectivity PCs Main- frames PCs Thin Clients Cloud Mobile devices Cloud Intelligent Agents Servers Intelligent Network • Isolated Compute; ASICs abound • Highly Structured Input • Sneakerware Data Transfer • Rise of IBM • User Generated Content Scales Up (Video), Out (Global) • Return of ASICS, end of General Purpose Computing, fiber enables Cloud • 3g, 4g enable social, depersonalized data • Rise of Google, Facebook • IOT codes objects, experience: infinite data • 5G fuses distributed computing; sorts, slims, studies, sends data • AI manages coded world: to predict is to control • Race for Control • General Purpose Computing • Client-Server Connectivity Poor; Personal Use Predominates • Fixed wide area connectivity • Rise of Intel Source: IDC; IBM

  5. Drones A I requires near-zero latency, infinite data access, finite data solutions delivery just in time at infinite points • Artificial Intelligence needs real-time contextual data. Infer from what, when, where? • AI predicts what, where, when, how? • AI must sort, select, optimize; needles in infinite hay • AI must deliver innumerable point solutions: managing world requires constant small • AI needs wireless broadband platform; 5G makes A I ubiquitous • Imagine coded world where today’s 3D gaming anticipates objectified experience, permits privacy-as-a-service. • Context (distinction creates meaning) subsumes connection

  6. AI requires 5G ever-changing, distributed information flow • AI Engines – Distributed Throughout the Network • AI compute engines distributed through the network – with different capabilities and roles • Cloud AI Engines – massive compute engines and databases – ideal for initial deep learning (DL) model training • Edge AI Engines – ultra-high performance AI engines enabled by multiple high power cores, GPU and neural accelerators, deployed locally and securely within the enterprise and factory – ideal for inference execution and reinforcement learning feedback • Device AI Engines – low power, lower performance cores with some AI accelerators – ideal for data ingestion processing for Edge and Cloud engines • Result – Dynamic Real-Time AI Data Flows Across Network • Device to Edge Flows – pre-processed data and reinforcement feedback from device to edge AI engines, DL models for execution in edge engine • Edge to Device Flows – complex inference results, DL model updates to Device engine models • Cloud to Edge Flows – DL models generated by initial training processing in Cloud AI Engine Dynamically Connected and Distributed – the 5G + AI Future Dynamic, Real-Time AI Data Flows Delivered by QoS-Managed 5G URLLC Networks

  7. AI execution requires 5g architecture, functionality • Wide-Area Network Insufficient • Cellular and DSL backhaul latencies excessive • Latency to regional data center typically >40ms and often >100ms • Massive Local Compute, Memory Viable • Local compute can be based on low-cost, 24-48 core GPU engines • Large (10GB+) local DRAM and Multi-Terrabyte SSD local storage • Opportunity to deploy GPU or neural-network accelerators without power limitations • Enables massive, high performance local neural network implementations – with billions of parameters • Local Security, Personalized Privacy • Critical user private data remains local to home or enterprise • Local hardware key management • Smartphones Limited Compute Power • Highly diverse set of smartphone architectures and software loads – severe AI software management challenges • Smartphone Li-Ion energy density improving at only 7% per annum • Slowing Moore’s Law limiting mobile compute capability evolution • Aging population of smartphones – CPU’s often 6+ years old Locus of AI Compute Lies Between Edge Device and Aggregated Data Center: where distributed computing exists

  8. Drones Client-Server pendulum swings from A to B to (DC)C • Centralized networks limit data aggregation, constrain computing, maintain control, size matters (V= N(N-1)) • Decentralized networks challenge control, engender difference, distribute value creation, lack bottlenecks • Distributed networks dynamically suit form to function, optimize efficiency, enable AI in real-time, enable systems, create services. Cooperation supplants control. • Centralized Control: Horizontal • Decentralized Control: Vertical • Distributed Control: Contestable, Service Model Wins

  9. Must surf many waves 1. 2. 3. 4. AI Security, Privacy Concerns Criticality of privacy and security for AI data and models emerge – drive to local storage and encrypted execution within the Enterprise and Factory. 5G Deployments Proliferate Local enterprise and Industrial URLLC 5G Systems deploy providing ultra high speed (Gbps) transfer with low and ultra-low (ms) latency. Massive AI Tasks Emerge Ever-increasing complexity of deep-learning model execution in real-time exceeds limited device execution and storage capabilities. Edge AI Engines Deploy Deployment of low-cost real-time capable AI engines deployed locally within Enterprise and Factory – connected to devices via 5G. Result - Emergence of Distributed, Connected, Real-Time 5G + AI Systems

  10. Waves of change intersect: find pipeline to success

  11. Waves of change create many value opportunities Emerging use cases require exponentially increasing endpoint compute power

  12. …distributed through new media…

  13. Imagination only limit… Mammalian Near Infra Red Vision ThroughInjectable and Self-Powered Retinal Nanoantennae Ma et al., 2019, Cell 177, 1–13, April 4, 2019 • Key Innovation • Injectable nanoparticles • Provides ability to convert photons from low-energy to high-energy forms • Is this a 5G product? • Internet of things, biotech fusion, photonic conversion; photoreceptors can be linked, applies to machines/creatures alike, uses rf waves Key: Centralized Data Centers Give way to distributed computing on 5G networks

  14. Services rule, require systems integrationIndustrial Robotic Control Example • Real-Time Industrial Control – Local Inference-Training Feedback Over 5G • Deep Learning (DL ) model initial training via AI cloud Data Center AI Server deep learning training process. • DL Model loaded Enterprise AI Server to initialize inference loop over group of industrial robots via 5G. • Federated learning feedback from multiple robots to Enterprise AI server – updated DL model used for inference.

  15. What do you do when you don’t know what to do? • Explore, explore – multiple small teams, many fronts, diagnose situation • Choose technically challenging, compelling projects – invest in the best people (from all tiers) • Rapidly eliminate deadwood – projects that aren’t yielding value or insight • M & A roadmap • Find partners (complementary products)

  16. Policy in coded world:--FB is too powerful in publishing--Sole control of FB in one person is unacceptable--Coming joblessness unacceptable--Every firm needs CEO:chief ethics officer

  17. Follow Andy Grove’s 5 rules • The strategic inflection point is the time to wake up and listen • The most powerful tool of all is the word ‘no.’ • Not all problems have a technological answer, but when they do, that is the more lasting solution. • How well we communicate is determined not by how well we say things but how well we are understood. • The ability to recognize that the winds have shifted …is crucial to the future of an enterprise.

More Related