110 likes | 242 Views
The AR250 A New Architecture for Ray Traced Rendering. SIGGRAPH ‘99 – Slides. Architecture overview AR250 Rendering Processor RenderDrive Network Appliance How and why it works Problems with Parallelism The ART Model Main AR250 Data Flows AR250 Statistics Analysis Performance
E N D
The AR250A New Architecture forRay Traced Rendering Advanced Rendering Technology
SIGGRAPH ‘99 – Slides Architecture overview • AR250 Rendering Processor • RenderDrive Network Appliance How and why it works • Problems with Parallelism • The ART Model • Main AR250 Data Flows • AR250 Statistics • Analysis Performance • Functionality • Rendering Speed Advanced Rendering Technology
Local DRAM Memory Manager Ray Tracing Engine On Chip Data Cache Control Unit Host Bus Interface Host SystemPCI Interface Additional AR250 processors AR250 Rendering Processor Advanced Rendering Technology
RenderDrive Network Appliance Internet 3D Workstation 3D Application Web Server RenderPipe RenderGate LAN RenderDrive Flash memory boot disk Compressed RenderMan protocol HTML job control TCP / IP Embedded OS RenderMan compliant renderer Images CPU / Motherboard AR250s x 16 Advanced Rendering Technology
Problems with Parallelism • Data distribution • Load balancing • Scalability of calculation • Complexity Advanced Rendering Technology
The ART Model • Hardware intersection pipeline • Ray-parallel data distribution • Broadcast parallelism of geometry • Hierarchical geometry • Distributed concurrent shading • Vector parallel programmable shading acceleration Advanced Rendering Technology
Main AR250 Data Flows Fine grain distribution of rays between AR250s for load balancing DRAM Rays Pipeline-parallel ray geometry engine (permutes multiple geometry and rays) Hierarchical geometry Intersections Geometry Programmable shading co-processor On-chip data caches Ray generation & control (CPU) Additional AR250 processors Shader ops. Host systeminterface Geometry & shaders Pixels Host bus interface Geometry and shaders (broadcast) Pixels (sequential read-back) Programmable shaders Advanced Rendering Technology
AR250 Statistics • 0.35um drawn, LSI Logic G10 silicon process • 650K gates, 106mm2 die • Custom RISC processor core • 32, single-stage, 32 bit IEEE compatible floating-point units • Multi-dimensional noise, square root and trig. functions • 50 MHz operation Advanced Rendering Technology
Analysis • Load balancing through fine-grain parallelism • Efficient data distribution by caching rays locally and broadcasting geometry • Scalability by performing whole rendering calculation, including shading. • Low complexity programming model by support for programmable shading in silicon Advanced Rendering Technology
Functionality Current • Full shading model • Full ray-traced: Camera motion blur Depth of field Area lights Volumes Planned • Global illumination • Completion of RenderMan support Advanced Rendering Technology
Rendering Speed Speed C x P 0.82 Hebe mirror (1322 x 2000) RenderDrive (16 x AR250) 0 hrs 13 min 2 sec BMRT (PII - 333) 22 hrs 8 min Advanced Rendering Technology