70 likes | 161 Views
Explore Sqoop, a Java-based tool by Apache, for efficient data transfer to and from Hadoop. Learn about its interfaces, architecture, and support for incremental loads. Navigate its working mechanisms and supported data formats.
E N D
Apache Sqoop • What is it ? • How does it work ? • Interfaces • Example • Architecture www.semtech-solutions.co.nz info@semtech-solutions.co.nz
Scoop – What is it ? • A command line interface • ( plus web in scoop2 ) • For data import / export to Hadoop • Uses Map jobs from Map Reduce • Supports incremental loads • Written in Java • Licensed by Apache • Uses plugins for new types of data source www.semtech-solutions.co.nz info@semtech-solutions.co.nz
Scoop – How does it work ? • Data sliced into partitions • Mappers transfer data • Data types determined via meta data • Many data transfer formats supported • i.e. CSV, Avro • Can import into • Hive ( use --hive-import flag ) • Hbase ( use –hbase* flags ) www.semtech-solutions.co.nz info@semtech-solutions.co.nz
Interesting, right? This is just a sneak preview of the full presentation. We hope you like it! To see the rest of it, just click here to view it in full on PowerShow.com. Then, if you’d like, you can also log in to PowerShow.com to download the entire presentation for free.