510 likes | 527 Views
Learn how to offload work from people to computers with Continuous Integration in the cloud using Hudson. With Hudson, you can spend more CPU power to help you, do more frequent build/test executions, and use static code analysis tools. Hudson is an open-source CI server that is easy to install and configure. It supports various source code repositories and build tools, records results, and notifies people. It also supports distributed builds and can automate system installations. Hudson is localized to 8 languages and has wide adoption in businesses.
E N D
Continuous Integrationin the Cloudwith Hudson Kohsuke Kawaguchi Jesse Glick Sun Microsystems, Inc. Hudson committers
Rise of Continuous Integration • Offload from people, push to computers $ computers us time
Spend more CPU power to help you • … even if it only helps a little • First on your laptops and workstations • IDEs are at the forefront • And then to the servers • a.k.a. “Continuous Integration” • More frequent build/test executions • Static code analysis tools • And more to come
Hudson https://hudson.dev.java.net/ • Open-source CI server at java.net • Emphasis on ease of installation and use • “java -jar hudson.war” execution • Configure everything from browsers • Extensibility • 140+ community-developed public plugins • By 150+ contributors • Estimated 13,000 installations
It basically does builds and tests • Check out the source code • Subversion, Perforce, Git, Mercurial, CVS, … • Do builds and/or tests • Ant, Maven, MSBuild, shell script, … • Record results • Binary, test results, code coverage, static analysis • Notify people • E-mail, IM, RSS, tray apps, IDEs
Why Distributed Builds? • You need to use multiple computers because… • You need different environments • You need isolation • There’s only so much you can do with 1 computer
Before we talk about clouds… • Going virtual doesn’t solve… • Software installation problem • Node failure problem • Remote maintenance problem • … • What does Hudson do to help you with these?
Installing new slaves • For first 20 or so slaves, we did it manually • Insert CD, click, type, click, type, click, … • But that doesn’t scale • Then we automated • Available as “Hudson PXE Plugin”
Automated System Installations • Hudson + PXE plugin • ISO images of OS • Slaves • Power on, hit F12 • PC boots from network (PXE)
Automated System Installations • Hudson + PXE plugin • ISO images of OS • Slaves • Power on, hit F12 • PC boots from network (PXE) • Choose OS from menu • Installs non-interactively Your corporate IT guy & his DHCP server
Automated System Installations • Supports OpenSolaris, Ubuntu, CentOS, Fedora • Trivial with most Linux • Cooperate with Windows, too • Quite useful outside Hudson, too • No more broken CD drives • No more CD-Rs
Distributed builds with Hudson • Master • Serves HTTP requests • Stores all important info • Slaves • 170KB single JAR • Assumed to be unreliable • Scale to at least 100 • Link • Single bi-di byte stream • No other requirements
How master and slaves start talking • For Unix slaves, via SSH • Only need SSH and JRE on slaves • We just need a host name
How master and slaves start talking • For Windows, DCOM • We just need admin user name and password • No manual intervention • Works even from Unix masters
How master and slaves start talking • Via Java Web Start • When master cannot see slaves • A separate socket connection is made
Automating JNLP launch • Once started, can be installed as Windows service
Automating JNLP launch • Emulate the JNLP client headless $ java -jar slave.jar -jnlpUrl URL
Automated Tool Installation - JDK • JDK from http://java.sun.com/ • Hudson automatically chooses the right bundle • Always up to date with new releases
Automated Tool Installation - Apache • Ant and Maven from Apache
Automated Tool Installation - Custom • Download arbitrary archive and unpack
Automated Tool Installation - Custom • Run arbitrary shell commands • Can have variants by OS
Automated Tool Installation - Extensible • Write your own • Simple Hudson extension • Just write Java code to create tool on slave • In progress: • SCMs – Mercurial, … • Install from Subversion
Heterogeneous Cluster Challenge • Your builds/tests need to run in specific environment • Dependency on individual nodes hurts utilization jobs slaves WombatWindows test Windows #1 GlassFishWindows test Windows #2 Hudson Windows test Solaris#1 Hudson Solaris test
Labels to rescue • Label is a group of slaves • Tie jobs to labels jobs slaves WombatWindows test Windows #1 Windows GlassFishWindows test Windows #2 Hudson Windows test Solaris#1 Solaris Hudson Solaris test Windows #3
Forecasting failures • Hudson monitors key health metrics of slaves • Low disk space, insufficient swap • Clock out of synch • Extensible • Slaves go offline automatically • Catch problems before they break builds
Clean up mess after builds • Kill runaway processes • Daemons, background processes left by your build • Works on Windows, Linux, Mac, and Solaris
Hudson made this extensible • Hudson detects excessive workload • Hudson notifies plugins • Plugins can provision more slaves • … assuming that you have that infrastructure
Amazon EC2: The Good • Pay as you go (10¢/h or so) • Loads on Hudson tend to be spiky • Programmable API • Instances launch at machine-speed • EC2 instances are forgetful
Amazon EC2: The Bad • Your data is still inside your firewall • Takes time to check out code • … or to archive build artifacts • Some data just can’t be moved • EC2 instances are forgetful • Can your tests run in parallel?
Hudson EC2 plugin • Built on top of typica* • What does it do? • Automatically provisions slaves on EC2 on demand • Picks the right AMI depending on demand • Starts slave agent • Shuts down unused instances * http://code.google.com/p/typica/
Putting it all together capacity # of executors queue length usage time
Hudson “Appliance” on EC2 • Run the master in the cloud too, if you like • Hudson on stock OpenSolaris AMI • Data stored persistently in Elastic Block Storage • Dynamically expandable thanks to ZFS • Online, too • Packaged as a wizard
Hudson Hadoop plugin • Just a few mouse clicks to install • Turn every Hudson slave into a Hadoop node • Distributed file system • Automatic data replication (fault tolerant) • Nice for storing old artifacts, logs, test records, … • Map/reduce framework • Large scale test results analysis / datamining • More interesting work to be done in the future
Hudson Selenium Plugin • Selenium • Tests webapps by scripting browsers • Selenium Grid • Runs Selenium over a grid of computers • Allow Hudson labels to specify where to start browsers • Hudson & Selenium both need heterogeneous cluster
Selenium Grid Hudson slaves Hudson master (selenium hub)
Conclusion • CI is here to stay • We’ll continue to push more workload to servers • Hudson makes this easy for you • Reap the benefit of a cluster in multiple ways
Resources • http://hudson.dev.java.net/ • BOF-5105 “Hudson Community Meet up” • Today 7:45pm same room • Hudson booth inside Sun Pavilion • Support Subscription • hudson@sun.com
Kohsuke Kawaguchi Jesse Glick kohsuke.kawaguchi@sun.com jesse.glick@sun.com http://hudson.dev.java.net/
Agenda • Quick Hudson introduction • Doing distributed builds • How Hudson make distributed builds easier • Taking infrastructure to cloud
Setting up slaves • Keeping slaves consistent is a good thing • Particularly hard on heterogeneous environment • General system administration tasks • Network configuration • Package installations for native tools • Tools like Puppet or cfEngine are supposed to help • Install build tools in the cluster • Prepare tools on one file system • rsync to everywhere • This part of Hudson needs improvements
Hudson EC2 plugin usage • Tell Hudson your AWS account information