310 likes | 496 Views
Deoendable Software Everywhere. Tao Xie Automated Software Engineering Group Department of Computer Science North Carolina State University https://sites.google.com/site/asergrp/ . Automated Software Engineering @NCSU. Software Dependability Matters.
E N D
Deoendable Software Everywhere • Tao Xie • Automated Software Engineering Group • Department of Computer Science • North Carolina State University • https://sites.google.com/site/asergrp/
Software Dependability Matters • Loss of Money: Software faults cost the U.S. economy about $59.5 billion each year (0.6% GDP) [NIST 02] • Loss of Life: Faulty medical devices caused 30,000 deaths and 600,000 injuries (1985-2005), with likely 8% due to software faults [FDA 06] • …
Improving Software Dependability Titles of Major Conference Pubs (2005-Present) http://people.engr.ncsu.edu/txie/reppubs.html
Improving Software Dependability Major Conference Pubs (2005-Present) Testing & Analysis Analytics Reliability ICSE 11, ICSE 10a, ICSE 10b ICSE 09b, ICSE 07 FSE 10, FSE 12b, FSE 12c ISSTA 11, ISSTA 10, ISSTA 09 ASE 11a, ASE 08b, ASE 06 OOPSLA 11, ECOOP 06 ICSE 12a, ICSE 09a ICSE 08, ICSE 05 FSE 09, FSE 07 ASE 11b, ASE 10, ASE 09a ASE 09b, ASE 08a, ASE 07 ECOOP 09 Security/Privacy FSE 12a ACSAC 12 FSE 11, SIGMETRICS 08 WWW 07, ACSAC 08 Performance ICSE 12b ASE 12sp, SRDS 12sp 10 ICSE, 7 FSE 3 ISSTA, 9 ASE 3 OOPLSA/ECOOP SIGMETRICS 08
Improving Software Dependability Major Conference Pubs (2005-Present) • Artifacts Under Analysis • DB apps • GUI apps • Web/SOA apps • Mobile apps • Cloud apps • Search engines • AC/Firewall policies • API docs • Bug reports • Requirements doc • Execution traces • … Testing & Analysis Analytics Reliability ICSE 11, ICSE 10a, ICSE 10b ICSE 09b, ICSE 07 FSE 10, FSE 12b, FSE 12c ISSTA 11, ISSTA 10, ISSTA 09 ASE 11a, ASE 08b, ASE 06 OOPSLA 11, ECOOP 06 ICSE 12a, ICSE 09a ICSE 08, ICSE 05 FSE 09, FSE 07 ASE 11b, ASE 10, ASE 09a ASE 09b, ASE 08a, ASE 07 ECOOP 09 Security/Privacy FSE 12a FSE 11, SIGMETRICS 08 WWW 07, ACSAC 08 FSE 11, SIGMETRICS 08 WWW 07 Performance ICSE 12b ASE 12sp, SRDS 12sp 10 ICSE, 7 FSE 3 ISSTA, 9 ASE 3 OOPLSA/ECOOP SIGMETRICS 08
Microsoft Research PexIncubation Project for Visual Studio • Download counts (20 months)(Feb. 2008 - Oct. 2009 ) • Academic: 17,366 • Devlabs: 13,022 • Total: 30,388 The contributed Fitnex search strategy [DSN 2009] included in Pex releases since Sept. 2008 http://research.microsoft.com/en-us/projects/pex/ 7
Challenges of Dynamic Symbolic Execution • Loops • Fitnex [DSN 09] • Environments • File system apps [AST 09] • Database apps [ASE 10-sp, ASE 11] • Cloud apps [IEEE Software 12] • Method sequences • Seeker [OOPSLA 11], MSeqGen [ESEC/FSE 09] Opportunities • Regression testing [ISSTA 11] • Developer guidance (cooperative developer testing) [ICSE 12] http://research.microsoft.com/en-us/projects/pex/community.aspx#publications
Microsoft Research Pex for FunTeaching and Learning CS via Social Gaming www.pexforfun.com 1,013,336clicked 'Ask Pex!' The contributed concept of Coding Duel games as major game type of Pex for Fun since Summer 2010 9 http://www.pexforfun.com/
Behind the Scene of Pex for Fun behavior Secret Impl== Player Impl Player Implementation class Player { public static int Puzzle(int x) { return x; } } Secret Implementation class Secret { public static int Puzzle(int x) { if (x <= 0) return 1; return x * Puzzle(x-1); } } class Test { public static void Driver(int x) { if (Secret.Puzzle(x) != Player.Puzzle(x)) throw new Exception(“Mismatch”); } }
Microsoft Research AsiaSoftware Analytics Recent and ongoing work (e.g., StackMine [ICSE 12b], XIAO [ACSAC 12]) with successful technology transfer in collaboration with Microsoft Research Asia 12 http://research.microsoft.com/en-us/groups/sa/
Performance debugging in the large via mining millions of stack traces StackMine [ICSE 2012] http://people.engr.ncsu.edu/txie/publications.htm#icse12-stackmine
Performance debugging in the large Pattern Matching Bugupdate Problematic Pattern Repository BugDatabase Network Bug filing How many issues are still unknown? Key to issue discovery Trace Storage Bottleneck of scalability Trace collection Which trace file should I investigate first? Trace analysis ICSE 2012
Impact ICSE 2012
Scalable code clone analysis XIAO [ACSAC 2012] http://people.engr.ncsu.edu/txie/publications.htm#acsac12-xiao
XIAO: Code Clone Analysis • Motivation • Copy-and-paste is a common developer behavior • A real tool widely adopted internally and externally • XIAO enables code clone analysis in the following way • High tunability • High scalability • High compatibility • High explorability ICSE 2012
Benefiting developer community Available in Visual Studio 2012 RC Searching similar snippets for fixing bug once Finding refactoring opportunity ICSE 2012
More secure Microsoft products Code Clone Search service integrated into workflow of Microsoft Security Response Center Over 590 million lines of code indexed across multiple products Real security issues proactively identified and addressed ICSE 2012
Example – MS Security Bulletin MS12-034 Combined Security Update for Microsoft Office, Windows, .NET Framework, and Silverlight, published: Tuesday, May 08, 2012 3 publicly disclosed vulnerabilities and seven privately reported involved. Specifically, one is exploited by the Duqu malware to execute arbitrary code when a user opened a malicious Office document Insufficient bounds check within the font parsing subsystem of win32k.sys Cloned copy in gdiplus.dll, ogl.dll (office), Silver Light, Windows Journal viewer Microsoft Technet Blog about this bulletin However, we wanted to be sure to address the vulnerable code wherever it appeared across the Microsoft code base. To that end, we have been working with Microsoft Research to develop a “Cloned Code Detection” system that we can run for every MSRC case to find any instance of the vulnerable code in any shipping product. This system is the one that found several of the copies of CVE-2011-3402 that we are now addressing with MS12-034. MSRC: Microsoft Security Response Center
Government AgencyNIST Jointly-developed ACPT (Access Control Policy Tool) beta release being beta-tested in ~130 organizations/users http://csrc.nist.gov/groups/SNS/acpt/index.html
Government Agencies FDA Our PhD student Rahul Pandita, part of the FDA Semantic Data Mining Development Team, co-recognized with 2012 FDA Group Recognition Award Test a point-of-care assistant medical device [ASE 10sp] and mine FDA incident reports
Mining Textual Software Artifacts • Detect duplicate bug reports [ICSE 08] • Identify security bug reports [MSR 10] • Mine resource specs from Javadoc[ASE 09, Best Paper Award] • Mine code contracts from API docs [ICSE 12] • Mine security policies from requirements docs [FSE 12] • javax.resource.cci.Connection • createInteraction():“Creates an interaction associated with this connection.” • getMetaData():“Gets the information on the underlying EIS instance represented through an active connection.” • close():“Initiates close of the connection handle at the application level.”
Dependable Software Everywhere • Various countries/regions • Software internationalization • Locating constant strings to translate [ICSE 09, FSE 10] • E.g., translating Megamek (a realtime strategy game) • Various programming languages • PL translation • E.g., translating Java to C# [ICSE 10]
Dependable Software Everywhere • Various types of software • Database applications [ASE 10-sp] • Network/file-system applications [AST 09] • Game applications [ICSE 09] • Cyber-physical systems (power grid, medical device software, …) • Mobile/could applications • Social network applications • …
TouchDevelop @MSR Mobile application development environment • create applications (aka “scripts”) on the phone itself • no PC required • access to phone sensors, camera, music, web, etc. • share scripts with other people • After 9 months, > 6000 games/apps written and published by users http://research.microsoft.com/projects/touchdevelop/ NCSU ASE Group: Source code license to analysis infrastructure
Teaching • for students • engaging experience • work with your personal data (pictures, songs, …) • create games • on the go • lowers bar of entry to programming • create tailored apps for micro-business Nikolai Tillmann, Michal Moskal, Jonathan de Halleux, Manuel Fahndrich, Judith Bishop, Arjmand Samuel, and Tao Xie. The Future of Teaching Programming is on Mobile Devices. In Proceedings of 17th Annual Conference on Innovation and Technology in Computer Science Education (ITiCSE 2012), 2012.
Language+Editor • typed language • enables precise auto-completion suggestions • imperative • object-oriented, but doesn’t allow definition of new objects • semi-structured editor • structured at statement level • unstructured at expression level • structured at token level
Rich APIs All phone features are available via TouchDevelop: • camera, touch, accelerometer, compass, gyro, microphone, … • contacts, calendar, email, … • pictures, songs, … • web queries, search, maps, social networks • graphics with physics engine • tiles
Script Sharing Script bazaar in cloud:central authority for script sharing • backup • anyone can publish a script • script source codeis made available • discover new scripts! • all published scripts are analyzed (privacy) • Xusheng Xiao’s internship work! Xusheng Xiao, Nikolai Tillmann, Manuel Fahndrich, Jonathan de Halleux, and Michal Moskal, User-Aware Privacy Control via Extended Static-Information-Flow Analysis. In Proc. ASE 2012
Drustworthy Software Everywhere http://people.engr.ncsu.edu/txie/ https://sites.google.com/site/asergrp/