Welcome to Data Science ATL
Raj Bandyopadhyay, Travis Turney
Why this meetup?
• We have excellent meetup groups in Atlanta for various data tools:
  • Hadoop
  • R
  • LexisNexis-HPCC
• Tools are only a part of Data Science
• We need a place for discussing broader Data Science techniques and other topics
• Which brings us to the question…
What is Data Science?
Term coined jointly by Jeff Hammerbacher and DJ Patil
• Machine Learning
• Data Mining
• Statistical Modeling
• Exploratory Statistics
• BIG DATA
• Coding/Scripting/Hacking
• Data visualization
• Infographics/mashups
• Business Intelligence
• Qualitative skills
Figure: Drew Conway’s famous Venn Diagram
That looks like a lot…
• Companies want ALL of those skills
  • Ideally in ONE person (!!)
• This rare person is probably already...
  • Working at Google, Facebook, LinkedIn etc.
  • Running their own $$$ hedge fund from their cushy Manhattan penthouse
  • CEO of Cloudera
• Serious talent shortage for other firms!
How do WE get there?
• Learn
• Create
• Compete
• Collaborate
Learn!
• Books/Papers/Blogs
  • Whatever Nick recommends in his talk
  • O’Reilly and Manning Publishers
• Online courses
  • Coursera: ML, NLP, PGM
  • Udacity: Robotic cars, Statistics
  • iTunes U: Lots of stuff!
Create!
• Lots of public data sets
  • data.gov
  • The Sunlight Foundation
  • Amazon public data sets
• Most new companies provide APIs
  • Programmable web
• Create mashups
  • Correlate diverse data sources
Compete!
• Kaggle.com: “We make data science a sport”
  • Win prizes for analyzing and extracting insights from data sets
  • $500 to $3M, depending on the problem
  • An interview for a data scientist position at Facebook
• Lots of other competitions out there
  • NYC BigApps
  • ChallengePost
  • Knight News Challenge
Collaborate!
• Form study groups (for a topic, course)
• Work on projects
• Compete on Kaggle (or other contests)
  • Tell us how you did it!
• Use this meetup’s message board to get together for working/studying
• Hackathons
Let’s use this meetup
• Present your stuff
  • Full-length talks (45-60 minutes): in-depth discussion of a project, a topic you’re studying or a book/paper you’ve read
  • Lightning talks (10-15 minutes): quick overviews on particular problems, e.g.
    • “This is how I’m using data analysis at work”
    • “We have this data that we’ve collected, but we have no idea what to do with it”
    • “Here’s a cool visualization tool/technique I’ve used”
• Get people involved in brainstorming
Let’s use this meetup
• Find interesting guest speakers
  • Anyone doing cool work with data, no matter what field they’re from
    • Engineering, Statistics and other tech fields
    • Political Science, Economics and other social sciences
    • Business intelligence etc.
  • Companies building cool data products
• Use the discussion boards!
  • Organize study sessions, group projects, Kaggle competition teams
  • Beer outings
Summary
• Data Science is an interdisciplinary field
• Very few experts and lots of demand
• Lots of resources to learn
• Let’s use those resources, and collaborate using this meetup to build Atlanta up as a Data Science hotspot