Christmas Posting Dates
Doing Data Science

Doing Data Science

Paperback

By (author) Cathy O'Neill, By (author) Rachel Schutt

$29.75
List price $45.42
You save $15.67 34% off

Free delivery worldwide
Available
Dispatched in 3 business days
When will my order arrive?

  • Publisher: O'Reilly Media, Inc, USA
  • Format: Paperback | 406 pages
  • Dimensions: 152mm x 228mm x 26mm | 580g
  • Publication date: 30 November 2013
  • Publication City/Country: Sebastopol
  • ISBN 10: 1449358659
  • ISBN 13: 9781449358655
  • Edition: 1
  • Sales rank: 73,950

Product description

Now that people are aware that data can make the difference in an election or a business model, data science as an occupation is gaining ground. But how can you get started working in a wide-ranging, interdisciplinary field that's so clouded in hype? This insightful book, based on Columbia University's Introduction to Data Science class, tells you what you need to know. In many of these chapter-long lectures, data scientists from companies such as Google, Microsoft, and eBay share new algorithms, methods, and models by presenting case studies and the code they use. If you're familiar with linear algebra, probability, and statistics, and have programming experience, this book is an ideal introduction to data science. Topics include: Statistical inference, exploratory data analysis, and the data science process Algorithms Spam filters, Naive Bayes, and data wrangling Logistic regression Financial modeling Recommendation engines and causality Data visualization Social networks and data journalism Data engineering, MapReduce, Pregel, and Hadoop Doing Data Science is collaboration between course instructor Rachel Schutt, Senior VP of Data Science at News Corp, and data science consultant Cathy O'Neil, a senior data scientist at Johnson Research Labs, who attended and blogged about the course.

Other people who viewed this bought:

Showing items 1 to 10 of 10

Other books in this category

Showing items 1 to 10 of 10
Categories:

Author information

Cathy O'Neil earned a Ph.D. in math from Harvard, was postdoc at the MIT math department, and a professor at Barnard College where she published a number of research papers in arithmetic algebraic geometry. She then chucked it and switched over to the private sector. She worked as a quant for the hedge fund D.E. Shaw in the middle of the credit crisis, and then for RiskMetrics, a risk software company that assesses risk for the holdings of hedge funds and banks. She is currently a data scientist on the New York start-up scene, writes a blog at mathbabe.org, and is involved with Occupy Wall Street. Rachel Schutt is a Senior Statistician at Google Research in the New York office and adjunct assistant professor at Columbia University. She earned a PhD from Columbia University in statistics, and masters degrees in mathematics and operations research from the Courant Institute and Stanford University, respectively. Her statistical research interests include modeling and analyzing social networks, epidemiology, hierarchical modeling and Bayesian statistics. Her education-related research interests include curriculum design.

Review quote

"I enjoyed Rachel and Cathy's book, it's readable, informative, and like no other book I've read on the topic of statistics or data science." --Andrew Gelman Professor of statistics and political science, and director of the Applied Statistics Center at Columbia University "I got a lot out of Doing Data Science, finding the chapter organization on business problem specification, analytics formulation, data access/wrangling, and computer code to be very helpful in understanding DS solutions."--Steve Miller Co-founder, OpenBI, LLC, a Chicago-based business intelligence services firm