Exploring Data in Engineering, the Sciences, and Medicine

Exploring Data in Engineering, the Sciences, and Medicine

5 (1 rating by Goodreads)
By (author) 

Free delivery worldwide

Available. Dispatched from the UK in 3 business days
When will my order arrive?


Two recent and ongoing developments have greatly increased both the range of opportunities for exploratory data analysis and the variety of tools to support this type of analysis. First has been the dramatic rise in the number of publicly available datasets available free from the Internet and second has been the similarly dramatic evolution of the Open Source software movement, making powerful analysis packages like R also freely available. The objective of this book is to provide a reasonably thorough introduction to a useful subset of these analysis tools, illustrating what they are, what they do, and when and how they sometimes fail or do something very different than we expect them to. Specific topics covered include descriptive characterizations like summary statistics (mean, median, standard deviation, MAD scale estimate, etc.), graphical techniques like boxplots and nonparametric density estimates, various forms of regression modeling (standard linear regression models, logistic regression, and highly robust techniques like least trimmed squares), and the recognition and treatment of important data anomalies like outliers and missing data.
In addition, the book also introduces a variety of dynamic data analysis tools, including autocorrelation analysis, parametric and nonparametric spectrum estimation, and the use of nonlinear data cleaning filters to improve dynamic characterization results. The book assumes familiarity with calculus and linear algebra, but does not assume any prior exposure to probability or statistics. Both simulation-based and real data examples are included and the book is intended either as an introductory textbook for an exploratory data analysis course like ones the author taught at the ETH where some of this material was used, or for self-study. Exercises are included at the end of each chapter and both R code and datasets are available through the associated OUP website.
show more

Product details

  • Hardback | 792 pages
  • 160.02 x 238.76 x 48.26mm | 1,315.41g
  • Oxford University Press Inc
  • New York, United States
  • English
  • 250 line illustrations
  • 0195089650
  • 9780195089653
  • 1,068,302

About Ronald Pearson

Ronald Pearson has held a wide variety of technical positions in both academia and industry, including the DuPont Company, the Swiss Federal Institute of Technology (ETH, Zurich), the Tampere University of Technology in Tampere, Finland, and most recently, the Travelers Companies. Dr. Pearson's experience has included the analysis and modeling of industrial process operating data, the design of nonlinear digital filters for data cleaning
applications, the analysis of historical clinical data, and he is currently involved in developing models for predictive analytics applied to large business datasets. His research interests include model structure selection for nonlinear discrete-time dynamic models of empirical data, the algebraic characterization and design
of nonlinear digital filters, and the development of exploratory data analysis techniques for large datasets involving mixed data types.
show more

Table of contents

1. The Art of Analyzing Data ; 2. Data: Types, Uncertainty and Quality ; 3. Characterizing Categorical Variables ; 4. Uncertainty in Real Variables ; 5. Fitting Straight Lines ; 6. A Brief Introduction to Estimation Theory ; 7. Outliers: Distributional Monsters That Lurk in Data ; 8. Characterizing a Dataset ; 9. Confidence Intervals and Hypothesis Testing ; 10. Associations between Variables ; 11. Regression Models I: Real Data ; 12. Re-expression: Data Transformations ; 13. Regression Models II: Mixed Data Types ; 14. Characterizing Analysis Results ; 15. Regression Models III: Diagnostics and Refinements ; 16. Dynamic Data Characterization ; 17. Linear Data Filters ; 18. Nonparametric Spectrum Estimation ; 19. Irregularities in Dynamic Analysis ; 20. Dealing with Missing Data
show more

Rating details

1 ratings
5 out of 5 stars
5 100% (1)
4 0% (0)
3 0% (0)
2 0% (0)
1 0% (0)
Book ratings by Goodreads
Goodreads is the world's largest site for readers with over 50 million reviews. We're featuring millions of their reader ratings on our book pages to help you find your new favourite book. Close X