Primer to Analysis of Genomic Data Using R

Primer to Analysis of Genomic Data Using R

4.66 (3 ratings by Goodreads)
By (author) 

Free delivery worldwide

Available. Dispatched from the UK in 2 business days
When will my order arrive?

Description

Through this book, researchers and students will learn to use R for analysis of large-scale genomic data and how to create routines to automate analytical steps. The philosophy behind the book is to start with real world raw datasets and perform all the analytical steps needed to reach final results. Though theory plays an important role, this is a practical book for graduate and undergraduate courses in bioinformatics and genomic analysis or for use in lab sessions. How to handle and manage high-throughput genomic data, create automated workflows and speed up analyses in R is also taught. A wide range of R packages useful for working with genomic data are illustrated with practical examples.



The key topics covered are association studies, genomic prediction, estimation of population genetic parameters and diversity, gene expression analysis, functional annotation of results using publically available databases and how to work efficiently in R with large genomic datasets. Important principles are demonstrated and illustrated through engaging examples which invite the reader to work with the provided datasets. Some methods that are discussed in this volume include: signatures of selection, population parameters (LD, FST, FIS, etc); use of a genomic relationship matrix for population diversity studies; use of SNP data for parentage testing; snpBLUP and gBLUP for genomic prediction. Step-by-step, all the R code required for a genome-wide association study is shown: starting from raw SNP data, how to build databases to handle and manage the data, quality control and filtering measures, association testing and evaluation of results, through to identification and functional annotation of candidate genes. Similarly, gene expression analyses are shown using microarray and RNAseq data.



At a time when genomic data is decidedly big, the skills from this book are critical. In recent years R has become the de facto< tool for analysis of gene expression data, in addition to its prominent role in analysis of genomic data. Benefits to using R include the integrated development environment for analysis, flexibility and control of the analytic workflow. Included topics are core components of advanced undergraduate and graduate classes in bioinformatics, genomics and statistical genetics. This book is also designed to be used by students in computer science and statistics who want to learn the practical aspects of genomic analysis without delving into algorithmic details. The datasets used throughout the book may be downloaded from the publisher's website.
show more

Product details

  • Paperback | 270 pages
  • 155 x 235 x 15.24mm | 4,918g
  • Cham, Switzerland
  • English
  • 2015 ed.
  • 10 Tables, black and white; 74 Illustrations, color; 5 Illustrations, black and white; XVI, 270 p. 79 illus., 74 illus. in color.
  • 331914474X
  • 9783319144740
  • 645,198

Back cover copy

Through this book, researchers and students will learn to use R for analysis of large-scale genomic data and how to create routines to automate analytical steps. The philosophy behind the book is to start with real world raw datasets and perform all the analytical steps needed to reach final results. Though theory plays an important role, this is a practical book for advanced undergraduate and graduate classes in bioinformatics, genomics and statistical genetics or for use in lab sessions. This book is also designed to be used by students in computer science and statistics who want to learn the practical aspects of genomic analysis without delving into algorithmic details. The datasets used throughout the book may be downloaded from the publisher's website.

Chapters show how to handle and manage high-throughput genomic data, create automated workflows and speed up analyses in R. A wide range of R packages useful for working with genomic data are illustrated with practical examples. In recent years R has become the de facto tool for analysis of gene expression data, in addition to its prominent role in the analysis of genomic data. Benefits to using R include the integrated development environment for analysis, flexibility and control of the analytic workflow.

At a time when genomic data is decidedly big, the skills from this book are critical. The key topics covered are association studies, genomic prediction, estimation of population genetic parameters and diversity, gene expression analysis, functional annotation of results using publically available databases and how to work efficiently in R with large genomic datasets. Important principles are demonstrated and illustrated through engaging examples which invite the reader to work with the provided datasets. Some methods that are discussed in this volume include: signatures of selection; population parameters (LD, FST, FIS, etc); use of a genomic relationship matrix for population diversity studies; use of SNP data for parentage testing; snpBLUP and gBLUP for genomic prediction. Step-by-step, all the R code required for a genome-wide association study is shown: starting from raw SNP data, how to build databases to handle and manage the data, quality control and filtering measures, association testing and evaluation of results, through to identification and functional annotation of candidate genes. Similarly, gene expression analyses are shown using microarray and RNAseq data.
show more

Table of contents

R basics.- Simple marker association tests.- Genome wide association studies.- Population and genetic architecture.- Gene expression analysis.- Databases and functional information.- Extending R.- Final comments.- Index.- References.
show more

Review Text

"The book is timely and practical, not only through its approach on data analysis, but also due to the numerous examples and further reading indications (including R packages and books) at the end of each chapter. The targeted audience consists of undergraduates and graduates with some experience in bioinformatics analyses. ... the style of the book can accommodate also researchers with a computing or biological background." (Irina Ioana Mohorianu, zbMATH 1327.92002, 2016)
show more

Review quote

"The book is timely and practical, not only through its approach on data analysis, but also due to the numerous examples and further reading indications (including R packages and books) at the end of each chapter. The targeted audience consists of undergraduates and graduates with some experience in bioinformatics analyses. ... the style of the book can accommodate also researchers with a computing or biological background." (Irina Ioana Mohorianu, zbMATH 1327.92002, 2016)
show more

About Cedric Gondro

Cedric Gondro is Associate Professor of computational genetics at the University of New England. He has extensive experience in analysis of livestock projects using data from various genomic platforms. His main research interests are in the development of computational methods for optimization of biological problems; statistical and functional analysis methods for high throughput genomic data (expression arrays, SNP chips, sequence data); estimation of population genetic parameters using genome-wide data; and simulation of biological systems.
show more

Rating details

3 ratings
4.66 out of 5 stars
5 67% (2)
4 33% (1)
3 0% (0)
2 0% (0)
1 0% (0)
Book ratings by Goodreads
Goodreads is the world's largest site for readers with over 50 million reviews. We're featuring millions of their reader ratings on our book pages to help you find your new favourite book. Close X