Linear Models with R, Second Edition

Author: Julian J. Faraway

Publisher: CRC Press

ISBN: 1439887349

Category: Mathematics

Page: 286

View: 8994

A Hands-On Way to Learning Data Analysis Part of the core of statistics, linear models are used to make predictions and explain the relationship between the response and the predictors. Understanding linear models is crucial to a broader competence in the practice of statistics. Linear Models with R, Second Edition explains how to use linear models in physical science, engineering, social science, and business applications. The book incorporates several improvements that reflect how the world of R has greatly expanded since the publication of the first edition. New to the Second Edition Reorganized material on interpreting linear models, which distinguishes the main applications of prediction and explanation and introduces elementary notions of causality Additional topics, including QR decomposition, splines, additive models, Lasso, multiple imputation, and false discovery rates Extensive use of the ggplot2 graphics package in addition to base graphics Like its widely praised, best-selling predecessor, this edition combines statistics and R to seamlessly give a coherent exposition of the practice of linear modeling. The text offers up-to-date insight on essential data analysis topics, from estimation, inference, and prediction to missing data, factorial models, and block designs. Numerous examples illustrate how to apply the different methods using R.

Generalized Additive Models

An Introduction with R, Second Edition

Author: Simon N. Wood

Publisher: CRC Press

ISBN: 1498728375

Category: Mathematics

Page: 496

View: 9649

The first edition of this book has established itself as one of the leading references on generalized additive models (GAMs), and the only book on the topic to be introductory in nature with a wealth of practical examples and software implementation. It is self-contained, providing the necessary background in linear models, linear mixed models, and generalized linear models (GLMs), before presenting a balanced treatment of the theory and applications of GAMs and related models. The author bases his approach on a framework of penalized regression splines, and while firmly focused on the practical aspects of GAMs, discussions include fairly full explanations of the theory underlying the methods. Use of R software helps explain the theory and illustrates the practical application of the methodology. Each chapter contains an extensive set of exercises, with solutions in an appendix or in the book’s R data package gamair, to enable use as a course text or for self-study. Simon N. Wood is a professor of Statistical Science at the University of Bristol, UK, and author of the R package mgcv.

An Introduction to Generalized Linear Models, Third Edition

Author: Annette J. Dobson,Adrian Barnett

Publisher: Chapman and Hall/CRC

ISBN: 9781584889502

Category: Mathematics

Page: 320

View: 896

Continuing to emphasize numerical and graphical methods, An Introduction to Generalized Linear Models, Third Edition provides a cohesive framework for statistical modeling. This new edition of a bestseller has been updated with Stata, R, and WinBUGS code as well as three new chapters on Bayesian analysis. Like its predecessor, this edition presents the theoretical background of generalized linear models (GLMs) before focusing on methods for analyzing particular kinds of data. It covers normal, Poisson, and binomial distributions; linear regression models; classical estimation and model fitting methods; and frequentist methods of statistical inference. After forming this foundation, the authors explore multiple linear regression, analysis of variance (ANOVA), logistic regression, log-linear models, survival analysis, multilevel modeling, Bayesian models, and Markov chain Monte Carlo (MCMC) methods. Using popular statistical software programs, this concise and accessible text illustrates practical approaches to estimation, model fitting, and model comparisons. It includes examples and exercises with complete data sets for nearly all the models covered.

Introduction to General and Generalized Linear Models

Author: Henrik Madsen,Poul Thyregod

Publisher: CRC Press

ISBN: 1439891141

Category: Mathematics

Page: 316

View: 5176

Bridging the gap between theory and practice for modern statistical model building, Introduction to General and Generalized Linear Models presents likelihood-based techniques for statistical modelling using various types of data. Implementations using R are provided throughout the text, although other software packages are also discussed. Numerous examples show how the problems are solved with R. After describing the necessary likelihood theory, the book covers both general and generalized linear models using the same likelihood-based methods. It presents the corresponding/parallel results for the general linear models first, since they are easier to understand and often more well known. The authors then explore random effects and mixed effects in a Gaussian context. They also introduce non-Gaussian hierarchical models that are members of the exponential family of distributions. Each chapter contains examples and guidelines for solving the problems via R. Providing a flexible framework for data analysis and model building, this text focuses on the statistical methods and models that can help predict the expected value of an outcome, dependent, or response variable. It offers a sound introduction to general and generalized linear models using the popular and powerful likelihood techniques. Ancillary materials are available at www.imm.dtu.dk/~hm/GLM

A Primer on Linear Models

Author: John F. Monahan

Publisher: CRC Press

ISBN: 9781420062045

Category: Mathematics

Page: 304

View: 9989

A Primer on Linear Models presents a unified, thorough, and rigorous development of the theory behind the statistical methodology of regression and analysis of variance (ANOVA). It seamlessly incorporates these concepts using non-full-rank design matrices and emphasizes the exact, finite sample theory supporting common statistical methods. With coverage steadily progressing in complexity, the text first provides examples of the general linear model, including multiple regression models, one-way ANOVA, mixed-effects models, and time series models. It then introduces the basic algebra and geometry of the linear least squares problem, before delving into estimability and the Gauss–Markov model. After presenting the statistical tools of hypothesis tests and confidence intervals, the author analyzes mixed models, such as two-way mixed ANOVA, and the multivariate linear model. The appendices review linear algebra fundamentals and results as well as Lagrange multipliers. This book enables complete comprehension of the material by taking a general, unifying approach to the theory, fundamentals, and exact results of linear models.

Statistical Regression and Classification

From Linear Models to Machine Learning

Author: Norman Matloff

Publisher: CRC Press

ISBN: 1351645897

Category: Business & Economics

Page: 490

View: 6825

Statistical Regression and Classification: From Linear Models to Machine Learning takes an innovative look at the traditional statistical regression course, presenting a contemporary treatment in line with today's applications and users. The text takes a modern look at regression: * A thorough treatment of classical linear and generalized linear models, supplemented with introductory material on machine learning methods. * Since classification is the focus of many contemporary applications, the book covers this topic in detail, especially the multiclass case. * In view of the voluminous nature of many modern datasets, there is a chapter on Big Data. * Has special Mathematical and Computational Complements sections at ends of chapters, and exercises are partitioned into Data, Math and Complements problems. * Instructors can tailor coverage for specific audiences such as majors in Statistics, Computer Science, or Economics. * More than 75 examples using real data. The book treats classical regression methods in an innovative, contemporary manner. Though some statistical learning methods are introduced, the primary methodology used is linear and generalized linear parametric models, covering both the Description and Prediction goals of regression methods. The author is just as interested in Description applications of regression, such as measuring the gender wage gap in Silicon Valley, as in forecasting tomorrow's demand for bike rentals. An entire chapter is devoted to measuring such effects, including discussion of Simpson's Paradox, multiple inference, and causation issues. Similarly, there is an entire chapter of parametric model fit, making use of both residual analysis and assessment via nonparametric analysis. Norman Matloff is a professor of computer science at the University of California, Davis, and was a founder of the Statistics Department at that institution. His current research focus is on recommender systems, and applications of regression methods to small area estimation and bias reduction in observational studies. He is on the editorial boards of the Journal of Statistical Computation and the R Journal. An award-winning teacher, he is the author of The Art of R Programming and Parallel Computation in Data Science: With Examples in R, C++ and CUDA.

Environmental and Ecological Statistics with R

Author: Song S. Qian

Publisher: CRC Press

ISBN: 9781420062083

Category: Mathematics

Page: 440

View: 8281

Emphasizing the inductive nature of statistical thinking, Environmental and Ecological Statistics with R connects applied statistics to the environmental and ecological fields. It follows the general approach to solving a statistical modeling problem, covering model specification, parameter estimation, and model evaluation. The author uses many examples to illustrate the statistical models and presents R implementations of the models. The book first builds a foundation for conducting a simple data analysis task, such as exploratory data analysis and fitting linear regression models. It then focuses on statistical modeling, including linear and nonlinear models, classification and regression tree, and the generalized linear model. The text also discusses the use of simulation for model checking, provides tools for a critical assessment of the developed model, and explores multilevel regression models, which are a class of models that can have a broad impact in environmental and ecological data analysis. Based on courses taught by the author at Duke University, this book focuses on statistical modeling and data analysis for environmental and ecological problems. By guiding readers through the processes of scientific problem solving and statistical model development, it eases the transition from scientific hypothesis to statistical model.

Graphics for Statistics and Data Analysis with R

Author: Kevin J. Keen

Publisher: CRC Press

ISBN: 0429632215

Category: Mathematics

Page: 590

View: 6066

Praise for the First Edition "The main strength of this book is that it provides a unified framework of graphical tools for data analysis, especially for univariate and low-dimensional multivariate data. In addition, it is clearly written in plain language and the inclusion of R code is particularly useful to assist readers’ understanding of the graphical techniques discussed in the book. ... It not only summarises graphical techniques, but it also serves as a practical reference for researchers and graduate students with an interest in data display." -Han Lin Shang,?Journal of Applied Statistics Graphics for Statistics and Data Analysis with R, Second Edition, presents the basic principles of graphical design and applies these principles to engaging examples using the graphics and lattice packages in R. It offers a wide array of modern graphical displays for data visualization and representation. Added in the second edition are coverage of the ggplot2 graphics package, material on human visualization and color rendering in R, on screen, and in print. Features Emphasizes the fundamentals of statistical graphics and best practice guidelines for producing and choosing among graphical displays in R Presents technical details on topics such as: the estimation of quantiles, nonparametric and parametric density estimation; diagnostic plots for the simple linear regression model; polynomial regression, splines, and locally weighted polynomial regression for producing a smooth curve; Trellis graphics for multivariate data Provides downloadable R code and data for figures at www.graphicsforstatistics.com Kevin J. Keen is a Professor of Mathematics and Statistics at the University of Northern British Columbia (Prince George, Canada) and an Accredited Professional StatisticianTM by the Statistical Society of Canada and the American Statistical Association.

Design of Experiments

An Introduction Based on Linear Models

Author: Max Morris

Publisher: CRC Press

ISBN: 1439894906

Category: Mathematics

Page: 376

View: 9917

Offering deep insight into the connections between design choice and the resulting statistical analysis, Design of Experiments: An Introduction Based on Linear Models explores how experiments are designed using the language of linear statistical models. The book presents an organized framework for understanding the statistical aspects of experimental design as a whole within the structure provided by general linear models, rather than as a collection of seemingly unrelated solutions to unique problems. The core material can be found in the first thirteen chapters. These chapters cover a review of linear statistical models, completely randomized designs, randomized complete blocks designs, Latin squares, analysis of data from orthogonally blocked designs, balanced incomplete block designs, random block effects, split-plot designs, and two-level factorial experiments. The remainder of the text discusses factorial group screening experiments, regression model design, and an introduction to optimal design. To emphasize the practical value of design, most chapters contain a short example of a real-world experiment. Details of the calculations performed using R, along with an overview of the R commands, are provided in an appendix. This text enables students to fully appreciate the fundamental concepts and techniques of experimental design as well as the real-world value of design. It gives them a profound understanding of how design selection affects the information obtained in an experiment.

Analysis of Variance, Design, and Regression

Linear Modeling for Unbalanced Data, Second Edition

Author: Ronald Christensen

Publisher: CRC Press

ISBN: 1498774059

Category: Mathematics

Page: 610

View: 8363

Analysis of Variance, Design, and Regression: Linear Modeling for Unbalanced Data, Second Edition presents linear structures for modeling data with an emphasis on how to incorporate specific ideas (hypotheses) about the structure of the data into a linear model for the data. The book carefully analyzes small data sets by using tools that are easily scaled to big data. The tools also apply to small relevant data sets that are extracted from big data. New to the Second Edition Reorganized to focus on unbalanced data Reworked balanced analyses using methods for unbalanced data Introductions to nonparametric and lasso regression Introductions to general additive and generalized additive models Examination of homologous factors Unbalanced split plot analyses Extensions to generalized linear models R, Minitab®, and SAS code on the author’s website The text can be used in a variety of courses, including a yearlong graduate course on regression and ANOVA or a data analysis course for upper-division statistics students and graduate students from other fields. It places a strong emphasis on interpreting the range of computer output encountered when dealing with unbalanced data.

Extending the Linear Model with R

Generalized Linear, Mixed Effects and Nonparametric Regression Models

Author: Julian J. Faraway

Publisher: CRC Press

ISBN: 9780203492284

Category: Mathematics

Page: 312

View: 5410

Linear models are central to the practice of statistics and form the foundation of a vast range of statistical methodologies. Julian J. Faraway's critically acclaimed Linear Models with R examined regression and analysis of variance, demonstrated the different methods available, and showed in which situations each one applies. Following in those footsteps, Extending the Linear Model with R surveys the techniques that grow from the regression model, presenting three extensions to that framework: generalized linear models (GLMs), mixed effect models, and nonparametric regression models. The author's treatment is thoroughly modern and covers topics that include GLM diagnostics, generalized linear mixed models, trees, and even the use of neural networks in statistics. To demonstrate the interplay of theory and practice, throughout the book the author weaves the use of the R software environment to analyze the data of real examples, providing all of the R commands necessary to reproduce the analyses. All of the data described in the book is available at http://people.bath.ac.uk/jjf23/ELM/ Statisticians need to be familiar with a broad range of ideas and techniques. This book provides a well-stocked toolbox of methodologies, and with its unique presentation of these very modern statistical techniques, holds the potential to break new ground in the way graduate-level courses in this area are taught.

Generalized Linear Mixed Models

Modern Concepts, Methods and Applications

Author: Walter W. Stroup

Publisher: CRC Press

ISBN: 1439815135

Category: Mathematics

Page: 555

View: 1359

Generalized Linear Mixed Models: Modern Concepts, Methods and Applications presents an introduction to linear modeling using the generalized linear mixed model (GLMM) as an overarching conceptual framework. For readers new to linear models, the book helps them see the big picture. It shows how linear models fit with the rest of the core statistics curriculum and points out the major issues that statistical modelers must consider. Along with describing common applications of GLMMs, the text introduces the essential theory and main methodology associated with linear models that accommodate random model effects and non-Gaussian data. Unlike traditional linear model textbooks that focus on normally distributed data, this one adopts a generalized mixed model approach throughout: data for linear modeling need not be normally distributed and effects may be fixed or random. With numerous examples using SAS® PROC GLIMMIX, this book is ideal for graduate students in statistics, statistics professionals seeking to update their knowledge, and researchers new to the generalized linear model thought process. It focuses on data-driven processes and provides context for extending traditional linear model thinking to generalized linear mixed modeling. See Professor Stroup discuss the book.

Modelling Survival Data in Medical Research, Second Edition

Author: David Collett

Publisher: CRC Press

ISBN: 1584883251

Category: Mathematics

Page: 410

View: 5818

Critically acclaimed and resoundingly popular in its first edition, Modelling Survival Data in Medical Research has been thoroughly revised and updated to reflect the many developments and advances--particularly in software--made in the field over the last 10 years. Now, more than ever, it provides an outstanding text for upper-level and graduate courses in survival analysis, biostatistics, and time-to-event analysis.The treatment begins with an introduction to survival analysis and a description of four studies that lead to survival data. Subsequent chapters then use those data sets and others to illustrate the various analytical techniques applicable to such data, including the Cox regression model, the Weibull proportional hazards model, and others. This edition features a more detailed treatment of topics such as parametric models, accelerated failure time models, and analysis of interval-censored data. The author also focuses the software section on the use of SAS, summarising the methods used by the software to generate its output and examining that output in detail. Profusely illustrated with examples and written in the author's trademark, easy-to-follow style, Modelling Survival Data in Medical Research, Second Edition is a thorough, practical guide to survival analysis that reflects current statistical practices.

Generalized Linear Models, Second Edition

Author: P. McCullagh,John A. Nelder

Publisher: CRC Press

ISBN: 9780412317606

Category: Mathematics

Page: 532

View: 8970

The success of the first edition of Generalized Linear Models led to the updated Second Edition, which continues to provide a definitive unified, treatment of methods for the analysis of diverse types of data. Today, it remains popular for its clarity, richness of content and direct relevance to agricultural, biological, health, engineering, and other applications. The authors focus on examining the way a response variable depends on a combination of explanatory variables, treatment, and classification variables. They give particular emphasis to the important case where the dependence occurs through some unknown, linear combination of the explanatory variables. The Second Edition includes topics added to the core of the first edition, including conditional and marginal likelihood methods, estimating equations, and models for dispersion effects and components of dispersion. The discussion of other topics-log-linear and related models, log odds-ratio regression models, multinomial response models, inverse linear and related models, quasi-likelihood functions, and model checking-was expanded and incorporates significant revisions. Comprehension of the material requires simply a knowledge of matrix theory and the basic ideas of probability theory, but for the most part, the book is self-contained. Therefore, with its worked examples, plentiful exercises, and topics of direct use to researchers in many disciplines, Generalized Linear Models serves as ideal text, self-study guide, and reference.

Statistical Rethinking

A Bayesian Course with Examples in R and Stan

Author: Richard McElreath

Publisher: CRC Press

ISBN: 1315362619

Category: Mathematics

Page: 487

View: 4402

Statistical Rethinking: A Bayesian Course with Examples in R and Stan builds readers’ knowledge of and confidence in statistical modeling. Reflecting the need for even minor programming in today’s model-based statistics, the book pushes readers to perform step-by-step calculations that are usually automated. This unique computational approach ensures that readers understand enough of the details to make reasonable choices and interpretations in their own modeling work. The text presents generalized linear multilevel models from a Bayesian perspective, relying on a simple logical interpretation of Bayesian probability and maximum entropy. It covers from the basics of regression to multilevel models. The author also discusses measurement error, missing data, and Gaussian process models for spatial and network autocorrelation. By using complete R code examples throughout, this book provides a practical foundation for performing statistical inference. Designed for both PhD students and seasoned professionals in the natural and social sciences, it prepares them for more advanced or specialized statistical modeling. Web Resource The book is accompanied by an R package (rethinking) that is available on the author’s website and GitHub. The two core functions (map and map2stan) of this package allow a variety of statistical models to be constructed from standard model formulas.

Generalized Linear Models with Random Effects

Unified Analysis via H-likelihood

Author: Youngjo Lee,John A. Nelder,Yudi Pawitan

Publisher: CRC Press

ISBN: 9781420011340

Category: Mathematics

Page: 416

View: 8661

Since their introduction in 1972, generalized linear models (GLMs) have proven useful in the generalization of classical normal models. Presenting methods for fitting GLMs with random effects to data, Generalized Linear Models with Random Effects: Unified Analysis via H-likelihood explores a wide range of applications, including combining information over trials (meta-analysis), analysis of frailty models for survival data, genetic epidemiology, and analysis of spatial and temporal models with correlated errors. Written by pioneering authorities in the field, this reference provides an introduction to various theories and examines likelihood inference and GLMs. The authors show how to extend the class of GLMs while retaining as much simplicity as possible. By maximizing and deriving other quantities from h-likelihood, they also demonstrate how to use a single algorithm for all members of the class, resulting in a faster algorithm as compared to existing alternatives. Complementing theory with examples, many of which can be run by using the code supplied on the accompanying CD, this book is beneficial to statisticians and researchers involved in the above applications as well as quality-improvement experiments and missing-data analysis.

Linear Models and the Relevant Distributions and Matrix Algebra

Author: David A. Harville

Publisher: CRC Press

ISBN: 1351264672

Category: Mathematics

Page: 524

View: 7830

Linear Models and the Relevant Distributions and Matrix Algebra provides in-depth and detailed coverage of the use of linear statistical models as a basis for parametric and predictive inference. It can be a valuable reference, a primary or secondary text in a graduate-level course on linear models, or a resource used (in a course on mathematical statistics) to illustrate various theoretical concepts in the context of a relatively complex setting of great practical importance. Features: Provides coverage of matrix algebra that is extensive and relatively self-contained and does so in a meaningful context Provides thorough coverage of the relevant statistical distributions, including spherically and elliptically symmetric distributions Includes extensive coverage of multiple-comparison procedures (and of simultaneous confidence intervals), including procedures for controlling the k-FWER and the FDR Provides thorough coverage (complete with detailed and highly accessible proofs) of results on the properties of various linear-model procedures, including those of least squares estimators and those of the F test. Features the use of real data sets for illustrative purposes Includes many exercises David Harville served for 10 years as a mathematical statistician in the Applied Mathematics Research Laboratory of the Aerospace Research Laboratories at Wright-Patterson AFB, Ohio, 20 years as a full professor in Iowa State University’s Department of Statistics where he now has emeritus status, and seven years as a research staff member of the Mathematical Sciences Department of IBM’s T.J. Watson Research Center. He has considerable relevant experience, having taught M.S. and Ph.D. level courses in linear models, been the thesis advisor of 10 Ph.D. graduates, and authored or co-authored two books and more than 80 research articles. His work has been recognized through his election as a Fellow of the American Statistical Association and of the Institute of Mathematical Statistics and as a member of the International Statistical Institute.

Linear Algebra and Matrix Analysis for Statistics

Author: Sudipto Banerjee,Anindya Roy

Publisher: CRC Press

ISBN: 1420095382

Category: Mathematics

Page: 580

View: 308

Linear Algebra and Matrix Analysis for Statistics offers a gradual exposition to linear algebra without sacrificing the rigor of the subject. It presents both the vector space approach and the canonical forms in matrix theory. The book is as self-contained as possible, assuming no prior knowledge of linear algebra. The authors first address the rudimentary mechanics of linear systems using Gaussian elimination and the resulting decompositions. They introduce Euclidean vector spaces using less abstract concepts and make connections to systems of linear equations wherever possible. After illustrating the importance of the rank of a matrix, they discuss complementary subspaces, oblique projectors, orthogonality, orthogonal projections and projectors, and orthogonal reduction. The text then shows how the theoretical concepts developed are handy in analyzing solutions for linear systems. The authors also explain how determinants are useful for characterizing and deriving properties concerning matrices and linear systems. They then cover eigenvalues, eigenvectors, singular value decomposition, Jordan decomposition (including a proof), quadratic forms, and Kronecker and Hadamard products. The book concludes with accessible treatments of advanced topics, such as linear iterative systems, convergence of matrices, more general vector spaces, linear transformations, and Hilbert spaces.

Time Series

Modeling, Computation, and Inference

Author: Raquel Prado,Mike West

Publisher: CRC Press

ISBN: 1420093363

Category: Mathematics

Page: 368

View: 1682

Focusing on Bayesian approaches and computations using simulation-based methods for inference, Time Series: Modeling, Computation, and Inference integrates mainstream approaches for time series modeling with significant recent developments in methodology and applications of time series analysis. It encompasses a graduate-level account of Bayesian time series modeling and analysis, a broad range of references to state-of-the-art approaches to univariate and multivariate time series analysis, and emerging topics at research frontiers. The book presents overviews of several classes of models and related methodology for inference, statistical computation for model fitting and assessment, and forecasting. The authors also explore the connections between time- and frequency-domain approaches and develop various models and analyses using Bayesian tools, such as Markov chain Monte Carlo (MCMC) and sequential Monte Carlo (SMC) methods. They illustrate the models and methods with examples and case studies from a variety of fields, including signal processing, biomedicine, and finance. Data sets, R and MATLAB® code, and other material are available on the authors’ websites. Along with core models and methods, this text offers sophisticated tools for analyzing challenging time series problems. It also demonstrates the growth of time series analysis into new application areas.