Statistical Models for Test Equating Scaling and Linking

Author: Alina von Davier
Publisher: Springer Science & Business Media
ISBN: 9780387981383
Format: PDF, Kindle
Download Now
The goal of this book is to emphasize the formal statistical features of the practice of equating, linking, and scaling. The book encourages the view and discusses the quality of the equating results from the statistical perspective (new models, robustness, fit, testing hypotheses, statistical monitoring) as opposed to placing the focus on the policy and the implications, which although very important, represent a different side of the equating practice. The book contributes to establishing “equating” as a theoretical field, a view that has not been offered often before. The tradition in the practice of equating has been to present the knowledge and skills needed as a craft, which implies that only with years of experience under the guidance of a knowledgeable practitioner could one acquire the required skills. This book challenges this view by indicating how a good equating framework, a sound understanding of the assumptions that underlie the psychometric models, and the use of statistical tests and statistical process control tools can help the practitioner navigate the difficult decisions in choosing the final equating function. This book provides a valuable reference for several groups: (a) statisticians and psychometricians interested in the theory behind equating methods, in the use of model-based statistical methods for data smoothing, and in the evaluation of the equating results in applied work; (b) practitioners who need to equate tests, including those with these responsibilities in testing companies, state testing agencies, and school districts; and (c) instructors in psychometric, measurement, and psychology programs.

Test Equating Scaling and Linking

Author: Michael J. Kolen
Publisher: Springer Science & Business Media
ISBN: 1493903179
Format: PDF, Docs
Download Now
This book provides an introduction to test equating, scaling and linking, including those concepts and practical issues that are critical for developers and all other testing professionals. In addition to statistical procedures, successful equating, scaling and linking involves many aspects of testing, including procedures to develop tests, to administer and score tests and to interpret scores earned on tests. Test equating methods are used with many standardized tests in education and psychology to ensure that scores from multiple test forms can be used interchangeably. Test scaling is the process of developing score scales that are used when scores on standardized tests are reported. In test linking, scores from two or more tests are related to one another. Linking has received much recent attention, due largely to investigations of linking similarly named tests from different test publishers or tests constructed for different purposes. In recent years, researchers from the education, psychology and statistics communities have contributed to the rapidly growing statistical and psychometric methodologies used in test equating, scaling and linking. In addition to the literature covered in previous editions, this new edition presents coverage of significant recent research. In order to assist researchers, advanced graduate students and testing professionals, examples are used frequently and conceptual issues are stressed. New material includes model determination in log-linear smoothing, in-depth presentation of chained linear and equipercentile equating, equating criteria, test scoring and a new section on scores for mixed-format tests. In the third edition, each chapter contains a reference list, rather than having a single reference list at the end of the volume The themes of the third edition include: * the purposes of equating, scaling and linking and their practical context * data collection designs * statistical methodology * designing reasonable and useful equating, scaling, and linking studies * importance of test development and quality control processes to equating * equating error, and the underlying statistical assumptions for equating

Contemporary Issues in Exploratory Data Mining in the Behavioral Sciences

Author: John J. McArdle
Publisher: Routledge
ISBN: 1135044090
Format: PDF, ePub, Docs
Download Now
This book reviews the latest techniques in exploratory data mining (EDM) for the analysis of data in the social and behavioral sciences to help researchers assess the predictive value of different combinations of variables in large data sets. Methodological findings and conceptual models that explain reliable EDM techniques for predicting and understanding various risk mechanisms are integrated throughout. Numerous examples illustrate the use of these techniques in practice. Contributors provide insight through hands-on experiences with their own use of EDM techniques in various settings. Readers are also introduced to the most popular EDM software programs. A related website at http://mephisto.unige.ch/pub/edm-book-supplement/offers color versions of the book’s figures, a supplemental paper to chapter 3, and R commands for some chapters. The results of EDM analyses can be perilous – they are often taken as predictions with little regard for cross-validating the results. This carelessness can be catastrophic in terms of money lost or patients misdiagnosed. This book addresses these concerns and advocates for the development of checks and balances for EDM analyses. Both the promises and the perils of EDM are addressed. Editors McArdle and Ritschard taught the "Exploratory Data Mining" Advanced Training Institute of the American Psychological Association (APA). All contributors are top researchers from the US and Europe. Organized into two parts--methodology and applications, the techniques covered include decision, regression, and SEM tree models, growth mixture modeling, and time based categorical sequential analysis. Some of the applications of EDM (and the corresponding data) explored include: selection to college based on risky prior academic profiles the decline of cognitive abilities in older persons global perceptions of stress in adulthood predicting mortality from demographics and cognitive abilities risk factors during pregnancy and the impact on neonatal development Intended as a reference for researchers, methodologists, and advanced students in the social and behavioral sciences including psychology, sociology, business, econometrics, and medicine, interested in learning to apply the latest exploratory data mining techniques. Prerequisites include a basic class in statistics.

Computerized Multistage Testing

Author: Duanli Yan
Publisher: CRC Press
ISBN: 146650580X
Format: PDF, ePub
Download Now
Devising tests that evaluate a nation’s educational standing and implement efficacious educational reforms requires a careful balance among the contributions of technology, psychometrics, test design, and the learning sciences. Unlike other forms of adaptive testing, multistage testing (MST) is highly suitable for testing educational achievement because it can be adapted to educational surveys and student testing. Computerized Multistage Testing: Theory and Applications covers the methodologies, underlying technology, and implementation aspects of this type of test design. The book discusses current scientific perspectives and practical considerations for each step involved in setting up an MST program. It covers the history of MST, test design and implementation for various purposes, item pool development and maintenance, IRT-based and classical test theory-based methodologies for test assembly, routing and scoring, equating, test security, and existing software. It also explores current research, existing operational programs, and innovative future assessments using MST. Intended for psychologists, social scientists, and educational measurement scientists, this volume provides the first unified source of information on the design, psychometrics, implementation, and operational use of MST. It shows how to apply theoretical statistical tools to testing in novel and useful ways. It also explains how to explicitly tie the assumptions made by each model to observable (or at least inferable) data conditions. Winner of the 2016 AERA Award for Significant Contribution to Educational Measurement and Research Methodology The 2016 American Education Research Association (AERA) Div. D award committee for Significant Contributions to Educational Measurement and Research Methodology has recognized unanimously this collaborative work advancing the theory and applications of computerized MST. This annual award recognizes published research judged to represent a significant conceptual advancement in the theory and practice of educational measurement and/or educational research methodology. The 2016 award was made under the heading: Measurement, Psychometrics, and Assessment. This collective work, published in 2014 as an edited volume titled Computerized Multistage Testing: Theory and Applications, was cited by the committee both for the originality of the conceptual foundations presented in support of multistage testing and for arguing persuasively for its potential impact on the practice of educational measurement.

Linking and Aligning Scores and Scales

Author: Neil J. Dorans
Publisher: Springer Science & Business Media
ISBN: 0387497714
Format: PDF, Mobi
Download Now
In this book, experts in statistics and psychometrics describe classes of linkages, the history of score linkings, data collection designs, and methods used to achieve sound score linkages. They describe and critically discuss applications to a variety of domains. They define what linking is, to distinguish among the varieties of linking and to describe different procedure for linking. Furthermore, they convey the complexity and diversity of linking by covering different areas of linking and providing diverse perspectives.

Test Equating

Author: Michael J. Kolen
Publisher: Springer Science & Business Media
ISBN: 1475724128
Format: PDF, Kindle
Download Now
In recent years, many researchers in the psychology and statistical communities have paid increasing attention to test equating as issues of using multiple test forms have arisen and in response to criticisms of traditional testing techniques. This book provides a practically oriented introduction to test equating which both discusses the most frequently used equating methodologies and covers many of the practical issues involved. The main themes are: - the purpose of equating - distinguishing between equating and related methodologies - the importance of test equating to test development and quality control - the differences between equating properties, equating designs, and equating methods - equating error, and the underlying statistical assumptions for equating. The authors are acknowledged experts in the field, and the book is based on numerous courses and seminars they have presented. As a result, educators, psychometricians, professionals in measurement, statisticians, and students coming to the subject for the first time as part of their graduate study will find this an invaluable text and reference.

Handbook of Modern Item Response Theory

Author: Wim J. van der Linden
Publisher: Springer Science & Business Media
ISBN: 1475726910
Format: PDF, Mobi
Download Now
Item response theory has become an essential component in the toolkit of every researcher in the behavioral sciences. It provides a powerful means to study individual responses to a variety of stimuli, and the methodology has been extended and developed to cover many different models of interaction. This volume presents a wide-ranging handbook to item response theory - and its applications to educational and psychological testing. It will serve as both an introduction to the subject and also as a comprehensive reference volume for practitioners and researchers. It is organized into six major sections: the nominal categories model, models for response time or multiple attempts on items, models for multiple abilities or cognitive components, nonparametric models, models for nonmonotone items, and models with special assumptions. Each chapter in the book has been written by an expert of that particular topic, and the chapters have been carefully edited to ensure that a uniform style of notation and presentation is used throughout. As a result, all researchers whose work uses item response theory will find this an indispensable companion to their work and it will be the subject's reference volume for many years to come.

Linear Models for Optimal Test Design

Author: Wim J. van der Linden
Publisher: Springer Science & Business Media
ISBN: 9780387202723
Format: PDF, Kindle
Download Now
This book begins with a reflection on the history of test design--the core activity of all educational and psychological testing. It then presents a standard language for modeling test design problems as instances of multi-objective constrained optimization. The main portion of the book discusses test design models for a large variety of problems from the daily practice of testing, and illustrates their use with the help of numerous empirical examples. The presentation includes models for the assembly of tests to an absolute or relative target for their information functions, classical test assembly, test equating problems, item matching, test splitting, simultaneous assembly of multiple tests, tests with item sets, multidimensional tests, and adaptive test assembly. Two separate chapters are devoted to the questions of how to design item banks for optimal support of programs with fixed and adaptive tests. Linear Models for Optimal Test Design, which does not require any specific mathematical background, has been written to be a helpful resource on the desk of any test specialist. Wim J. van der Linden is Professor of Measurement and Data Analysis, University of Twente, The Netherlands. His specialization is psychometric theory and methods, and he has been an active researcher of item response theory throughout his career. His current research is on test design, adaptive testing, test equating, and response-time modeling. Professor van der Linden is a past president of the Psychometric Society and a recipient of the NCME lifetime achievement award for his work on educational measurement.

Psychometric Methods

Author: Larry R. Price
Publisher: Guilford Publications
ISBN: 1462524788
Format: PDF, Docs
Download Now
Grounded in current knowledge and professional practice, this book provides up-to-date coverage of psychometric theory, methods, and interpretation of results. Essential topics include measurement and statistical concepts, scaling models, test design and development, reliability, validity, factor analysis, item response theory, and generalizability theory. Also addressed are norming and test equating, topics not typically covered in traditional psychometrics texts. Examples drawn from a dataset on intelligence testing are used throughout the book, elucidating the assumptions underlying particular methods and providing SPSS (or alternative) syntax for conducting analyses. The companion website presents datasets for all examples as well as PowerPoint slides of figures and key concepts. Pedagogical features include equation boxes with explanations of statistical notation, and end-of-chapter glossaries. The Appendix offers extensions of the topical chapters with example source code from SAS, SPSS, IRTPRO, BILOG-MG, PARSCALE, TESTFACT, and DIMTEST.

Handbook of Item Response Theory Volume Three

Author: Wim J. van der Linden
Publisher: CRC Press
ISBN: 1351643703
Format: PDF
Download Now
Drawing on the work of internationally acclaimed experts in the field, Handbook of Item Response Theory, Volume 3: Applications presents applications of item response theory to practical testing problems. While item response theory may be known primarily for its advances in theoretical modeling of responses to test items, equal progress has been made in its providing innovative solutions to daily testing problems. This third volume in a three-volume set highlights the major applications. Specifically, this volume covers applications to test item calibration, item analysis, model fit checking, test-score interpretation, optimal test design, adaptive testing, standard setting, and forensic analyses of response data. It describes advances in testing in areas such as large-scale educational assessment, psychological testing, health measurement, and measurement of change. In addition, it extensively reviews computer programs available to run any of the models and applications in Volume One and Three. Features Includes contributions from internationally acclaimed experts with a history of advancing applications of item response theory Provides extensive cross-referencing and common notation across all chapters in this three-volume set Underscores the importance of treating each application in a statistically rigorous way Reviews major computer programs for item response theory analyses and applications. Wim J. van der Linden is a distinguished scientist and director of research and innovation at Pacific Metrics Corporation. Dr. van der Linden is also a professor emeritus of measurement and data analysis at the University of Twente. His research interests include test theory, adaptive testing, optimal test assembly, parameter linking, test equating, and response-time modeling as well as decision theory and its applications to problems of educational decision making.