Home >> Science >> Social Sciences >> Psychology >> Tests and Testing >> Psychometrics


  Intelligence and Ability Tests
  Personality Tests
   


For tools on a parapsychology phenomenon of distance noesis, view psychometry.

Psychometrika is a field of survey caring by using the theory & system of psychological measurement, which includes the mensuration of cognition, abilities, attitudes, & personality traits. A field is primarily caring by owning a learn of differences between people. It involves 2 major search tasks, viz.: (we) a construction of instruments & procedures for measuring; & (deuce) a development & filtration of theoretical approaches to measuring.

Origins and background

Tremendously of the early theoretical & applied operate witharound psychometry was undertaken in an attempt to measure intelligence. A origin of psychometry has modems to the related field of psychophysics. Charles Spearman, a pioneer within psychometry world health organization developed approaches to the measure of intelligence, exposed under Wilhelm Wundt and was trained in psychophysics. A psychometrician L. L. Thurstone later developed and applied a theoretical approach to the measurement referred to as the law of comparative judgment, an approach which has close connections to the psychophysical theory developed by Ernst Heinrich Weber and Gustav Fechner. Additionally, Spearman & Thurstone each manufactured crucial contributions to the theory & application of factor analysis, a statistical procedure that has been utilized extensively within psychometry.

Extra recently, psychometric theory has been applied in the measuring of personality, attitudes & beliefs, academic accomplishment, and around health-related fields. Mensuration one unobservable phenomena is hard, & great deal of the search & accumulated art thereinside discipline has been developed in an attempt to properly define & quantify such phenomena. Critics, including practician in the physical sciences & social activists, st& argued that such definition and quantification is impossibly hard, and that such measuring come typically lost. Exponent of psychometric techniques might reply, though, that their critics typically misuse information by non using psychometric criteria, & too that various quantitative phenomena in the physical sciences, like heat & forces, can't become found directly however must become inferred from either their manifestations.

Numbers world health organization manufactured important contributions to psychometry include Karl Pearson, L. L. Thurstone, Georg Rasch and Arthur Jensen.

Definition of measurement in the social sciences

the definition of measure in the social sciences has been a controversial issue. The presently far flung definition, projected by Stanley Smith Stevens (1946), is that measurement is "the assignment of numerals to objects or events according to some rule". This definition was introduced in the paper where Stevens projected quartet levels of measurement. Although widely adoptive, this definition differs inside significant respects from either a supplementary authoritative definition of measure adopted throughout a physical sciences, which is that mensuration is the numerical estimation & expression of the magnitude of a single quantity relative to a second (Michell, 1997). Indeed, Stevens' definition of measuring was put send on around response to the British Ferguson Committee, whose chair The. Ferguson was the physicist. A committee was appointed within 1932 per British Association for a Advancement of Science to investigate the possibility of quantitatively estimating receptive cases. Although its chair & more members were physicists, a committee as well comprised many psychologists. A committee's report highlighted a importance of the definition of measuring. When Stevens' response was to propose a freshly definition, which has experienced considerable influence in the field, this was not by a long sight the just response to the report. A second, notably different, response wequally to assume a authoritative definition, as reflected in the as a consequence statement:

These diverging reactions come reflected to the big extent inside guide approaches to measuring. E.g., methods according to covariance matrices come generally listed on the assumption that statistics, like raw scores from either assessments, come measure. Such approaches implicitly entail Stevens' definition of measure, which takes just that figures come assigned based on data from a select few rule. A independent a food & drug administration project, so, is typically considered to become the discovery of associations between scores, and of factors posited to underlie such associations. But then, while mensuration system like a Rasch model are employed, statistics are non assigned according to the rule. Instead, around keeping by having Reese's statement above, specific criteria for even mensuration come stated, & a objective is to construct procedures or operations that provide information which meet a relevant criteria. Measuring come judged according to a system, & tests come conducted to assure whether it has been conceivable to meet a relevant criteria.

Instruments and procedures

A number one psychometric instruments were designed to measure a conception of intelligence. A better known historical approach involves a Stanford-Binet IQ test, developed originally by the French Psychologist Alfred Binet. Contrary to a fairly far flung misconcoption, no compelling grounds to believe that these are imaginable to measure unconditioned intelligence across such instruments, in the feel of an unconditioned learning capacity insensible by own household budget, nor was this the original intention after it were developed. Notwithstanding, IQ tests come utile information for various purposes. An guide conception of intelligence is that cognitive facilities inside souls come the manifestation of the general component, or even general intelligence factor, when well as cognitive capacity specific to the given domain.

Psychometry is applied widely withwithin training assessment to measure abilities in domains like reading, writing, & math. A independent approaches withinside using tests in these domains keep around been Classical Trial Theory & a additional modern Item Response Theory & Rasch measure system. These modern approaches permit joint scaling of souls & assessment things, which will bring the basis for mapping of developmental continuthe by permitting descriptions of the skills displayed at various points along a continuum. Such approaches provide right principles on a nature and severity of developmental incubation inside various domains.

An additional major focus within psychometrika keep close at hand get on personality testing. There st& been the range of theoretical approaches to conceptualising and with measurements of personality. A bit of of a better known instruments include the Minnesota Multiphasic Personality Inventory and the Myers-Briggs Type Indicator. Attitudes use besides been exposed extensively around psychometry. The most common approach to the measuring of attitudes is the have of the Likert scale. An guide approach involves a application of unfolding mensuration system, a virtually all general existence a Hyperbolic Cosine Model (Andrich & Luo, 1993).

Theoretical approaches

Psychometric theory involves many distinct areas of survey. Number one, psychometricians use developed the big body of theory utilized in the development of mental tests & analysis of information collected from either these tests. This act may be about divided into classical test theory (CTT) and a other recent item response theory (IRT). An approach which is similar to IRT however likewise quite distinctive, inside terms of its origins & features, is represented per Rasch model for measurement. A development of a Rasch model, & the wide class of system to which it belongs, was explicitly based in requirements of measuring in the physical sciences (Rasch, 1960).

2nd, psychometricians use at times developed methods for working by owning big matrices of correlations & covariances. Techniques in that general tradition include factor analysis (finding significant underlying dimensions in the information), multidimensional scaling (finding the elementary representation for high-dimensional information) & data clustering (finding objects which are then prefer both more). Around these multivariate descriptive methods, users try to simplify big numbers of information. Additional recently, structural equation modeling and path analysis represent more sophisticated approaches to solving this condition of big covariance matrices. These methods allow statistically sophisticated system to become fitted to information & tested to determine whenever it is adequate fits.

Key concepts

A key traditional conception around definitive line 1 text theory come reliability and validity. The dependable measure is with measurements of something systematically, when the valid measure is with measurements of what it is supposed to measure. The dependable measure can be uniform while forgoing necessarily existence valid, .e.g., a measuring instrument rather the broken ruler can universally under-measure the quantity per equivalent total every period (systematically), however the sequent quantity is however wrongly, that is, shut-in. For another example, a dependable rifle have had the pinching bunch of bullets in the target, when the valid of these may center that cluster as much as the center of the target.

Each dependability & validity can be assessed mathematically. Internal consistency can be assessed by correlating performance in 2 halves of a line 3 text (split-half dependability); the value of the Pearson product-moment correlation coefficient is adjusted with a Spearman-Brown prediction formula to correspond to the correlation between two good-length tests. More approaches include the intra-class correlation (a ratio of variance of measure of a given target to the variance of tons targets). The unremarkably utilized measure is Cronbach's α, which is equivalent to the mean of all possible split-half coefficients. Stability all over recurrent measures is assessed by using a Pearson coefficient, when is the equivalence of different versions of the equivalent measure (different forms of an iq test, e.g.). More measures come as well utilized.

Validity can become assessed by correlating measures by having the criterion measure known to be valid. Once a criterion measure is collected at a equivalent period when a measure existence validated a goal is to establish concurrent validity; when a criterion is collected later on a goal is to establish predictive validity. The measure has construct validity if it is related to more variables equally expected by theory. Content validity, or face validity, is just a demonstration that a things of a line 1 text come drawn from either the domain existence measured; it doesn't assure that the line 2 text actually measures phenomena therein domain.

Prognostic or even coinciding validity just can not exceed a square of the correlation between two versions of the equivalent measure.

Item response theory system a relationship between latent traits and responses to trial things. Among more benefits, IRT will bring a basis for obtaining an estimate of the locatiin of the end line text-taker on the given latent trait besides when the standard error of measure of that location. E.g., the university student's noesis of history may exist as deduced from either his or even her score in the university trial and so be equated faithfully by having the high school student's cognition deduced from either the less hard line 2 text. Scores derived by authoritative line 3 text theory don't keep close at h& this characteristic, and assessment of actual ability (like than ability proportional to more line 1 text-takers) must become assessed by comparing scores to victims of the norm group randomly selected from a people. As a matter of fact, a lot measures from either definitive end line text theory come contingent on the sample tested, when, in theory, victims from either item response theory are not.

For a few, a field of psychometry has controversial aspects on to the person implications of applied measuring. Within section, a disputation involves a super notion of standardized tests. For others, a problematic aspects of psychometry require a history of the field, which require aspects of eugenics.

Psychometrics Research Unit
Research results, questionnaires and publications of the Psychometrics Research Unit (University of Valencia, Spain). Content a mixture of English, Spanish, Valencian and Portuguese.

Principal Components and Factor Analysis
Chapter from StatSoft's Electronic Statistics Textbook.

RepGrid
Includes the Centre for Person-Computer Studies, Knowledge Support Systems, RepGrid and WebGrid software details, and the Personal Construct Psychology site. Many full-text articles for download.

Reliability and Validity
Clear introduction to the nature and assessment of test reliability and validity.

Applied Psychometric Society
Resource for professionals in psychometrics. Operated by Fordham University.

The Correlation Coefficient
Online textbook by political scientist RJ Rummel.

Factor Analysis
Introduction to factor analysis, with worked examples.

Personal Construct Psychology Research Group
University of Wollongong based research group. Resources include free software for analysing repertory grids, and searchable citation database of PCP references.

Ingrid Yahoo Group
Free downloadable software for analysing repertory grids.

Multivariate Data Analysis
Outline for Michael Friendly's course on multivariate data analysis and linear statistical models in behavioural science research. Includes bibliographies, computing guides, notes, and SAS examples covering topics such as regression, ANOVA, and factor analysis.


Science: Math: Statistics
Science: Social Sciences: Psychology: Research Methods





© 2005 GeneralAnswers.org