SOCI208 Module 1 - Nature of Data

1.  Data Sets

A data set is a collection of facts assembled for a particular purpose. A rectangular data set is one where the information is organized in an element (unit) by variable format, where each row corresponds to one element and each column to one variable.
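As a minimal sketch of the element-by-variable layout, the Python snippet below stores a small data set as a list of rows, one per element, with the same variables recorded for every element. The variable names and values are invented for illustration and do not come from any real study.

```python
# Hypothetical rectangular data set: each row is one element (here, a person)
# and each key is one variable. All names and values are invented.
VARIABLES = ["id", "age", "education", "employed"]

rows = [
    {"id": 1, "age": 34, "education": "college", "employed": True},
    {"id": 2, "age": 51, "education": "high school", "employed": False},
    {"id": 3, "age": 27, "education": "graduate", "employed": True},
]

def is_rectangular(data, variables):
    """Return True if every element records exactly the same set of variables."""
    return all(set(row) == set(variables) for row in data)

def column(data, variable):
    """Return the values of one variable across all elements."""
    return [row[variable] for row in data]

print(is_rectangular(rows, VARIABLES))
print(column(rows, "age"))
```

Reading across a row gives everything recorded about one element; reading down a column gives one variable for all elements.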

2.  Levels of Measurement

1.  Levels of Measurement

An influential typology due to <> distinguishes different types of data according to the level of measurement.
 
Levels of Measurement (+ or - indicates presence or absence of the property in the corresponding column)

Columns:
(1) Exclusive & exhaustive set of attributes
(2) Ordering of categories defined
(3) Interval between categories defined
(4) Absolute zero defined (so that ratios make sense)

Type of variable                                                        (1)  (2)  (3)  (4)
A. Categorical (aka nominal, qualitative, or classification) variable    +    -    -    -
B. Ordinal variable                                                      +    +    -    -
C. Interval variable                                                     +    +    +    -
D. Ratio variable                                                        +    +    +    +
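
The table above can also be written down as a small lookup structure. The sketch below is one possible encoding in Python; the property names and the helper functions are hypothetical labels chosen here, not part of the original typology.

```python
# Properties of each level of measurement, transcribed from the table above.
# The short property names are labels invented for this sketch.
PROPERTIES = ("exclusive_exhaustive", "ordered", "interval_defined", "absolute_zero")

LEVELS = {
    "categorical": (True, False, False, False),
    "ordinal":     (True, True,  False, False),
    "interval":    (True, True,  True,  False),
    "ratio":       (True, True,  True,  True),
}

def has_property(level, prop):
    """Return True if the given level of measurement has the named property."""
    return LEVELS[level][PROPERTIES.index(prop)]

def ratios_make_sense(level):
    """Ratios are meaningful only when an absolute zero is defined."""
    return has_property(level, "absolute_zero")

# Print the table back out using the same + / - convention.
for level, flags in LEVELS.items():
    marks = " ".join("+" if flag else "-" for flag in flags)
    print(f"{level:<12} {marks}")
```

Each level has every property of the levels listed before it plus one more, which is why the rows of + signs grow by one from A to D.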

Examples:

2.  Implications of Levels of Measurement

The level of measurement has implications for the kinds of analysis that can be performed with a variable:

3.  Data Sources

1.  Experimental & Observational Studies

1.  Experimental Studies
In experimental studies the analyst is able to manipulate the value of one or several variables of interest (the experimental treatment) to estimate their effects on the dependent variable, while using random assignment of subjects to treatments to control statistically for other factors that can affect the dependent variable.  Experimental studies allow strong inferences about causal relations among variables.
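
Random assignment of subjects to treatments can be sketched in a few lines of Python. This is only an illustration under simple assumptions (two groups of equal size, a fixed seed for reproducibility); the function name and the subject labels are hypothetical.

```python
import random

def randomly_assign(subjects, seed=None):
    """Shuffle the subjects and split them into treatment and control groups."""
    rng = random.Random(seed)      # separate generator so the seed is local
    shuffled = list(subjects)      # copy, so the caller's list is not reordered
    rng.shuffle(shuffled)
    half = len(shuffled) // 2
    return shuffled[:half], shuffled[half:]

subjects = [f"subject_{i}" for i in range(1, 21)]  # hypothetical subject labels
treatment, control = randomly_assign(subjects, seed=208)
print(len(treatment), len(control))
```

Because chance alone decides who receives the treatment, other factors that affect the dependent variable are expected to be balanced across the two groups on average.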
2.  Observational Studies
In observational studies the analyst cannot manipulate the values of the independent variable(s) of interest and cannot use randomization to control the effects of other factors.  It is more difficult to estimate causal relations with observational than with experimental data, because there is always the possibility of omitting an important causal factor.
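
The risk of omitting an important causal factor can be illustrated with a small simulation. The sketch below is hypothetical throughout: the variable names (x, z, y), the effect sizes, and the probabilities are assumptions chosen for illustration. A factor z raises both the chance that x equals 1 and the value of y, so a comparison that ignores z overstates the effect of x. Comparing within levels of z is one simple way to adjust, shown here only as an example.

```python
import random

def mean(values):
    return sum(values) / len(values)

def simulate(n=10000, true_effect=1.0, seed=208):
    """Generate observational data in which z affects both x and y."""
    rng = random.Random(seed)
    data = []
    for _ in range(n):
        z = 1 if rng.random() < 0.5 else 0      # the omitted causal factor
        p_x = 0.8 if z == 1 else 0.2            # z makes x = 1 more likely
        x = 1 if rng.random() < p_x else 0      # x is not randomly assigned
        y = true_effect * x + 2.0 * z + rng.gauss(0.0, 1.0)
        data.append((x, z, y))
    return data

def difference(data):
    """Mean of y where x = 1 minus mean of y where x = 0."""
    with_x = [y for x, _, y in data if x == 1]
    without_x = [y for x, _, y in data if x == 0]
    return mean(with_x) - mean(without_x)

data = simulate()
naive = difference(data)  # ignores z entirely
# Compute the difference separately within each value of z, then average.
adjusted = mean([difference([row for row in data if row[1] == z]) for z in (0, 1)])
print(f"naive: {naive:.2f}, adjusted for z: {adjusted:.2f}")
```

Under these assumptions the naive difference lands well above the true effect of 1.0, while the within-z comparison comes close to it. In real observational data the analyst may not know that such a factor exists, which is the difficulty described above.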

2.  Individual & Aggregate Units

In the social sciences many studies are based either on individual data (where the elements are people) or on aggregate units each of which represents a group of individuals (such as an organization, country, electoral district, or other administrative unit).  Common differences between these sources of data are




Last modified 21 Aug 2002