SPSS Trainer Tip: SPSS Data Validation Module™

Instructor profile

Name: Steve Fink
Home office: SPSS Arlington, VA

About Steve: Steve has worked for SPSS as an Education Consultant since December 2001. He holds a bachelor's degree from George Washington University and received his master's degree from the University of Connecticut. In Steve's spare time, he enjoys racquetball, tennis, reading, and spending time with his family and friends.

Using the new SPSS Data Validation Module

Cleaning (or scrubbing) data is one of the most important steps in data analysis, but it is also one of the most neglected. Data cleaning can be tedious and time-consuming, and in some cases, the analyst may not even be aware that it should be done. Proper data cleaning, however, should be the first step in any analysis process, to ensure accurate results.

The new SPSS Data Validation module streamlines the data cleaning process by letting you define rules that perform a variety of data checks. For example, you can specify single-variable rules, such as range checks, and cross-variable rules (e.g., flagging "pregnant males"), and then store these rules in an SPSS data dictionary to reuse in future applications.
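To make the distinction between the two kinds of rules concrete, here is a minimal sketch in plain Python. It is not SPSS syntax and does not reproduce how the module works internally; the variable names (`id`, `age`, `sex`, `pregnant`), the age range, and the sample records are all hypothetical and chosen only for illustration.

```python
from dataclasses import dataclass
from typing import Any, Callable, Dict, List

Record = Dict[str, Any]


@dataclass
class Rule:
    """A named check; is_violation returns True when a record breaks the rule."""
    name: str
    is_violation: Callable[[Record], bool]


def range_rule(variable: str, low: float, high: float) -> Rule:
    """Single-variable rule: the value must lie within [low, high]."""
    def check(rec: Record) -> bool:
        value = rec[variable]
        # Missing values (None) are not treated as range violations here.
        return value is not None and not (low <= value <= high)
    return Rule(f"{variable} outside {low}-{high}", check)


# Hypothetical rule set: one single-variable rule, one cross-variable rule.
RULES: List[Rule] = [
    range_rule("age", 18, 99),
    Rule("pregnant male", lambda r: r["sex"] == "male" and r["pregnant"]),
]


def validate(records: List[Record], rules: List[Rule]) -> Dict[Any, List[str]]:
    """Map each offending case ID to the names of the rules it violates."""
    violations: Dict[Any, List[str]] = {}
    for rec in records:
        broken = [rule.name for rule in rules if rule.is_violation(rec)]
        if broken:
            violations[rec["id"]] = broken
    return violations


# Made-up sample data, for illustration only.
records = [
    {"id": 1, "age": 34, "sex": "female", "pregnant": True},
    {"id": 2, "age": 150, "sex": "male", "pregnant": False},
    {"id": 3, "age": 41, "sex": "male", "pregnant": True},
]

print(validate(records, RULES))
```

Keeping the rules in a list separate from the data mirrors the idea of storing checks for reuse: the same `RULES` list could be applied to any later file that uses the same variable names.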

Let's say, for example, that you want to identify respondents who report watching more than 10 hours of television per day. For this example, values that high are treated as outliers.



Figure 1: The Variables screen within the Validate Data dialog box



Figure 2: The Single-Variable Rules screen within the Validate Data dialog box



Figure 3: Select ranges for valid values in the Validate Data dialog box

The table in Figure 4 below lists the 12 respondents who reported watching more than 10 hours of television per day, along with the ID needed to locate and examine each record.



Figure 4: The output from the data validation procedure
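For readers who want to see the logic behind this kind of report outside the dialog box, the following is a rough sketch in plain Python of a range rule that flags cases above 10 hours and lists their IDs. The field names `id` and `tvhours` and the five sample records are assumptions made for illustration; they are not the data behind Figure 4.

```python
from typing import Any, Dict, List, Optional, Tuple

MAX_TV_HOURS = 10  # upper bound of the valid range used in this example


def flag_tv_outliers(
    records: List[Dict[str, Any]], limit: float = MAX_TV_HOURS
) -> List[Tuple[Any, float]]:
    """Return (case ID, hours) pairs for records whose value exceeds the limit."""
    flagged = []
    for rec in records:
        hours: Optional[float] = rec["tvhours"]
        # Skip missing values; only values strictly above the limit are flagged.
        if hours is not None and hours > limit:
            flagged.append((rec["id"], hours))
    return flagged


# Hypothetical sample records, not the survey data from the article.
sample = [
    {"id": 101, "tvhours": 2},
    {"id": 102, "tvhours": 14},
    {"id": 103, "tvhours": None},
    {"id": 104, "tvhours": 10},
    {"id": 105, "tvhours": 24},
]

for case_id, hours in flag_tv_outliers(sample):
    print(f"case {case_id}: {hours} hours/day")
```

Note that a value of exactly 10 passes the check, which matches the "more than 10 hours" wording above; whether the boundary value should count as valid is a choice to make when the rule is defined.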

This is just one example of how the SPSS Data Validation module can help you identify potential errors in your data before you run statistical analyses.
