Since the Denver Developmental Screening Test was first published 23 years ago, it has been used worldwide and restandardized in more than a dozen countries. Concerns raised through the years by test users about specific items and features of the test, coupled with a need for more current norms, have prompted a major revision and restandardization. For the revision, 336 potential items were administered to more than 2000 children. Each item was administered an average of 540 times. Using regression analysis, composite norms for the total sample and norms for subgroups (based on gender, ethnicity, maternal education, and place of residence) were used to determine new age norms. The final selection of the 125 Denver II items was based on the following criteria: ease of administration and scoring, item appeal to child and examiner, item test-retest and inter-rater reliability, minimal 'refusal' scores, minimal 'no opportunity' scores, minimal subgroup differences, and a smooth step-like progression of ages at which 90% of children could perform the tasks.