Back to ABA Resources

Behavioral Treatment and Normal Educational and Intellectual Functioning in Young Autistic Children

O. Ivar Lovaas

University of California, Los Angeles

Autism is a serious psychological disorder with onset in early childhood. Autistic children show minimal emotional attachment, absent or abnormal speech, retarded IQ, ritualistic behaviors, aggression, and self-injury. The prognosis is very poor, and medical therapies have not proven effective. This article reports the results of behavior modification treatment for two groups of similarly constituted young autistic children. Follow-up data from an intensive, long-term experimental treatment group (n-19) showed that 47% achieved normal intellectual and education functioning, with normal-range IQ scores and successful first grade performance in public schools. Another 40% were mildly retarded and assigned to special classes for the language delayed, and only 10% were profoundly retarded and assigned to classes for the autistic/retarded. In contrast, only 2% of the control-group children (n-40) achieved normal educational and intellectual functioning: 45% were mildly retarded and placed in language-delayed classes, and 53% were severely retarded and placed in autistic/retarded classes.

Kanner (1943) defined autistic children as children who exhibit (a) serious failure to develop relationships with other people before 30 months of age, (b) problems in development of normal language, (c) ritualistic and obsessional behaviors (‘insistence on sameness"), and (d) potential for normal intelligence. A more complete behavioral definition has been provided elsewhere (Lovaas, Koegel, Simmons & Long, 1973). The neology of autism is not known, and the outcome is very poor. In a follow-up study on young autistic children, Rutter (1970) reported that only 1.5% of his group (n-63) had achieved normal functioning. About 35% showed fair or good adjustment, usually required some degree of supervision, experienced some difficulties with people, had no personal friends, and showed minor oddities of behavior. The majority (more than 60%) remained severely handicapped and were living in hospitals for mentally retarded or psychotic individuals or in other protective settings. Initial IQ scores appeared stable over time. Other studies (Brown, 1969; DeMyer et al., 1973; Eisenberg, 1956; Freeman, Ritvo, Needleman, & Yokota, 1985; Havelkova, 1968) report similar data. Higher scores on IQ tests, communicative speech, and appropriate play are considered to be prognostic of better outcome (Lotter, 1967).

Medically and psychologically oriented therapies have not proven effective in altering outcome (DeMyer, Hingtgen, & Jackson, 1981). No abnormal environment etiology has been identified within the children’s families (Lotter, 1967). At present, the most promising treatment for autistic persons is behavior modification as derived from modern learning theory (DeMyer, et. al., 1981). Empirical results from behavioral intervention with autistic children have been both positive and negative. On the positive side, behavioral treatment can build complex behaviors, such as language, and can help to suppress pathological behaviors, such as aggression and self-stimulatory behavior. Clients vary widely in the amount of gains obtained but show treatment gains in proportion to the time devoted to treatment. On the negative side, treatment gains have been specific to the particular environment in which the client was treated, substantial relapse has been observed at follow-up, and no client has been reported as recovered (Lovaas et. al., 1973).

The present article reports a behavioral-intervention project (begun in 1970) that sought to maximize behavioral treatment gains by treating autistic children during most of their waking hours for many years. Treatment included all significant persons in all significant environments. Furthermore, the project focused on very young autistic children (below the age of 4 years) because it was assumed that younger children would be less likely to discriminate between environments and therefore more likely to generalize and to maintain their treatment gains. Finally, it was assumed that it would be easier to successfully mainstream a very young autistic child into preschool than it would be to mainstream an older autistic child into primary school.

It may be helpful to hypothesize an outcome of the present study from a developmental or learning point of view. One may assume that normal children learn from their everyday environments most of their waking hours. Autistic children, conversely, do not learn from similar environments. We hypothesized that construction of a special, intense, and comprehensive learning environment for very young autistic children would allow some of them to catch up with their normal peers by first grade.



Subjects were enrolled for treatment if they met three criteria: (a) independent diagnosis of autism from a medical doctor or a licensed Ph.D. psychologist, (b) chronological age (CA) less than 40 months if mute and less than 46 months or more at a CA of 30 months. The last criterion excluded 15% of the referrals.

The clinical diagnosis of autism emphasized emotional detachment, extreme interpersonal isolation, little if any toy or peer play, language disturbance (mutism or echolalia), excessive rituals, and onset in infancy. The diagnosis was based on a structured psychiatric interview with parents, on observations of the child’s free-play behaviors, on psychological testing of intelligence, and on access to pediatric examinations. Over the 15 years of the project, the exact wording of the diagnosis changed slightly in compliance with changes in the Diagnostic and Stanstical Manual of Mental Disorders (DSM-III; American Psychiatric Association, 1980). During the last years, the diagnosis was made in compliance with DSM-III criteria (p.87). In almost all cases, the diagnosis of autism had been made prior to family contact with the project. Except for one case each in the experimental group and Control Group 1. All cases were diagnosed by staff of the Department of Child Psychiatry, University of California, Los Angeles (UCLA) School of Medicine. Members of that staff have contributed to the writing of the DSM-III and to the diagnosis of autism adopted by the National Society for Children and Adults with Autism. If the diagnosis of autism was not made, the case was referred elsewhere. In other words, the project did not select its cases. More than 90% of the subjects received two or more independent diagnosis, and agreement on the diagnosis of autism was 100%. Similarly high agreement was not reached for subjects who scored within the profoundly retarded range on intellectual functioning (PMA<11 months); these subjects were excluded from the study.

Treatment Conditions

Subjects were assigned to one of the two groups: an intensive-treatment experimental group (n-19) that received more than 40 hours of one-to-one treatment per week, or the minimal-treatment Control Group I (n-19) that received 10 hours or less of one-to-one treatment per week. Control Group I was used to gain further information about the rate of spontaneous improvement in very young autistic children, especially those selected by the same agency that provided the diagnostic work-up for the intensive-treatment experimental group. Both treatment groups received treatment for 2 or more years. Strict random assignment (e.g., based on a coin flip) to these groups could not be used due to parent protest and ethical considerations. Instead, subjects were assigned to the experimental group unless there was an insufficient number of staff members available to render treatment (an assessment made prior to contact with the family). Two subjects were assigned to Control Group I because they lived further away from UCLA than a 1-hr. drive, which made sufficient staffing unavailable to those clients. Because fluctuations in staff availability were not associated in any way with client characteristics, it was assumed that this assignment would produce unbiased groups. A large number of pretreatment measures were collected to test this assumption. Subjects did not change group assignment. Except two families who left the experimental group within the first 6 months (this group began with 21 subjects), all families stayed with their groups from beginning to end.


Pretreatment mental age (MA) scores were based on the following scales (in order of the frequency of their use): the Bayley Scales of Infant Development (Bayley, 1955), the Cartell Infant Intelligence Scale (Cattell, 1960), the Stanford-Binet Intelligence Scale (Thorndike, 1972), and the Gesell Infant Development Scale (Gesell, 1949). The first three scales were administered to 90% of the subjects, and relative usage of these scales was similar in each group. Testing was carried out by graduate students in psychology who worked under supervision of clinical psychologists at UCLA or licensed Ph.D. psychologist at other agencies. The examiner chose the test that would best accommodate each subject’s developmental level and this decision was reached independently of the project staff. Five subjects were judged to be untestable (3 in the experimental group and 2 in the Control Group I). Instead, the Vineland Social Maturity Scale (Doll, 1953) was used to estimate their MA’s (with the mother as informant). To adjust for variations in MA scores as a function of the subject’s CA at the time of test administration, PMA scores were calculated for a CA at 30 months (MA/CA x 30).

Behavioral observations were based on videotape recordings of the subject’s free-play behavior in a playroom equipped with several simple early-childhood toys. These videotaped recordings were subsequently scored for the amount of (a) self-stimulatory behaviors, defined as prolonged ritualistic, repetitive, and stereotyped behavior such as body-rocking, prolonged gazing at lights, excessive hand-flapping, twirling the body as a top, spinning or lining of objects, and licking or smelling of objects or wall surfaces; (b) appropriate play behaviors, defined as those limiting to the use of toys in the playroom to their intended purposes, such as pushing the truck on the floor, pushing buttons on the toy cash register, putting a record on the record player, and banging with the toy hammer, and (c) recognizable words, defined to include any recognizable word, independent of whether the subject used it in a meaningful context or for communicative purposes. One observer who was naïve about subjects’ group placement scored all tapes after being trained to agree with two experienced observers (using different training tapes from similar subjects). Interobserver reliability was scored on 20% of the tapes (randomly selected) and was computed for each category of behavior for each subject by dividing the sum of observer agreements by the sum of agreements and disagreements. These scores were then submitted and averaged across subjects. The mean agreement (based both on occurrences and non-occurrences) was 91% for self-stimulatory behavior, 85% for appropriate play behavior, and 100% for recognizable words. A more detailed description of these behavioral recordings has been provided elsewhere (Lovaas, 1973).

A 1-hr. parent interview about the subjects’ earlier history provided some diagnostic and descriptive information. Subjects received a score of 1 for each of the following variables parents reported: no recognizable words; no toy play (failed to use toys for their intended functions); lack of emotional attachment (failed to respond to parents’ affection); apparent deficit (parents had suspected their child to be blind or deaf because the child exhibited no or minimal eye contact and showed an unusually high pain threshold); no peer play (subject did not show interactive play with peers); self-stimulatory behavior, tantrums (aggression toward family members or self); and no toilet training. These 8 measures from parents’ intake interviews were summed to provide a sum pathology score. The intake interview also provided information about abnormal speech (0-normal and meaningful language, however limited; 1-echolalic language used meaningfully (e.g., to express needs); 2-echolalic; and 3-mute); age of walking; number of siblings in the family; socioeconomic status of the father; sex; and neurological examinations (including EEG’s and CAT scans) that resulted in findings of pathology. Finally, CA at first diagnosis and at the beginning of the present treatment were recorded. This yielded a total of 20 pretreatment measures, 8 of which were collapsed into 1 measure (sum pathology).

A brief clinical description of the experimental group at intake follows identical to that for Control Group I): Only 2 of the 19 subjects obtained scores within the normal range of intellectual functioning; 7 scored in the moderately retarded range, and 10 scored in the severely retarded range. No subject evidenced pretend or imaginary play, only 2 evidenced complex (several different or heterogeneous behaviors that together formed one activity) play, and the remaining subjects showed simple (the same elementary but appropriate response made repeatedly) play. One subject showed minimal appropriate speech, 7 were echolalic, and 11 were mute. According to the literature that describes the developmental delays of autistic children in general, the autistic subjects in the present study constituted an average (or below average) sample of such children.

Post-treatment measures were recorded as follows: Between the ages of 6 and 7 years (when a subject would ordinarily have completed first grade), information about the subjects’ first-grade placement was sought and validated; about the same time, and IQ was obtained. Testing was carried out by examiners who were naïve about the subjects’ group placement . Different scales were administered to accommodate different developmental levels. For example, a subject with regular educational placement received a Wechsler Intelligence Scale for Children-Revised (WISC-R: Wechsler, 1974) or a Stanford-Binet Intelligence Scale (Thorndike, 1972), whereas a subject in an autistic/retarded class received a nonverbal test like the Merrill-Palmer Pre-School Performance Test (Stutsman, 1948). In all instances of subjects having achieved a normal IQ score, the testing was eventually replicated by other examiners. The scales (in order of the frequency of usage) included the WISC-R (Wechsler, 1974), the Stanford-Binet (Thorndike, 1972), the Peabody Picture Vocabulary Test (Dunn, 1981), the Wechsler Pre-School Scale (Wechsler, 1967), the Bayley Scales of Infant Development (Bayley, 1955), the Cattell Infant Intelligence Scale (Cattell, 1960), and the Leiter International Performance Scale (Leiter, 1959). Subjects received a score of 3 for normal functioning if they received a score on the WISC-R or Stanford-Biner in the normal range, completed first grade in a normal class in a school for normal children, and were advanced to the second grade by the teacher. Subjects received a score of 2 if they were placed in first-grade in a smaller aphasia (language delayed, language handicapped, or learning disabled) class. Placement in the aphasia class implied a higher level of functioning than placement in classes for the autistic/retarded, but the diagnosis of autism was almost always retained. A score of 1 was given if the first-grade placement was in a class for the autistic/retarded and if the child’s IQ score fell within the severely retarded range.

Treatment Procedure

Each subject in the experimental group was assigned several well trained student therapists who worked (part-time) with the subject in the subject’s home, school, and community for an average of 40 hrs. per week for 2 or more years. The parents worked as part of the treatment team throughout the intervention; they were extensively trained in the treatment procedures so that treatment could take place for almost all of the subjects’ waking hours, 365 days a year. A detailed presentation of the treatment procedure has been presented in a teaching manual (Lovaas et. a., 1980). The conceptual basis of the treatment was reinforcement (operant) theory; treatment relied heavily on discrimination-learning data and methods. Various behavioral deficiencies were targeted, and separate programs were designed to accelerate development for each behavior. High rates of aggressive and self-stimulatory behaviors were reduced by being ignored; by the use of time-out; by the shaping of alternate, more socially acceptable forms of behavior, and (as a last resort) by delivery of a loud "no" or a slap on the thigh contingent upon the presence of the undesirable behavior. Contingent physical aversives were not used in the control group because inadequate staffing in the group did not allow for adequate teaching of alternate, socially appropriate behaviors.

During the first year, treatment goals consisted of reducing self-stimulatory and aggressive behaviors, building compliance to elementary verbal requests, teaching imitation, establishing the beginners of appropriate toy play, and promoting the extension of the treatment into the family. The second year of treatment emphasized teaching expressive and early abstract language and interactive play with peers. Treatment was also extended into the community to teach children to function within a preschool group. The third year emphasized the teaching of appropriate and varied expression of emotions; preacademic tasks like reading, writing, and arithmetic; and observational learning (learning by observing other children learn). Subjects were enrolled only in those preschools where the teacher helped to carry out the treatment program. Considerable effort was exercised to mainstream subjects in a normal (average and public) preschool placement and to avoid initial placement in special education classes with detrimental effects of exposure to other autistic children. This occasionally entailed withholding the subject’s diagnosis of autism. If the child became known as autistic (or as "a very difficult child") during the first year in preschool, the child was encouraged to enroll in another, unfamiliar school (to start fresh). After preschool, placement in public education classes was determined by school personnel. All children who successfully completed normal kindergarten successfully completed first grade and subsequent normal grades. Children who were observed by experiencing educational and psychological problems received their school placement through Individualized Educational Plan (IEP) staffings (attended by educators and psychologists) in accordance with the Education For All Handicapped Children Act of 1975.

All subjects who went on to a normal first grade reduced in treatment from the 40 hour per week characteristic of the first 2 years to 10 hr or less per week during kindergarten. After a subject had started first grade, the project maintained a minimal (at most) consultant relationship with some families. In two cases, this consultation and the subsequent correction of the problem behaviors were judged to be essential in maintaining treatment gains. Subjects who did not recover in the experimental group received 40 hr per week of one-to-one treatment for more than 6 years (more than 14,000 hr of one-to-one treatment), with some improvement shown each year but with only 1 subject recovering.

Subjects in Control Group I received the same kind of treatment as those in the experimental group but with less intensity (less than 10 hr of one-on-one treatment per week) and without systematic physical aversives. In addition, these subects received a variety of treatments from other sources in the community such as those provided by small special education classes.

Control Group 2 consisted of 21 subjects from a larger group (N-62) of young autistic children studied by Freeman et. al. (1985). These subjects came from the same agency that diagnosed 95% of our other subjects. Data from Control Group 2 helped to guard against the possibility that subjects who had been referred to us for treatment constituted a subgroup with particularly favorable or unfavorable outcomes. To provide a group of subjects similar to those in the experimental group and Control Group I, subjects for Control Group 2 were selected if they were 42 months old or younger when first tested, had IQ scores above 40 at intake, and had follow-up testing at 6 years of age. These criteria resulted in the selection of 21 subjects. Subjects in Control Group 2 were tested like Control Group I subjects but were not treated by Young Autism Project described here.


Pre-Treatment Comparisons

Eight pretreatment variables from the experimental group and Control Group I (CA at first diagnosis, CA at onset of treatment, PMA, sum pathology, abnormal speech, self-stimulatory behavior, appropriate toy play, and recognizable words) were subjected to a multivariate analysis of variance (MANOVA; Brecht & Woodward, 1984). The means and F ratios from this analysis are presented in Table I. As can be seen, there were no significant differences between groups except for CA at onset of our treatment (p<.05). Control subjects were 6 months older on the average than experimental subjects (mean CA’s of 35 months vs. 41 months, respectively). These differences probably reflect the delay of control subjects in their initiation into the treatment project because of staff shortages; analysis will show differential CA’s are not significantly related to outcome.

Table 1

Means and F Ratios From Comparisons Between Groups on Intake Variables







Diagnosis CA

Treatment CA
















Control 1


















Note. CA=chronological age; PMA=prorated mental age. Experimental group, n=19; Control Group 1, n=19.

*df=1, 36.


To ascertain whether another test would reveal a statistically significant difference between the groups on toy play, descriptions of the subjects’ toy play (taken from the videotaped recordings) were typed on cards and rated for their developmental level by psychology students who were naïve about the purpose of the ratings and subject group assignment. The ratings were reliable among students (r-.79,p<.001), and an F test showed no significant difference in developmental levels of toy play between the two groups.

The respective means from the experimental group and Control Group I on the eight variables from the parent interview were .89 and .74 for sensory deficit, .63 and .42 for adult rejection, .58 and .47 for no recognizable words, .53 and .63 for no toy play, 1.0 and 1.0 for no peer play, .95 and .89 for body self-stimulation, .89 and .79 for tantrums, and .68 and .63 for no toilet training. The experimental group and Control Group I were also similar in onset of walking (6 vs. 8 early walkers; 1 vs. 2 late walkers), number of siblings in the family (1.26 in each group), socioeconomic status of the father (Level 49 vs. Level 54 according to 1950 Bureau of the Census standards), boys to girls (16:3 vs. 11:8); and number of subjects referred for neurological examinations (10 vs. 15) who showed signs of damage (0 vs. 1). The numbers of favorable versus unfavorable prognostic signs (directions of differences) on the pretreatment variables divide themselves equally between the groups. In short, the two groups appear to have been comparable at intake.

Follow-Up Data

Subjects’ PMA at intake, follow-up educational placement, and IQ scores were subjected to a MANOVA that contrasted the experimental group with Control Groups 1 and 2. At intake, there were no significant differences between the experimental group and the control groups. At follow-up, the experimental group was significantly higher than the control groups on educational placement (p<.001) and IQ (p<.01). The two control groups did not differ significantly at intake or at follow-up. In short, data from Control Group 2 replicate those from Control Group 1 and further validate the effectiveness of our experimental treatment program. Data are given in Table 2 that show the group means from pre-treatment PMA and post-treatment educational placement and IQ scores. The table also shows the F ratios and significance levels of the three group comparisons.

In descriptive terms, the 19-subject experimental group shows 9 children (47%) who successfully passed through normal first grade in a public school and obtained an average or above average score on IQ tests (M=107, range=94-120). Eight subjects (42%) passed first grade in aphasia classes and obtained a mean IQ score within the mildly retarded range of intellectual functioning (M=70, range=56-95). Only two children (10%) were placed in classes for autistic/retarded children and scored in the profoundly retarded range (IQ<.30).

There were substantial increases in the subjects’ levels of intellectual functioning after treatment. The experimental group subjects gained on the average of 30 IQ points over Control Group 1 subjects. Thus the number of subjects who scored within the normal range of intellectual functioning increased from 2 to 12, whereas the number of subjects within the moderate-to-severe range of intellectual retardation dropped from 10 to 3. As of 1986, the achievements of experimental group subjects have remained stable. Only 2 subjects have been reclassified; 1 subject (now 18 years old) was moved from an aphasia to a normal classroom after the sixth grade; 1 subject (now 13 years old) was moved from an aphasia to an autistic/retarded class placement.

Table 2

Means and F Ratios for Measures at

Pre-treatment and Post-treatment



Intake PMA








Control 1




Control 2




F ratios*

Experimental x Control 1




Experimental x Control 2




Control 1 x Control 2




Note. PMA = prorated mental age; EDP = educational placement.

Experimental group, n = 19; Control Group 1, n = 19; Control Group 2, n =21.

n =21.

*df = 1.56.

**p<.01. ***p<.001.


The MA and IQ scores of the two control groups remained virtually unchanged between intake and follow-up, consistent with findings from other studies (Freeman et. al., 1985; Rutter, 1970). The stability of the IQ scores of the young autistic children, as reported in the Freeman et. al. Study, is particularly relevant for the present study because it reduces the possibility of spontaneous recovery effects. In descriptive terms, the combined follow-up data from the control groups show that their subjects fared poorly: Only 1 subject (2%) achieved normal functioning as evidenced by normal first-grade placement and an IQ of 99 on the WISC-R; 18 subjects (45%) were in aphasia classes (mean IQ = 70, range = 30-101); and 21 subjects (53%) were in classes for the autistic/retarded (mean IQ = 40, range = 20-73). Table 3 provides a convenient descriptive summary of the main follow-up data from the three groups.

Table 3

Educational Placement and Mean and Range of IQ

at Follow-up


















Control Group 1













Control Group 2













Note. Dashes indicate no score or no entry.

*Both children received the same score.

One final control procedure subjected 4 subjects in the experimental group (Ackerman, 1980) and 4 subjects in Control Group 1 (McEachin & Leaf, 1984) to a treatment intervention in which one component of treatment (the loud "no" and occasional slap on the thigh contingent on self-stimulatory, aggressive, and non-compliant behavior) was at first withheld and then introduced experimentally. A within-subjects replication design was used across subjects, situations, and behaviors, with baseline observations varying from 3 weeks to 2 years after treatment had started (using contingent positive reinforcement only). During baseline, when the contingent-aversive component was absent, small and unstable reductions were observed in the large amount of inappropriate behaviors, and similar small and unstable increases were observed in appropriate behaviors such as play and language. These changes were insufficient to allow subjects’ to successful mainstreaming. Introduction of contingent aversives resulted in a sudden and stable reduction in the inappropriate behaviors and a sudden and stable increase in appropriate behaviors. This experimental intervention helps to establish two points: First, at least one component in the treatment program functioned to produce change, which helps to reduce the effect of placebo variables. Second, this treatment component affected both the experimental and control groups in a similar manner, supporting the assumption that the two groups contained similar subjects.

Analyses of variance were carried out on the eight pretreatment variables to determine which variables, if any, were significantly related to outcome (gauged by educational placement and IQ) in the experimental group and Control Group 1. Prorated mental age was significantly (p<.03) related to outcome in both groups, a finding that is consistent with reports from other investigators (DeMyer et. al., 1981). In addition, abnormal speech was significantly (p<.01) related to outcome in Control Group 1. Chronological age at onset of our treatment was not related to outcome, which is important because the two groups differed significantly on this variable intake (by 6 months). The failure of CA to relate to outcome may be based on the very young age of all subjects at onset of treatment.

Conceivably, a linear combination of pretreatment variables could have predicted outcome in the experimental group. Using a discriminant analysis (Ray, 1982) with the eight variables used in the first multivariate analysis, it was possible to predict perfectly the 9 subjects who did achieve normal functioning, and no subject was predicted to achieve this outcome who did not. In this analysis, PMA was the only variable that was significantly related to outcome. Finally, when this prediction equation was applied to Control Group 1 subjects, 8 were predicted to achieve normal functioning with intensive treatment; this further verifies the similarity between the experimental group and Control Group 1 prior to treatment.


This article reports the results of intensive behavioral treatment for young autistic children. Pretreatment measures revealed no significant differences between the intensively treated experimental group and the minimally treated control groups. At follow-up, experimental group subjects did significantly better than control group subjects. For example, 47% of the experimental group achieved normal intellectual and educational functioning in contrast to only 2% of the control group subjects.

The study incorporated certain methodological features designed to increase confidence in the effectiveness of the experimental group treatment:

1. Pretreatment differences between the experimental and control groups were minimized in four ways. Fist, the assignment of subjects was as random as was ethically possible. The assignment apparently produced unbiased grouts as evidenced by similar scores on the 20 pretreatment measures and by the prediction that an equal number of Control Group 1 and experimental group subjects would have achieved normal functioning had the former subjects received intensive treatment. Second, the experimental group was not biased by receiving subjects with a favorable diagnosis or biased IQ testing because both diagnosis and IQ tests were constant across groups. Third, the referral process did not favor the project cases because there were no significant differences between Control Groups 1 and 2 at intake or follow-up, even though Control Group 2 subjects were referred to others by the same agency. Fourth, subjects stayed within their groups, which preserved the original (unbiased) group assignment.

2. A favorable outcome have been caused not by the experimental treatment but by the attitudes and expectations of the staff. There are two findings that contradict this possibility of treatment agency (placebo) effects. First, because Control Group 2 had no contact with the project, and because there was no difference between Control Group 1 and 2 at follow-up, placebo effects appear implausible. Second, the within-subjects study showed that at least one treatment component contributed to the favorable outcome in the intensive treatment (experimental) group.

3. It may be argued that the treatment worked because the subjects were not truly autistic. This is counter-indicated by the reliability of the independent diagnosis and by the outcome data from the control groups, which are consistent with those reported by other investigators (Brown, 1969; DeMeyer et. al., 1973; Eisenberg, 1956; Freeman et. al., 1985; Havelkova, 1968; Rutter, 1970) for groups of young autistic children diagnosed by a variety of other agencies.

4. The spontaneous recovery rate among very young autistic children is unknown, and without a control group the favorable outcome in the experimental group could have been attributed to spontaneous recovery. However, the poor outcome in the similarly constituted Control Groups 1 and 2 would seem to eliminate spontaneous recovery as a contributing factor to the favorable outcome in the experimental group. The stability of the IQ test scores in the young autistic children examined by Freeman et. al. (1985) attests once again the chronically of autistic behaviors and serves to further negate the effects of spontaneous recovery.

5. Post-treatment data showed that the effects of treatment (a) were substantial and easily detected, (b) were apparent on comprehensive, objective, and socially meaningful variables (IQ and school placement), and (c) were consistent with a very large body of prior research on the application of learning theory to the treatment and education of developmentally disabled persons and with the very extensive (100-year-old) history of psychology laboratory work on learning processes in man and animals. In short, the favorable outcome reported for the intensive treatment experimental group can in all likelihood be attributed to treatment.

A number of measurement problems remain to be solved. For example, play, communicative speech, and IQ scores define the characteristics of autistic children and are considered predictors of outcome. Yet the measurement of these variables is no easy task. Consider play. First, play undoubtedly varies with the kinds of toys provided. Second, it is difficult to distinguish low levels of toy play (simple and repetitive play associated with young, normal children) from high levels of self-stimulatory behavior (a psychotic attribute associated with autistic children). Such problems introduce variability that needs immediate attention before research can proceed in a meaningful manner.

The term normal functioning has been used to describe children who successfully passed normal first grade and achieved an average IQ on the WISC-R. But questions can be asked about whether these children truly recovered from autism. On the one hand, educational placement is a particularly valuable measure of progress because it is sensitive to both educational accomplishments and social-emotional functions. Also, continual promotion from grade to grade is made not by one particular teacher but by several teachers. School personnel describe these children as indistinguishable from their normal friends. On the other hand, certain residual deficits may remain in the normal functioning group that cannot be detected by teachers and parents and can only be isolated on closer psychological assessment, particularly as these children grow older. Answers to such questions will soon be forthcoming in a more comprehensive follow-up (McEachin, 1987).

Several questions about treatment remain. It is unlikely that a therapist or investigator could replicate our treatment program for the experimental group without prior extensive theoretical and supervised practical experience in on-to-one behavioral treatment with developmentally disabled clients as described clients as described here and without demonstrated effectiveness in teaching complex behavioral repertoires as in imitative behavior and abstract language. In the within-subjects studies that were reported, contingent aversives were isolated as one significant variable. It is therefore unlikely that treatment effects could be replicated without this component. Many treatment variables are left unexplored, such as the effect of normal peers. Furthermore, the successful mainstreaming of a 2-4 year old into a normal preschool group is much easier than mainstreaming of an older autistic child into the primary grades. This last point underscores the importance of early intervention and places limits on the generalization of our data to older autistic children.

Historically, psychodynamic theory has maintained a strong influence on research and treatment with autistic children, offering some hope for recovery through experiential manipulations. By the mid-1960’s, an increasing number of studies reported that psychodynamic practitioners were unable to deliver on that promise (Rimland, 1964). One reaction to those failures was an emphasis on organic theories of autism that offered little or no hope for major improvements through psychological and educational interventions. In a comprehensive review of the research on autism, DeMyer et. al. (1981) concluded that "(in the past) psychotic children were believed to be potentially capable of normal functioning in virtually all areas of development…..during the decade of the 1970’s it was the rare investigator who even gave lip-service to such previously held notions…..Infantile autism is a type of developmental disorder accompanied by severe and, to a large extent, permanent intellectual/behavioral deficits" (p. 432).

The following points can now be made. First, at least two distinctively different groups emerged from the follow-up data in the experimental group. Perhaps this finding implies different etiologies. If so, future theories of autism will have to identify these groups of children. Second, on the basis of testing to date, the recovered children show no permanent intellectual or behavioral deficits and their language appears normal, contrary to the position that many have postulated (Rutter, 1974, Churchill, 1978) but consistent with Kanner’s (1943) position that autistic children possess potentially normal or superior intelligence. Third, at intake, all subjects evidenced deficiencies across a wide rage of behaviors, and during treatment they showed a broad improvement across all observed behaviors. The kind of (hypothesized) neural damage that mediates a particular kind of behavior, such as language (Rutter, 1974), is not consistent with these data.

Although serious problems remain for exactly defining autism or identifying its etiology, one encouraging conclusion can be stated. Given a group of children who show the kinds of behavioral deficits and excesses evident in our pretreatment measures, such children will continue to manifest similar severe psychological handicaps later in life unless subjected to intensive behavioral treatment that can indeed significantly after that outcome.

These data promise a major reduction in the emotional hardships of families with autistic children. The treatment procedures described here may also prove equally effective with other childhood disorders, such as childhood schizophrenia. Certain important, practical implications in these findings may also be noted. The treatment schedule of subjects who achieved normal functioning could be reduced from 40 hr per week to infrequent visits even after the first 2 years of treatment. The assignment of one full-time special education teacher for 2 years would cost an estimated $40,000 in contrast to the nearly $2 million incurred (in direct costs alone) by each client requiring life-long institutionalization.


Ackerman, A.B. (1980). The contribution of punishment to the treatment of preschool aged children. Unpublished doctoral dissertation. University of California, Los Angeles.

American Psychiatric Association. (1980). Diagnostic and statistical manual of mental disorders (3rd ed.). Washington, DC: Author.

Bayley, N. (1955). On the growth of intelligence. American Psychologist. 10. 805-818.

Brecht, M.L. & Woodward, J.A. (1984). GANOVA: A univariate/multivariate analysis of variance program computer. Educational and Psychological Measurement. 44. 169-173.

Brown, J. (1969). Adolescent development of children with infantile psychosis. Seminars in Psychiatry. 1. 79-89.

Cattell, P. (1960). The measurement of intelligence of infants and young children. New York: Psychological Corporation.

Chruchill, D. W. (1978). Language: The problem beyond conditioning. In M. Rutter & E. Schopler (Eds.). Autism: A reappraisal of concepts and treatment. (pp. 71-85). New York: Plenum.

DeMyer, M. K., Barton, S., DeMyer, W. E., Norton, J. A., Allen, J., & Steele, R. (1973). Prognosis in autism: A follow-up study. Journal of Autism and Children Schizophrenia. 3. 199-246.

DeMyer, M. K., Hingtgen, J. N. & Jackson, R. K. (1981). Infantile autism reviewed: A decade of research. Schizophrenia Bulletin. 7. 388-451.

Doll, E. A. (1953). The measurement of social competence. Minneapolis, MN, Minneapolis Educational Test Bureau.

Dunn, L. M. (1981). Peabody Picture Vocabulary Test. Circle River. ME American Guidance Service.

Education for All Handicapped Children Act of 1975. Washington, DC, Congressional Record.

Eisenburg, L. (1956). The autistic child in adolescence. American Journal of Psychiatry. 112. 607-612.

Freeman, B. J., Ritvo, E. R., Needleman, R., & Yokota, A. (1985). The stability of cognitive and linguistic parameters in autism: A 5-year study. Journal of the American Academy of Child Psychiatry. 24. 290-311.

Gesell, A. (1949). Gesell Development Schedules. New York: Psychological Corporation.

Havelkova, M. (1968). Follow-up study of 71 children diagnosed as psychotic in preschool age. American Journal of Orthopsychiatry, 18. 846-857.

Kanner, L. (1943). Autistic disturbances of affective contact. Nervous Child, 2. 217-250.

Leiter, R. G. (1959). Part 1 of the manual for the 1948 revision of the Leiter International Performance Scale: Evidence of the reliability and validity of the Leiter tests. Psychology Service Center Journal, 11. 1-72.

Lotter, V. (1967). Epidemiology of autistic conditions in young children: II. Some characteristics of the parents and children. Social Psychiatry, 1. 163-173.

Lovaas, O. I., Ackerman, A. B., Alexander, D., Firestone, P., Perkins, J., & Young, D. (1980). Teaching developmentally disabled children: The me book. Austin, TX: Pro-Ed.

Lovass, O.I., Koegel, R. L., Simmons, J. Q., & Long, J. (1973). Some generalization and follow-up measures on autistic children in behavior therapy. Journal of Applied Behavior Analysis, 6. 131-166.

McEachin, J. J. (1987). Outcome of autistic children receiving intensive behavioral treatment: Residual deficits. Unpublished behavior dissertation. University of California, Los Angeles.

McEachin, J. J., & Leaf, R. B. (1984, May). The ishment in motivation of autistic children. Paper presented at the convention of the Association for Behavioral Analysis, Nashville, TN.

Ray, A. A., (1982). Statistical Analysis System user’s guide: Statistics. 1982 edition. Cary, NC: SAS Institute.

Rimland, B. (1964). Infantile autism. New York: Appleton-Century-Crofts.

Rutter, M. (1970). Autistic children: Infancy to adulthood. Seminar in Psychiatry, 2. 435-450.

Rutter, M. (1974). The development of infantile autism. Psychological Medicine, 4. 147-163.

Stutsman, R. (1948). Guide for administering the Merrill-Palmer Scale of Mental Tests. New York: Harcourt & World.

Thorndike, R. L. (1972). Manual for Stanford-Binet Intelligence Scale. Boston: Houghton Mifflin.

Wechsler, D., (1967). Manual for the Wechsler Pre-School and Primary Scale of Intelligence. New York: Psychological Corporation.

Wechsler, D. (1974). Manual for the Wechsler Intelligence Scale for Children – Revised. New York: Psychological Corporation.

Received October 10, 1985

Revision received March 28, 1986



Back to ABA Resources

This document is, updated Monday, 15-Jan-2007 18:19:34 EST

Copy? right! 1998-2098 All content written by me may be redistributed provided (1) my text is not substantially altered and (2) my authorship is clearly attributed. Copyright otherwise remains with original authors. How to link to this site

This is a resource guide only, not therapeutic, diagnostic, medical, or legal advice. I am not endorsing any individual, organization, product, or service mentioned here, nor do my opinions represent their views. The information provided is designed to support, not replace, the relationship that exists between a patient/site visitor and his/her physician or other service providers. Site privacy practices