Richard Allington
Key
Idea
State University of New York-Albany, Albany, New York, USA
Board Member, International Reading
Association
Transcript from Critical Balances: Early Instruction for Lifelong Reading
DR. ALLINGTON: In one sense, part of what's going on across the country is a debate,
the debate about, in one sense, a philosophy of education and a philosophy
of teaching and learning and so on. And if you are interested, I'll just recommend an article in this month's
Teachers College Record by Dave Berliner in which he sort of tries to --
to analyze the different notions of what it means to learn and what it
means to read and what it means to teach and how different factions literally
talk right past each other. It's a powerful statement. But I'm going to talk more just briefly today about a little framework
that I -- that I have been trying to develop, and it has to do with the
notion of research-based claims. And I was pushed to do this in part by
this conference and also by Jay Goldman, who is the editor of the School
Administrator, had asked me to do a piece for the fall issue on research-based
claims in reading. And he gave me initially 750 words to do it in, only
he let me take 1,500, actually 1,481. That's without editing, however. He was concerned about it because he said suddenly, at least in the
last few years, there's been a ground swell in their advertising copy that's
coming into the School Administrator and into the booths at the AASA (American Association of School Administrators) conference
and so on, as well as in some of the presentations about research claims:
"This is research based," "This is based in scientific research,"
and "This is based in empirical research," and so on. And he
said, you know, it looks like almost everybody has invented the best way
to teach reading. Is there a way that you could perhaps give a plain language
summary for how we might sort through some of this? I
came up with four criteria that folks might use. (More
on FAQ6) The first one's called the convergence of evidence
criteria. (More on FAQ6) A series of studies by
one investigator or one research team does not a generalization make. But,
instead, when we look at research that's been done in multiple sites by
multiple investigators and we start noticing that in fact people are coming
up with very similar findings, particularly when they're using multiple
measures, I think we can argue there is a convergence of the evidence.
And I'm going to talk a little bit today about some of the areas of beginning
reading where I think there is a convergence and some where I think the
people would like to tell us there is a convergence but there's not. The second criteria I used was what I call the
quality of evidence criteria. (More on FAQ6) The
quality -- the primary criterion I proposed was the peer and publication
in one of the five journals, five journals that routinely produce or publish
rigorous, empirical, as well as I suppose nonempirical -- no -- rigorous
investigations of -- of reading research, literacy acquisition, and so
on. And those are Reading Research Quarterly, Journal of Literacy
Research, American Educational Research Journal -- DR. HOFFMAN: Journal of Ed Psych (Educational Psychology). DR. ALLINGTON: -- Journal of Ed Psych, thank you, and the Journal
of Educational Research. And I said I don't think that those are the
only five, but I can tell you right now as someone who has been working
at a university and working in two different federal research centers for
25 years, everybody is dying to try to get their work published there.
If it doesn't get published there, then we usually go to another journal,
which doesn't mean that I -- my point simply is that all journals are not
equal. I think what I find most amazing is the number of products for which
there are research claims in which there is no published research evidence
in any journal, but instead series of what are called site reports, evaluation
studies, unpublished dissertations, unpublished monographs. There's a reason
they're unpublished. The third criteria, the comparability criteria,
refers to comparability among individuals and school districts. (More
on FAQ6) What does the evidence suggest? What kind of implementation
did we have? What kind of demographics were in place? Perhaps even what
historical era was the study done in? Is a study from 1965 relevant in
1997? It might be, but let's look at the comparability, what kind of kids,
what kind of teachers, what kind of classrooms, what kind of communities,
what kind of support and so on. The final one is the one I call compellingness.
(More on FAQ6) And the compellingness criteria in one
sense, I suppose, is sort of a seat of the pants. It's the most qualitative
of all of the judgments, but I -- I put it in because in my -- very honestly,
as I read, I start asking myself, is this compelling? Even when I have
met the previous three criteria, is this compelling? Am I convinced? Let me talk now just a little bit about the research in reading and
beginning reading in particular. And some of the research that I think
has been ignored is what I am going to focus on. We have three other folks behind me, so there is no reason to try to
cover the whole research field. I would pick the stuff that interests me
most. I would suggest that one of the areas in which we have converging evidence
on an international scale has already been mentioned by Ms. Allen, and
that's that not only are the kids in Texas doing
pretty well, but so are the kids across the U.S. (More
on FAQ7) In fact, you know, the International Education Association's
latest study in 1992 ranked the U.S. fourth graders second in the world.
I mean, it's literally the only comparison, international comparison, in
which I think we can honestly say U.S. kids met world-class standards. This recent issue of Educational Researcher had an article by
Stedman and another response by Gerald Bracey discussing the fact that
that particular study is -- is largely unknown. That meant Bracey says
that he's asked audiences over the last two or three years to raise their
hand how many people have heard of this study. He gets one hand at about
every other meeting. I have to say that I have been using an excerpt this year out of the
local Albany paper that came out last June '96 that was a three-inch
column on page 6 that -- that reports the results of that study. It also
reports in one paragraph, the second paragraph, is a quote from (Education) Secretary
Riley that says this, of course, doesn't mean we're where we want to be
and then goes on to point out that parents need to do more, which is another
whole day we could spend. Since we have fewer parents with fewer hours
these days, when are they supposed to do more and what are they supposed
to do it with? But if you look at the work that the IEA reports that are available
through the publications office of the International Reading Association,
the report by Warwick Elley and the other by Neil Postelwait, you'll see
that there not only is achievement high in the U.S., particularly in the
elementary grades, but it points to a couple of other areas that I think
there is converging evidence also. One is the
importance of -- of sheer opportunity to read and write. (More
on FAQ2) I'll simply argue that the data from the IEA, the data from the National
Assessment of Educational Progress, I mean, enormously large-scale databases,
as well as the data from some sort of classic studies like Linehart, Sigman,
and Cooley back in the 1980s, sort of, again, the heyday of the old skills-based
diagnostics prescriptive model, that point to the incredible power or the
production function of engaging in reading and writing for improved reading
and writing achievement. I'll suggest also that there is an enormous convergence
of evidence. We look at the work of Mike Knapp more recently on teaching
for meaning in high poverty schools, his Teachers College Press work. That
was a large federal study that they completed about two years ago out of
Far West Labs. (Return to PI) A distinguishing characteristic, I mean literally the first one that
jumped out of the data, was the enormous amount of time that kids in these
high-achieving, high-poverty schools spent reading and writing, talking,
and thinking, as opposed to the kids in the low-achieving schools who spent
lots of time, as they did in the Linehart, Sigman, and Cooley study, being
drilled, doing dittos, and playing, you know, go-fish phonics games or
go-fish sight word games or hidden word search puzzles or vocabulary volcano
activities and so on. I mean, we can go back even further, you know, to the studies that Bert
Harris and Al Sewer and others did, and the same findings come out: Kids
need to read a lot, they need to write a lot. I think there is no arguing
that the evidence has converged to suggest that kids need enormous amounts
of time. (More on FAQ2) I think that the evidence suggests that our middle school and high school
kids aren't doing as well internationally or even nationally as the elementary
school kids. The first thing I'd point to would be to ask how much they
read, because it looks from the national assessment data that the high
school kids are reading half as much as the fourth graders and writing
half as much, which is an oddity in the sense that as the kids get more
and more skilled at things, we seem to expect less and less. It seems to
me the higher the grade level the more likely we are to find a teacher
who thinks that they don't have the time to waste by having kids read or
write. One of the things that I will argue, there also
is, I think, a fair amount of converging evidence that teachers do teach
decoding skills, they do teach phonics skills, and so on. (More
on FAQ7) We have this evidence in a variety of both recent surveys,
large scale, you know, the work that Mike Pressley and
Joan Rankin and Linda Yokoi reported in The Elementary School Journal last year, the
-- and more recently in the new book that McIntyre and Pressley have out
from Christopher Gordon (Return to PI)
on strategy instruction, the work that -- that Jim Baumann
and Hoffman and the crew from the Reading Research Center have done on
their large-scale survey of -- of how teachers teach, sort of a replication
of the first-grade studies from the 1960s which found that teachers were
teaching phonics in the 1960s. (Return to PI) I mean, in the Pressley study, you're talking about 95 percent of the
primary teachers reporting, "Yes, we teach phonics." If you look
at the recent, most recent Hoffman and McCarthy studies of schools here
in Texas, of the study from how teachers taught in the last year of using
the old basals and the first year of using the new basals, teachers didn't
change much. You know, teachers taught phonics, lots of them. Lots of them
taught phonics over and above the phonics that was in either basal series,
the older skills-based and the newer language- or literature-based series. In other words, we have, I think, a variety of evidence. And going back,
we could pull out the first-grade studies from the 1960s that showed that
teachers were teaching phonics. Now, I mean the question, I suppose, that there seem to be lots of assumptions
that nobody teaches phonics anymore. One of the points that was raised
at the American Educational Research on California was that, the working
assumption being presented by the speakers seemed to be that in 1987 California
adopted a new framework and adopted a new set of basals that were literature based,
and all teachers changed the way they taught and started immediately not
doing anything like they had done before. A speaker from Great Britain got up and asked the question to Dr. Hoenig,
"You know, does this -- it sounds like you're suggesting that these
teachers changed dramatically and that runs contrary to all of the evidence
on implementation in teacher change. Do you have any evidence?" And Dr. Hoenig sort of said, well, ya-da-da-da-da-da and then Dr. Treadway
said, well, ya-da-da-da-da-da, and then finally the question came, so the
short answer is "no." And they said basically, yeah, we don't
have any evidence on how teachers taught. No one studied how teachers taught
in California after the new adoption. In many respects, one of the points that Hoffman and McCarthy make in
their recent report is almost nobody has studied how reading is taught
in Texas. Almost no one has studied how basals or curriculum frameworks
influence the way teachers teach. There are lots of assertions out there on both sides that basals and
curriculum materials are a form of mind control that enslave teachers,
but the evidence doesn't support it. The evidence doesn't also -- also could
be used on the other end to suggest that picking a new basal, picking a
new framework, is going to have modest effects, if any, on how teachers
teach in Texas or anywhere else. There are a number of assumptions, and I would point
to the work of my mentor, Jerry Duffy, and particularly his most recent
paper in the -- in a book by Steve Stahl and crew from Georgia, David Hays,
called Instructional Models in Reading that's published by Earlbaum.
Duffy's paper is called "Powerful Models are Powerful Teachers."
(Return to PI) Jerry has now just retired from university life, and in this paper he
sort of reflects on what the last 25 years at the Institutes of Research
and Teaching and his work on teachers and strategy instruction and how
to help teachers learn to teach reading better and so on, and he
comes to basically the same conclusion -- that it's teachers that matter.
(More on FAQ5) I mean, that was the conclusion of the first-grade
studies. Curriculum didn't matter much, teachers matter;
that almost any kind of curriculum that was put in place worked for some
people and didn't work for others; (Back to
FAQ5) (Return to PI) that there were implementations
districtwide where the same -- Program A did wonderfully in this district
and terribly in this district; that the difference between the best and
the worst implementations of -- of the same program were always miles wider
than the average differences between programs implemented across a wide
variety of sites. I think there is converging evidence that
kids need to develop strong independent decoding skills. I don't think
there is a convergence of evidence on how to best do that. (More
on FAQ1) And, in fact, one of the things that sort of troubles me is
that we're getting what I will argue is a false dichotomy sort of waved
in front of us. And the false dichotomy that's being waved in front of
us sort of contrasts or pits direct, systematic phonics versus incidental,
opportunistic phonics. And I'll suggest that the work that Mike Pressley has been doing, the
work that Duffy has done, the work that Hoffman and McCarthy have been
doing, and a variety of other folks, including the study that Mike Pressley
and I are now running with sites here in Texas, sites in California, sites
in Wisconsin, New York, and New Jersey, observing teachers, first-grade
teachers in classrooms suggests that the very best
teachers, the ones that have been nominated as exemplary, almost always
teach in a way that I would call planful, direct, and opportunistic.
(Back to FAQ2) If we look at the Pressley survey, you know, less than a quarter of
the teachers report using a commercial decoding curriculum, of these exemplary
teachers. Almost all of them report teaching, you know, phonics and decoding
strategies every day, but they teach it in the course of reading and writing
activity. We see this as an argument between decontextualized and contextualized,
you know, teaching of decoding. And often the direct and systematic is
sort of argued as decontextualized. I think the difference also that I would argue is that phonics as a
strategy versus phonics as knowledge. I mean, a fundamental difference
in some of the data that's being put out is "Does this kid use the
phonics knowledge that he has effectively in an integrated strategy, you
know, to produce meaning from text?" I'll finally argue that perhaps what we really see is the difference
between novice and expert teachers. In other words, we see teachers who
are very expert about kids, about the first grade, about teaching reading
and writing, and they invariably teach quite differently than teachers
that aren't very expert. And teachers that aren't very expert may in fact
find themselves enslaved by materials. They may find themselves doing decontextualized,
unsystematic kinds of instruction. But it would be an argument, I think, from
the weight of the evidence, the converging evidence, that we need to invest
more in teacher expertise, not necessarily more in curriculum materials
or frameworks. (Back to FAQ8) We start
thinking about the notion of -- of what it means to -- to be an effective
first-grade teacher, effective at the beginning. I can say I think there is convergence that suggests,
the data that suggests, phonemic awareness and developing it in kids is
important. I don't think there is a shred of evidence that we have any
convergence of evidence how that might best be done. (More
on FAQ1) We've got data that suggests that you can do it at four and
five by playing a lot of word games with -- with small groups of kids.
We've got some data that suggests you could do whole-class stuff. You've
got some data that suggests one-to-one is pretty powerful, some that suggests
that doing it with drill and Elkonin boxes, and others suggest doing it
with invented spelling and writing activities. In other words, I think that the conclusion one can draw from the research
is that there is no adequate research base, no converging evidence that
suggests that -- that we know how best to teach phonemic awareness. I'd suggest that the convergence of evidence also
says about 80 percent of the kids develop it (phonemic awareness) by the
middle of first grade right now in the classrooms we've got, taught by
teachers who have never heard the word. (Back
to FAQ7) Is the best way to solve the problem of the other 20 percent or 15 percent,
depending on -- on which study you look at going back into the '70s, is
the best way to do that to implement and change kindergarten and first-grade instruction systemwide statewide, or is it to try to be a little
bit more, you know, notion of a surgical strip and target the kids who
seem to have difficulty in the classroom context? I'll simply suggest that we don't have evidence to suggest which way
is the most powerful, most effective. We start looking at how to intervene
to teach kids effective decoding skills, we don't have a lot of convergence.
We start looking at the -- the evidence from a variety of folks. I'll just leave you with a couple of quotes here. The first one is from
Jeanne Chall:
I'll suggest that even with the lack of agreement
on how to go about teaching phonics that what we do know is that traditional
phonics programs don't represent an advance. (Back
to FAQ3) From Marilyn Adam's book, she argues simply, "The basic
phonics curriculum is inherently intractable, slow, inefficient, and worse,
except for students who essentially know how to read before it's begun,
it's likely to be ineffective." In other words, she's arguing here that filling in short vowel dittos
doesn't teach kids to read, that the only kids who can do it are the ones
who can already read, and they don't need it -- and the ones who can't do
it can't learn from it. Pat Cunningham, in her review, her review for the National Reading Conference
research session on phonics, which is reprinted in the back of her book
Phonics They Use, concludes at the very end, in the last paragraph,
"The kinds of phonics instruction we need and which we should advocate
is not the old phonics. It's not rules and jargon in a worksheet."
And she makes a strong case using her work and the work of Linnae Ehri
and Irene Gaskins and others for a decoding- by-analogy approach not a
decoding-by-sounding in the sense of a letter by letter left to right sounding
activity. What I am going to suggest is that the converging
evidence suggests that it's important that kids do in fact learn to decode.
It's important if they're going to learn to decode that they learn about
phonemic awareness, but what we don't have, we don't have data that -- that
provides us with much in the way of saying this is exactly how we should
do it. (Back to FAQ1) The question we were asked to talk to is: "What should policymakers
pay attention to?" And I'm going to close this off with one last overhead
here. And I decided that, you know, policymakers ask questions like this:
"Will smaller class size produce higher achievement?" And the answer they
want is "yes, no." Researchers, as Ernie House has pointed out, talk this way. Researchers
say, well, "If smaller class sizes are met in a rationale manner, and the teachers act upon the reduced class size to deliver more
personalized instruction, then you could expect higher
achievement, but. . ." And in one sense that's part of the problem we're faced with, that people
are trying to take research in which "if," "and," "then,"
"but's" are the only way we can talk about it and turn it into
"yes/no's." We can't turn a lot of the research into "yes/no's." I'll simply argue that the NICHD studies, which have gotten a lot of
press recently, simply have exaggerated claims for what they found, exaggerated
claims for the kinds of effects and for what should be done in terms of
American reading instruction. These exaggerations are just overwhelmingly
obvious when you sit down and read the studies. They have not independently
manipulated important variables like decodable texts. They have not independently
manipulated variables like direct and systematic phonics instruction and
so on. I've got a new pin and I'm going to start marketing them. I don't have
them today, but you can order them. Connie Juel and Diane Roper Schneider's study from the 1985 Reading
Research Quarterly has often been given to me as a reference for the
decodable text. Quoting that article, "The interpretation of the results
of this study do not constitute advocacy of any one specific approach to
beginning reading instruction. In particular, the results should not be
interpreted as supporting an explicit phonics method since induction of
learning of letter sounds," and so on. It seems when you go back and read the studies
people are much more careful when they write than sometimes when they talk.
Sometimes people are more careful, and the original author is more careful,
than people who are using the original author's work to advance a particular
form of evidence as evidence or a particular approach or recommendation.
(Back to FAQ6) We've got three other people that are going to come up here and talk
a little bit. I will simply suggest that we can start looking at converging
evidence, but there are radical claims being made today that can't be supported
by the research. And we can't -- I mean, if we just sit back and say, oh, well, the research
says, and we don't go out and do our homework, then -- then we're the ones
to blame. Thank you. Reading Online, www.readingonline.org
There's
a difference between effective, strategic use of phonics knowledge and
pronouncing nonsense syllables. (More on FAQ6) (Return
to PI)Imposing a set systematic phonics program on a teacher
who is knowledgeable about reading and keenly attuned to the strengths
and weaknesses of her pupils may very well destroy the beauty of what she
has already achieved. A beginning code emphasis program will not cure all
reading ills, it cannot guarantee all children will learn to read easily,
nor have the results of meaning emphasis programs been so disastrous that
all academic and emotional failure can be blamed on them, as some proponents
and publishers of new code emphasis programs proclaim.
This is 1968.
![]()
It's
called "Show Me the Research." And what I am going to suggest
is that everybody in this room may be better off to actually start reading
the research, to actually start reading it and asking, what does it tell
us?
Posted October 1997
© 1997-2000 International Reading Association, Inc. ISSN 1096-1232