Download Practical syntax - (`Dick`) Hudson

Survey
yes no Was this document useful for you?
   Thank you for your participation!

* Your assessment is very important for improving the workof artificial intelligence, which forms the content of this project

Document related concepts

Word-sense disambiguation wikipedia , lookup

Probabilistic context-free grammar wikipedia , lookup

Semantic holism wikipedia , lookup

Japanese grammar wikipedia , lookup

Stemming wikipedia , lookup

Construction grammar wikipedia , lookup

Lojban grammar wikipedia , lookup

Macedonian grammar wikipedia , lookup

Cognitive semantics wikipedia , lookup

Musical syntax wikipedia , lookup

Compound (linguistics) wikipedia , lookup

Chinese grammar wikipedia , lookup

Esperanto grammar wikipedia , lookup

Symbol grounding problem wikipedia , lookup

Preposition and postposition wikipedia , lookup

Scottish Gaelic grammar wikipedia , lookup

Polish grammar wikipedia , lookup

Focus (linguistics) wikipedia , lookup

Latin syntax wikipedia , lookup

Antisymmetry wikipedia , lookup

Agglutination wikipedia , lookup

Spanish grammar wikipedia , lookup

Lexical semantics wikipedia , lookup

Untranslatability wikipedia , lookup

Pipil grammar wikipedia , lookup

Morphology (linguistics) wikipedia , lookup

Dependency grammar wikipedia , lookup

Parsing wikipedia , lookup

Determiner phrase wikipedia , lookup

Malay grammar wikipedia , lookup

Junction Grammar wikipedia , lookup

Pleonasm wikipedia , lookup

Transformational grammar wikipedia , lookup

Transcript
Practical syntax
Richard Hudson, May 1995
1. Practical syntax?
Some people - perhaps most people who are likely to read this paper - have a practical interest
in sentence-structure (the subject-matter of syntax). Here are some practical reasons for such
an interest:
P When learning a foreign language you have to learn how to use words in phrases and
sentences; the more clearly you understand the patterns you are learning, the more
successful you will be.
P The same is true when you teach a foreign language; again, the clearer your
understanding of sentence structure, the more you will be able to help your students.
P If your responsibility is to help young children to develop their language skills, you
need to understand the skills they already have as well as those that are still beyond their
grasp. This applies as much to parents or grandparents as to professional first-language
teachers.
P Some unfortunate individuals lose, or never acquire, the normal skills; speech and
language therapists who try to treat them must understand exactly what it is that is
missing before they try to provide a remedy.
P When you communicate about anything which is at all complicated you have to select
sentence-structures so that they convey the intended message with the minimum of pain
to the hearer or reader. There is never a single way to communicate a given message there are always alternatives. (As I type this, I am writing two or three versions of most
sentence before I find one that I can't improve.) The better we understand sentencestructure, the better we can make these choices.
P In these high-tech days an increasing number of people are trying to program
computers to understand what humans write or say, and to produce human-like writing
and speech. This requires a profound understanding of sentence-structure, and this
understanding must be severely practical because it has to be translated into detailed
instructions for the computer.
Fortunately there is a large measure of agreement on how sentences are structured. For
example, consider this example sentence:
(1)
We drank the coffee.
We grammarians would all agree on a rough classification of the words (personal pronoun +
1
past-tense verb + definite article + singular common noun), using traditional terms which (in
some cases) date back to the Romans or even to the Ancient Greeks. We could also agree that
the coffee `hangs together', as a phrase, because both words help to identify the same bit of
meaning; and that we and the coffee have distinct `roles' or `grammatical functions' in the
sentence, called `subject' and `object'. Furthermore, the same kind of classification can be
applied to sentences in other languages, like the Turkish translation of sentence (1).
(2)
Kahveyi içtik.
In this case of course the details are different: only the common noun and the verb survive the
translation, as the meanings of we and the are expressed differently (by suffix-like elements)
in Turkish. But any grammarian would describe kahveyi as the object of içtik (or of the whole
sentence), and would accept that ik shows what the subject would have been if there had been
one.
I think the agreement extends to the following generalisations about sentence structure:
P Sentence structure is closely linked to meaning, but the same meaning can be
expressed by different sentence structures (especially when we consider different
languages).
P At the heart of sentence structure are the relations among words. These are of two
types: the grammatical functions (e.g. subject and object), and the links which bind
words into larger units - into phrases, into clauses and ultimately into sentences.
P Sentence structures are controlled by the rules (which constitute the grammar of the
language concerned). The rules control the ways in which words are combined and the
meanings which the various combinations express; they refer to the relations among
words, but also to the ways in which these are reflected in the order of the words
concerned and in the word-classes (e.g. verb, common noun, article, singular, past
tense) to which they belong.
However, alongside this agreement there is a great deal of disagreement, both about general
principles and about the details of how particular kinds of sentence should be analysed. This
disagreement is largely a product of the last 40 years, which have seen a great expansion in
research on sentence structure and a proliferation of alternative theories as well as of
grammars of particular languages1.
This expansion is a mixed blessing for someone in need of practical help. On the one
hand, I feel sure that we all understand at least some parts of sentence-structure better now
than we did 40 years ago; but on the other hand the choice is overwhelming. Unfortunately,
1
There are a dozen or so `important' rival theories, bearing names like Minimalism
(Chomsky 1993), Government-Binding Theory (Chomsky 1986), Head-driven Phrase
Structure Grammar (Pollard and Sag 1994), Lexical Functional Grammar (Bresnan 1982),
Categorial Grammar (Wood 1993), Cognitive Grammar (Langacker 1990), Systemic
Functional Grammar (Halliday 1985), Relational Grammar (Blake 1990) and Functional
Grammar (Siewierska 1991).
2
the choice matters; some approaches are much more helpful than others. Different users no
doubt have different needs, but in my opinion the following list covers the most important
things in a truly `practical' approach to syntax:
P Rather obviously, it must be as near as possible to `true', in the sense of being
compatible with all the relevant facts. It is no help to have a grammar which is simply
used, but also simply wrong.
P It must be as complicated as the facts require, but no more so. Simple sentences
should have simple analyses, and complicated sentences correspondingly complicated
analyses.
P It should offer a usable diagramming system. Even a sentence as simple as We drank
the coffee has a structure which is hard to describe in prose, but it should be easy to
present as a diagram. To be usable a diagramming system should be visually clear and
reasonably economical of space.
P It must be easy to learn cumulatively, so that each new item can be used as soon as it
is learned.
P The more comprehensive it is, the better. The ideal is a grammar which allows us to
handle every word in every sentence that we try to analyse. No such grammar exists, but
it is reasonable to look for a grammar which handles the vast majority of words - 99%
coverage is a realistic target.
The aim of this paper is to present a theory of sentence-structure which, in my opinion,
satisfies all these criteria as well as any others that are available, and better than some. It is
called Word Grammar and has been described in two books (Hudson 1984, 1990). For several
years I have been teaching it to undergraduates who learn enough in one term to be able to do
an almost complete syntactic analysis of any ordinary text in English. The analysis they learn
is as near to the truth as I have been able to make it (after 30 years of work in this area); it is
as simple or as complicated as the patterns being analysed; there is a simple diagramming
system; and they can apply it from the first week.
2. Word Grammar: an overview
We start with the Word Grammar (henceforth WG) analysis of our sample sentence:
3
Diagram 1
This diagram is just as simple as the structure it represents. The sentence contains four words,
so the diagram shows precisely four units (one per word) and their interrelations. These
relations are shown by the arrows, which point towards the word which bears the grammatical
function shown by the label (where `s' stands for `subject', `o' for `object' and `c' for
`complement'). The vertical arrow points at the one word which has no grammatical function,
drank. (The reason for it will become clear below.) The fact that the coffee is a phrase is
shown by the arrow which links the directly to coffee.
The simplicity of the diagram isn't the result of ignoring a lot of detail which is
uninteresting or unimportant. It represents a scientific claim that there really is nothing else to
say about the syntax of this sentence, a claim which could be defended against counterclaims. It may be wrong, of course, but I don't know at present of any reason for doubting it.
However, it is important to make clear exactly what I am claiming.
I would certainly agree that a complete analysis of every aspect of this sentence would
go far beyond the analysis shown. On the one hand we can say a great deal about its meaning
- who the word we refers to, the time reference of drank, the semantic relations between the
coffee and the drinking, and so on. On the other hand, we can also say something about the
internal structures of the words (described in terms of consonants, vowels and syllables). The
meaning involves a semantic structure, and the sounds are part of a phonological structure;
both these structures would be part of a complete linguistic analysis of the sentence, but not
part of a syntactic analysis (which is concerned, as I said at the outset, with sentence-structure
- how the words are combined with each other).
Nor am I claiming that the diagram shows the whole of the syntactic analysis. The
diagram contains three kinds of labels: labels for word-classes, for word-features, and for
grammatical functions; but some of these labels imply one or two other more general labels as
well. For example, We is labelled `pN', short for `pronoun'; but (according to WG, and most
other theories) pronouns are a kind of noun, and (of course) nouns are a kind of word, so `pN'
implies `N' (for `noun') and `word' as well. Since these extra labels are completely predictable
there is no need to include them in the diagram. I shall say more later about the grammaticalfunction labels.
A single example gives a `taste' of how WG works - what the diagrams look like, and
the kind of information they contain. As we consider further examples below I hope to show
that they are as complex as required, but no more so; and that they show everything that a
syntactic analysis should show. I can report that the system can be learned cumulatively, that
it is usable, and that it is reasonably comprehensive. My evidence for these claims is that my
undergraduate students do learn it cumulatively (adding a new part of the grammar each
week), that they do learn to apply it to ordinary bits of English (or other languages), and that a
4
term's teaching brings them to the stage where they can probably deal confidently with about
97% of the words in an average text which they themselves have chosen.
But what about truth? Nothing in this life is perfect, and the same is certainly true of
every theory of grammar, including WG. In syntax even some of the facts are unclear and in
such cases it is perhaps unwise to talk of truth; but there are many parts of syntax where the
facts are clear, and truth really is relevant. Anyone who builds a theory of syntax must
confront these facts honestly as a test of the theory, and reject (or at least modify) the theory
when it fails. I have spent decades doing precisely this, and the result is a good approximation
to the truth. Unfortunately the world is full of other linguists who can say precisely the same,
but who have come to very different conclusions from each other (and from me). The study of
syntax is a central part of the science of linguistics, and is notoriously frought with
controversy.
Some of the differences between theories are certainly not important, but equally
certainly others are; so any `consumer' of syntactic theories should treat any theory (including
WG) with caution. Behind every theory lies a large body of arguments, decisions and mindchanging which may have been debated in the technical journals and monographs but which
the consumer may not have time to explore. The result is that some parts of any analysis are
to some extent controversial.
Even our simple sentence contains a controversial point: the analysis of the. According
to Diagram 1, the object of drank is the, not coffee; whereas traditional analyses would
reverse this relationship. Linguists are divided on this question. Some would agree with me
(technically, they recognise `determiner phrases'), and others would disagree, believing that
the should be treated as part of a phrase whose main word is coffee, a `noun-phrase'. It isn't
just a matter of taste; there are relevant facts and arguments for and against. For example, in
English the belongs to a group of words called `determiners' which includes a, this, that,
some, any, his and some others. A word such as book needs a determiner (we cannot say **I
bought book, but any determiner before book makes the sentence grammatical); but most
determiners can be used without a following common noun (I bought this/that/some/any/his).
This suggests that the determiner is the obligatory part of a phrase consisting of a determiner
plus a common noun, so it should be the determiner, rather than the common noun, that
carries the external relations (e.g. the object link to drank). (Another controversial question is
how determiners fit into the system of word-classes; according to Diagram 1 they are a type
of pronoun, which makes them also a type of noun, but many linguists treat them as a
completely separate word-class. Once again there are relevant facts, and it is possible to
debate the issue.)
One advantage of taking the rather than coffee as the object of drank is to reduce the
differences between English and other languages. Take the Turkish translation of this
sentence, for example: Kahveyi içtik. The ending -i on kahveyi gives two important pieces of
information about this word: it is definite, and it is the object of the verb içtik. The
definiteness is conveyed in English by the determiner (in this case, the), but our analysis gives
the English determiner the object function as well. It has often been suggested that the wordparts of languages such as Turkish could easily be recognised as separate syntactic words,
which are combined into single words only for phonological purposes; so maybe kahveyi
could also be written kahvey + i (with `+' to show that the two words belong together in
phonology). There may be good arguments against this analysis, but if we accept it the WG
analysis might be as in Diagram 2.
5
Diagram 2
I have labelled -i simply `OD' (for `object definite'), since I have no reason for thinking it is a
pronoun (in contrast with English the). However classification is less important than the
`geometry' of the diagram. If we compare this diagram with the one for We drank the coffee,
the two main differences between English and Turkish stand out clearly: the object follows
the verb in English but precedes it in Turkish, and there has to be a separate subject in
English, but not in Turkish. But more subtly, in both languages the word for `coffee' is only
indirectly related to the verb, by a little word that marks definiteness and which in Turkish,
but not in English, can only carry one grammatical function, `object'.
3. Dependencies
The most important part of a syntactic analysis is what it tells us about the relationships
among the words - how they combine to make a meaningful sentence. (In contrast, the
classification of the words is much less less relevant to the meaning, and correspondingly less
important.) This information is carried in a WG analysis by the labelled arrows, so we need to
consider these a little more carefully. They are also probably the most controversial part of the
theory, which is another reason for explaining them.
What I have said so far about the
arrows is that each one points towards the word that has the grammatical function indicated
by the label; the arrow which carries the label `s' (for `subject') points at we, the one labelled
`o' (for `object') points at the (coffee), and so on. However, we can now take a step further:
each of these arrows shows a dependency between two words. The grammatical function is
always the function of the dependent in relation to the other word, so the arrow always points
towards the dependent word. In our example, both the subject and the object depend on the
verb, and (according to my analysis at any rate), the common noun (coffee, kahvey) depends
on the determiner or object marker (the, -i). The main idea behind this notion of dependency
is that each word depends on the word which links it to the rest of the sentence, and which
explains why it is used. The latter is often called the `head' of the former, so in we drank, we
is a dependent of drank and drank is the head of we. Table 1 lists some of the most common
types of dependency found in English, and Diagram 3 shows how a more complicated English
sentence is analysed.
6
Diagram 3
head word
dependent word
function
order
example
verb
noun
subject
<
we drank
noun
adjective
preadjunct
<
nice coffee
noun
noun
"
<
London hospital
adjective
adverb
"
<
really nice
verb
noun
object
>
drank it
verb
adverb
particle
>
fended off
preposition
noun
complement
>
in London
determiner
common noun
"
>
the coffee
any
preposition
adjunct
>
lived in (London)
Table 1
This kind of dependency-based analysis allows some very simple generalisations about
word order. From my very limited knowledge of Turkish I gather that most dependencies are
head-final - that is, the head word follows the word which depends on it. For example, verbs
follow their objects (as well as their subjects), postpositions are used instead of prepositions,
and so on. English word order tends to be the other way round, with heads preceding their
dependents (e.g. verbs precede their objects and we have prepositions instead of
postpositions); but this is only a tendency, as there are a significant number of dependencies
with head-final order. In Table 1 the first four dependencies are head-final, while the last five
are of the more typical head-initial type.
This kind of analysis is very controversial in modern linguistics because it treats phrases
as secondary by-products of the dependencies. In short, (with one major exception which I
shall discuss below) all the units of syntax are single words. For example, the analysis of
We drank the coffee recognises just four units, one per word, and does not recognise the
coffee as a separate unit which could be classified or bear a grammatical function; and
similarly for our second example in Diagram 3, where its London hospital closure plans is not
treated as a single unit, but as five separate words. Most modern theories of syntax put
`phrase structure' at the centre of syntax, and derive the relations between individual words
from their relationships to the phrases. In contrast, the WG analysis treats the word-word
7
relationships as basic, and leaves the larger units implicit. For example, the fact that London
depends on hospital means that these two words `belong together', in contrast with its and
London which are not related in this way (although they are physically next to each other in
our sentence). The reason for ignoring phrases in this way is quite simple: once the wordword dependencies have been analysed, the phrases are simply redundant. A phrase is just a
group of words which all depend, directly or indirectly, on one single word, so adding phraseunits to the analysis would add no extra information whatever. This is a testable claim on
which WG rests, and may (of course) be wrong; but so far I know of no good evidence to the
contrary. The exclusive focus on word-word relations has a very important practical
consequence: the whole system is much easier to learn and to diagram than one which uses
phrases as well as words.
Having said all this, however, the dependency analysis has to be supplemented by a very
important general principle in order to capture one of the most attractive insights of phrasebased syntax: phrases tend to be continuous. In general, if two words belong to the same
phrase they cannot be separated by another word which does not belong to that phrase. For
example, the off of fended off (in our earlier sentence) may stand either before or after the
object, but it cannot interrupt the object phrase (the phrase rooted in the object word an, i.e.
an onslaught on its London hospital closure plans). In our example off precedes the whole
phrase (fended off an onslaught ..); but the only other possible position for it is right at the
end (which is admittedly a very awkward choice, though permitted by the grammar). What we
cannot say (or write) is something like *fended an off onslaught .. or *fended an onslaught off
on its ... Similarly, the phrase of good students is possible, but *good of students is not
because the phrase good students is interrupted by of, which does not belong to it.
There are a number of different ways to import this insight into dependency analysis but
the way I prefer is to impose a very general principle, called the No-tangling Principle:
The No-tangling Principle
Every sentence has a dependency structure in which:
P no dependency arrows tangle, and
P one dependency arrow points at every word.
Diagram 4 shows how this principle rules out various kinds of dependencies which would
otherwise be possible (with ungrammatical patterns marked with `*').
8
Diagram 4
This principle kills three birds with one stone.
P It explains the ungrammaticality of some sentences which would otherwise be
permitted.
P It helps novices to check their diagrams for errors.
P It explains the vertical arrow which connects the verb in We drank the coffee to
nothing at all. The `main' verb in a sentence is (by definition) the one word in the
sentence that does not depend on any other word; but without an arrow, it causes no
tangling in examples like *Red drink wine (the last one in Diagram 4), which are
presumably bad for the same reason as those which tangle. The vertical arrow shows a
`potential' head, because almost any independent sentence may be turned into a
subordinate one, in which the tangling would involve a real dependency. (For instance,
Drink red wine! can be quoted as He told us to drink red wine, where drink depends on
to; in this larger sentence, positioning red before drink would make the dependency
between red and wine tangle with the one between to and drink.)
When introducing the No-tangling Principle I said `phrases tend to be continuous'. This
is only a tendency, to which there are well-known exceptions. In fact, it is the exceptions that
have kept theoretical linguists busy over the last few decades. This is where we use terms like
`raising', `extraposition' and `extraction'. The general principle still applies to most
constructions, but in these special cases there are rules which override it. For example, take
the apparently straightforward English sentence We were drinking the coffee. There are three
almost indisputable facts:
9
P There are very good reasons for taking we as the subject of were (e.g. it is were that
changes places with we in the question: Were we drinking the coffee?);
P There are almost equally good reasons for taking we as the subject of drinking as well.
P The non-dependent word is were, rather than drinking (were can be used without
drinking, as in the answer We were.; but drinking cannot be used without were, unless
we is omitted as well: *We drinking the coffee.)
When we put these three facts together, it turns out that the dependency from drinking to we
must tangle with the vertical arrow. This can be seen in the first structure of Diagram 5.
Diagram 5
This tangling is not just a consequence of the decision to use vertical arrows; the tangling still
occurs if we turn our sentence into a subordinate clause: I know that we were drinking the
coffee, in which were depends on that (for reasons that I cannot give for lack of space). This
is the second sentence analysed in Diagram 5.
The conclusion, then, is that some cases of tangling are permitted; in other words, some
phrases are discontinuous (e.g. the phrase we ... drinking the coffee). How can we reconcile
this conclusion with the No-tangling Principle? Answer: simply by interpreting the principle
as saying that every sentence's total dependency structure must include a substructure which
is tangle-free and single-arrowed. In other words, there must be a `skeleton' structure which
holds all the words together without tangling, but there may be other dependencies added
onto this structure (such as the dependency from drinking to we). The skeleton provides a
kind of `surface' structure, to which `deeper' relationships may be added. A practical
diagramming convention arises from this claim: we can show the skeleton of surface
relationships above the words (on the surface), while adding the deeper ones below the
words. This is what I have done in the bottom structure in Diagram 5. The skeleton above the
10
words must be tangle-free and contain precisely one arrow per word, but much more
complicated and `messy' patterns are allowed below the line, controlled only by the specifics
of the grammar for the language concerned. Diagram 6 shows how complex the deep patterns
can be for quite an ordinary English sentence.
Diagram 6
This very brief survey of how WG analysis works has shown how the structure of a
whole sentence can be seen as nothing but a collection of relationships between pairs of
single words. However, I mentioned that there was a major exception to this principle, which
we must now consider. The exception is one particular kind of syntactic relationship which is
quite different from the dependencies which we have considered so far: coordination. In a
sentence like John and Mary came, two words (John, Mary) are coordinated, with and
signalling this relationship; but this is an relationship between equals, unlike dependencies - a
completely different kind of relationship. The best way to handle coordination seems to be to
recognise `word-strings' such as John and Mary consisting of smaller word-strings, the
`conjuncts'. In the example just given, each of these smaller word-strings actually consists of
just one word, but of course they could have been much more complicated (e.g. the man with
the beard and the woman with the walking-stick). The diagramming is simple: brackets, with
curly brackets {...} for the whole coordination and square brackets [..] for each conjunct:
{[John] and [Mary]} came, or to take a more complicated example:
(3)
We {[drank the coffee] and [ate the food]} before we left.
The two kinds of structure, based on dependency and coordination, are closely linked, but we
unfortunately do not have time to explore them further. Instead of a proper discussion I shall
just give a diagram for one example.
11
Diagram 7
This completes the survey of WG syntax, and we now start a brief review of various
ways in which the system can be applied to practical problems.
4. Application: L1 study
I start with the study of the mother-tongue, which occupies a major part of the school
curriculum in any country. England and Wales are just emerging from a strange grammar-free
period, which contrasts with the situation found in many European countries (e.g. parts of
Eastern Europe, Germany and France) where children all learn something about the grammar
of their own language. There are several good reasons for including first-language grammar in
the curriculum (which I discussed at length in Hudson 1992).
P Children should be proud of their language, including its grammar, which is hardly
possible without some study of it. Each language has a character of its own, which
makes it interestingly different from every other language, and linguists can offer a
collection of `parameters' on which languages are known to vary. These parameters
involve:
P several general questions about word order,
P the amount of inflectional variation in words,
P the extent to which syntactic words fuse into longer phonological words (as in
Turkish),
P how subjects and objects are distinguished,
P how relative clauses are formed,
and much, much else. It is hard to see why any child should leave school without some
idea of their language's general `character', and perhaps also something of its history. I
have the impression that most people, and especially children, find it much easier to
relate to texts than to the grammatical system itself, so what is needed is a system such
as WG which is easy to apply to texts and allows them to develop their own
generalisations out of texts.
P The need for linguistic pride is especially true of children whose mother-tongue has
no official status as a standard language. In the UK, about 80-90% of children are in this
situation as speakers of non-standard dialects of English, and because of the low social
status of such dialects they are convinced that what they speak is simply bad English.
This has a number of undesirable social and educational consequences, and is quite
unnecessary. Speakers of non-standard dialects and languages should take just as much
pride in their language as those of standard languages, and once again a good way to
12
achieve this is to teach them something about their language's structure by the analysis
of ordinary texts (preferably the children's own ordinary speech).
P A related reason for teaching about the first language is that most school systems have
the task of teaching the national standard language to children who do not speak it
natively. The better the children understand their own language, the more easily they
will understand the differences between it and the standard - and especially if they can
understand diagrams (such as WG diagrams) illustrating both. For example, a very
widespread non-standard feature of English is the use of non-subject pronoun forms
(e.g. me, him) when the subject is coordinated:
(4)
a
b
(Standard)
(Non-standard)
John and I went.
John and me went.
This is a very interesting innovation in English, which may eventually be accepted into
standard English, but at present children (who almost all use the non-standard form in
speaking) have to learn the standard rules. School-teachers tend not to teach the rules
directly (perhaps because they themselves are unclear about them), so children at
present are left to work out the rules for themselves (with the result that some speakers
regularly use subject pronoun forms such as I and he in all coordinated structures,
regardless of their grammatical function: I saw John and he). How much better to draw
syntactic diagrams for the sentences I went and John and I went, as in Diagram 8, and
then to compare the words that standard and non-standard dialects put in the slot
indicated - I for standard, me for non-standard.
Diagram 8
There are a number of other good reasons for teaching syntactic analysis as part of
mother-tongue teaching, but we shall consider them below under other headings.
5. Application: L2 study
For a decade or so grammar has been unfashionable in the teaching of second languages, but
at present it seems to be making a come-back. This strikes me as a welcome development,
provided that it avoids the mindless rote-learning that used to characterise some grammarlearning in the Good Old Days. Here are my reasons for favouring the use of syntactic
analysis in L2 teaching:
P If the students can already analyse their mother-tongue (e.g. by drawing simple
13
structure diagrams), it must be helpful for them to apply the same system to the L2.
Where the two grammars are similar, this will appear visually; and where they are
different, it will be easy to localise the difference. For instance, if functions like `subject'
and `object' can be taken for granted, the elementary differences between subjects or
objects in Turkish and English can be explained very easily. Without these linking
concepts the whole of the L2 is equally mysterious.
P In the discussion of L1 study I said that languages have very general features which
allow them to be compared. Every such feature is relevant to L2 study, as it offers the
possibility of a broad generalisation which may help the learner. For example, a Turkish
learner of English ought presumably to understand the general difference between the
inflectional and agglutinative systems found in these two languages; but generalisations
about word order can help too. Diagrams of sentence-structure make any of these
differences much more concrete and easier to explain.
P Students presumably learn an L2 at least in part by experiencing it - by reading it and
listening to it - but they will only benefit from this experience to the extent that they
understand what they are reading. This means not only understanding the meaning, but
also understanding the syntactic patterns through which this meaning is expressed. It is
often possible to work out the meaning (by intelligent reading between the lines) just by
taking the meanings of the individual words, without having much idea of how the
sentence's syntax works; but this cleverness leaves the learner in the same state of
syntactic ignorance as before. For example, suppose a foreign student met the following
sentences which appeared almost next to each other in an English newspaper (The
Guardian, 11th May 1995):
(5)
a
b
Lord McAlpine used the European newspaper to attack his party again.
... the politician who knows his country to be on the wrong track.
Superficially their structures are the same; in both cases the learner could identify a verb
(used, knows), followed by a noun-phrase (the European newspaper, his country), followed
by to plus an infinitive phrase (to attack his party again, to be on the wrong track). And yet
their meaning structures are quite different. The learner needs to be aware that their syntactic
structures are also different, as shown in Diagram 9.
14
Diagram 9
The first sentence is easy - the to-phrase defines the purpose for which Lord McAlpine used
the European newspaper, and is completely separate from the other phrases which depend on
the verb used. The problem arises for our foreign learner if she thinks the second sentence's
syntactic structure is the same; in this case there is an extra arrow (below the line of words)
which shows that his country is the subject of to be, just as it would have been in who knows
that his country is on the wrong track (which has the same meaning).
The general point which this example illustrates is that L2-teaching would be much
easier if diagrams could be used. They could be used in various ways - e.g. by the teacher
when explaining a construction or by the student when puzzling through a difficult text, or
when being tested. A student who can draw the correct diagram for a sentence can certainly
understand its structure, but one who understands diagramming in general but can't draw a
diagram for that sentence probably doesn't understand its structure.
6. Application: Complex communication
Some sentences are much harder to understand than others. In some cases the difficulty is due
to the complexity and density of the ideas, but in others it is simply a matter of syntax. All the
following sentences express the same meaning, but they are not equally easy to understand:
(6)
a
b
c
That she likes him enough to marry him surprises me.
It surprises me that she likes him enough to marry him.
I am surprised that she likes him enough to marry him.
The sentence whose structure is the most straightforward is (a), where the subject of surprises
is the subordinate clause itself (that she likes him enough to marry him); but this is the
hardest of the three to read. In contrast, the other two sentences both contain one more word
(it or am) and have parts which are much less directly related to the meaning. Why should
they nevertheless be easier to read? The answer will suggest advice that we can give to our
students to help them to present complex information in as simple a way as possible.
The structures for the three sentences are shown in Diagram 10.
15
Diagram 10
These dependency diagrams carry an extra piece of information which takes the form of a
number under each word-space; for example, in the first sentence `2' stands after that, `3' after
she, and so on. This number is a measure of the work that the reader has to do while reading
that part of the sentence; more specifically, it shows how many incomplete dependencies the
reader has to hold in her mind at that point.
Imagine the words of each sentence being presented to you one at a time on a screen.
After reading the word that, and no other word, you have two incomplete dependencies to
hold in your mind until you can complete them. One is the dependency between that and its
head, the main verb (which eventually, much later, will turn out to be surprises); the other is
the dependency between it and the verb inside the subordinate clause (which will be likes).
After reading the next word, she, both these dependencies are still incomplete, but there is a
third: the dependency between she and its head. After likes the number falls to 2 because two
of these dependencies have been completed, but a new one has opened up: the one between
likes and its object. Of course, if you look at the diagram there is a third dependency after
likes, between it and enough; but the dependency between them is labelled `a', for `adjunct',
which means that it is not `expected' by the head, likes. Any dictionary ought to say that like
takes an object, but not that it takes a `quantity phrase' like enough to .... Therefore this
16
dependency does not count, for the simple reason that you, as reader, don't know anything
about it until you read the word enough. And so on through all the sentences.
Simply
glancing at the figures under the first sentence shows that they are higher than those under the
other two sentences, but we can make the comparison more precise by calculating the average
load per word: 1.6 for the first sentence, 1.0 for the second, and 0.9 for the third. I call this
measure the sentence's `dependency density'; and this, I suggest, is why the first is so much
harder to read than the other two: because it is constructed in such a way that a lot of
dependencies are incomplete at the same time. If writers could be taught to recognise this
kind of structure, and to know how to replace it by a more readable alternative, then every
reader would benefit.
This is not the only way to measure syntactic complexity, of course; but it gives
objectively calculated figures which correspond to more intuitive notions of complexity, and
it only works with theories like WG which use dependency diagrams.
7. Conclusions
No doubt the list of applications could be extended, but my main objective was to introduce
the Word Grammar approach to syntactic analysis and to show that it has some practical uses.
Some of these practical targets could perhaps be achieved by almost any approach to syntax,
but my claim has been that WG offers a particularly good combination of theoretical
sophistication (`truth'), teachability, usability and comprehensiveness.
References
Blake, B. 1990. Relational Grammar. Routledge.
Bresnan, J. 1982. The Mental Representation of Grammatical Relations. MIT Press.
Chomsky, N. 1993. A Minimalist Program for Linguistic Theory. In K. Hale and S. J. Keyser
(eds.) The View from Building 20: Essays in Linguistics in Honor of Sylvain Bromburger.
MIT Press, 1-52.
Halliday, M. 1985. An Introduction to Functional Grammar. Arnold.
Hudson, R. A. 1984. Word Grammar. Blackwell.
Hudson, R. A. 1990. English Word Grammar. Blackwell.
Hudson, R. A. 1992. Teaching Grammar. A guide to the National Curriculum. Blackwell.
Langacker, R. W. 1990. Concept, Image and Symbol. The cognitive basis of grammar.
Mouton de Gruyter.
Pollard, C. and Sag, I. 1994. Head-driven Phrase Structure Grammar. University of Chicago
Press.
Siewierkska, A. 1991. Functional Grammar. Routledge.
17
Practical importance of syntax
P L2-learning
P L2-teaching
P L1-teaching
P Language pathology
P Complex communication
P Computer-systems for understanding language
18
Agreement on sentence-structure
P Closely linked to meaning
P but different structures can express the same meaning
P Based on two kinds of relations among words:
P grammatical functions (e.g. subject, object),
P phrase-forming links
P Controlled by rules which refer to:
P meanings
P grammatical functions
P word order
P word-classes
19
Theoretical disagreements about sentence-structure
Some alternative theories:
P Minimalism
P Government-Binding Theory
P Head-driven Phrase Structure Grammar
P Lexical Functional Grammar
P Categorial Grammar
P Cognitive Grammar
P Systemic Functional Grammar
P Relational Grammar
P Functional Grammar
20
Requirements of a `practical' approach to syntax:
P As true as possible
P As complicated as the facts - no more, no less
P Diagramm-able
P Usable as learned
P As comprehensive as possible
e.g. `WORD GRAMMAR'
21
Some Principles of Word Grammar
P All units of syntax are single words (NOT phrases).
P Phrases are implicit in the dependencies.
P Most phrases are CONTINUOUS (not interrupted).
The No-tangling Principle:
Every sentence has a dependency structure in which:
P no dependency arrows tangle, and
P one dependency arrow points at every word.
P Excludes ungrammatical sequences.
P Helps novices to check their diagrams.
P Explains the vertical `root' arrow.
P These phrases are shown by `surface' dependencies above the words.
P Some phrases are interrupted.
e.g. We were drinking the coffee.
P Discontinuous phrases are shown by `deep' dependencies below the words.
P Coordination is exceptional, and cannot be shown by dependencies:
P Shown by brackets:
{[John] and [Mary]} left.
We {[drank the coffee] and [ate the food]} before we left.
22
Some applications of practical syntax
P L1 study
P L2 study
P measuring syntactic complexity
P L1 study
P to make children proud of L1
e.g. they could know something about:
P word order
P balance between inflection, agglutination, isolation
P how grammatical functions are distinguished.
P how relative clauses are formed.
P etc
P especially important if it is a non-standard language
P to explain how standard language differs from children's L1
e.g. Non-standard I went out.
Standard
John and me went out.
I went out.
John and I went out.
23
Some applications of practical syntax
P L1 study
P L2 study
P measuring syntactic complexity
P L2 study
P to build on existing knowledge of L1 and allow direct comparison of L2 and L1.
P to make comparisons visible and concrete
P to force the learner to pay attention to the syntactic structures behind the
meanings which can be guessed.
VERB NOUN-PHRASE
Lord McAlpine
used
the European newspaper
the politician who knows his country
INFINITIVEPHRASE
to attack his party again.
to be on the wrong track.
24
Some applications of practical syntax
P L1 study
P L2 study
P measuring syntactic complexity
P measuring syntactic complexity
a That she likes him enough to marry him surprises me.
b It surprises me that she likes him enough to marry him.
c I am surprised that she likes him enough to marry him.
25