Download The Power of Tree Series Transducers

Survey
yes no Was this document useful for you?
   Thank you for your participation!

* Your assessment is very important for improving the workof artificial intelligence, which forms the content of this project

Document related concepts
no text concepts found
Transcript
The Power of Tree Series Transducers
Andreas Maletti1
Technische Universität Dresden
Fakultät Informatik
June 15, 2006
1
Research funded by German Research Foundation (DFG GK 334)
Andreas Maletti (TU Dresden)
The Power of Tree Series Transducers
June 15, 2006
1 / 26
1
Motivation
2
Definition of Tree Series Transducers
3
Results
Andreas Maletti (TU Dresden)
The Power of Tree Series Transducers
June 15, 2006
2 / 26
Machine Translation
Overview
Subfield of computational linguistics
Automatic translation shall assist (human) translator
Offer several (likely) alternatives
History
1954: High prospects and expectations after Georgetown Experiment
1966: “Perfect translation” failed (ALPAC report)
1993: Statistical machine translation system [Brown et al 93]
Andreas Maletti (TU Dresden)
The Power of Tree Series Transducers
June 15, 2006
3 / 26
Machine Translation
Problem
Translate text of language X into grammatical text of language Y .
1
Preserve meaning
2
Preserve connotation
3
Preserve style
Relaxed Problem
Transform text of language X into text of language Y such that
1
the result is grammatical
2
expert for X and Y can discern original sentence
Andreas Maletti (TU Dresden)
The Power of Tree Series Transducers
June 15, 2006
4 / 26
Tree-based Model [Yamada, Knight 01]
VB
PRP VB1
VB2
He adores VB
TO
listening TO NN
to music
⇒
VB
PRP
VB2
VB1
He TO
VB adores
NN TO listening
music to
⇓
VB
PRP
VB2
VB1
VB ga daisuki desu
kare ha TO
NN TO kiku no
ongaku wo
VB
PRP
VB2
VB1
⇐ He ha TO
VB ga adores desu
NN TO listening no
music to
kare ha ongaku wo kiku no ga daisuki desu
3 phases: (i) Reorder, (ii) Insert, (iii) Translate
Andreas Maletti (TU Dresden)
The Power of Tree Series Transducers
June 15, 2006
5 / 26
Implementation of Phase (i): Reorder
Implementation by top-down tree series transducer
VB (0.723)
TT (vb)
VB
PRP VB1
VB2
He adores VB
TO
listening TO NN
TT (prp) TT (vb2) TT (vb1)
⇒
PRP
VB2
VB1
He VB
TO adores
listening TO NN
to music
to music
0.723
vb(VB(x1 , x2 , x3 )) → VB(prp(x1 ), vb2(x3 ), vb1(x2 ))
Andreas Maletti (TU Dresden)
The Power of Tree Series Transducers
June 15, 2006
6 / 26
Implementation of Phase (i): Reorder
VB (0.723)
VB (0.723)
TT (prp) TT (vb2) TT (vb1)
PRP (1) TT (vb2) TT (vb1)
PRP
VB2
VB1
He VB
TO adores
listening TO NN
⇒
He
VB2
VB1
VB
TO adores
listening TO NN
to music
to music
1
prp(PRP(x1 )) → PRP(x1 )
Andreas Maletti (TU Dresden)
The Power of Tree Series Transducers
June 15, 2006
7 / 26
Implementation of Phase (i): Reorder
VB (0.723)
VB (0.723)
PRP (1) TT (vb2) TT (vb1)
PRP (1) VB2 (0.749) TT (vb1)
He
VB2
VB1
VB
TO adores
listening TO NN
to music
⇒
He TT (to) TT (vb) VB1
TO
VB adores
TO NN listening
to music
0.749
vb2(VB2(x1 , x2 )) → VB2(to(x2 ), vb(x1 ))
Andreas Maletti (TU Dresden)
The Power of Tree Series Transducers
June 15, 2006
8 / 26
Implementation of Phase (i): Reorder
VB (0.723)
VB (0.723)
PRP (1) VB2 (0.749) TT (vb1)
He TT (to) TT (vb) VB1
TO
VB adores
TO NN listening
to music
Andreas Maletti (TU Dresden)
PRP (1)
⇒
∗
VB2 (0.749)
He TO (0.893)
VB1 (1)
VB (1) adores
NN (1) TO (1) listening
music
The Power of Tree Series Transducers
to
June 15, 2006
9 / 26
Implementation of Phase (i): Reorder
VB
PRP VB1
VB2
He adores VB
TO
listening TO NN
VB (0.723)
PRP (1)
⇒∗
to music
VB2 (0.749)
He TO (0.893)
VB1 (1)
VB (1) adores
NN (1) TO (1) listening
music
to
The above reordering has probability:
0.723 · 0.749 · 0.893 = 0.484
Andreas Maletti (TU Dresden)
The Power of Tree Series Transducers
June 15, 2006
10 / 26
Implementation Details
Rules
Original
PRP VB1 VB2
VB TO
TO NN
Andreas Maletti (TU Dresden)
Reordered
PRP VB1 VB2
PRP VB2 VB1
VB1 PRP VB2
VB1 VB2 PRP
VB2 PRP VB1
VB2 VB1 PRP
VB TO
TO VB
TO NN
NN TO
Probability
0.074
0.723
0.061
0.037
0.083
0.021
0.251
0.749
0.107
0.893
The Power of Tree Series Transducers
June 15, 2006
11 / 26
Tiburon [May, Knight 06]
Overview
Implements top-down weighted tree automata and top-down tree
series transducers over the probability semiring
Operations WTA: intersection, weighted determinization, pruning
Operations TST: application, composition, training
Applications
Used to implement Yamada-Knight model (custom implementation
took > 1 year, implementation in Tiburon 2 days)
Used to implement Japanese transliteration [Knight, Graehl 98]
Andreas Maletti (TU Dresden)
The Power of Tree Series Transducers
June 15, 2006
12 / 26
1
Motivation
2
Definition of Tree Series Transducers
3
Results
Andreas Maletti (TU Dresden)
The Power of Tree Series Transducers
June 15, 2006
13 / 26
Tree Series Transducers
Overview
tree series
transducer
weighted tree
automaton
weighted transducer
tree transducer
weighted automaton
tree automaton
generalized
sequential machine
string automaton
History
Introduced in [Kuich 99]
Extended to full generality in [Engelfriet, Fülöp, Vogler 02]
Andreas Maletti (TU Dresden)
The Power of Tree Series Transducers
June 15, 2006
14 / 26
Semiring
Definition
(A, +, ·, 0, 1) semiring, if
(A, +, 0) commutative monoid
(A, ·, 1) monoid
· distributes (both sided) over +
0 is absorbing for · (a · 0 = 0 = 0 · a)
Example
Natural numbers (N, +, ·, 0, 1)
Probabilities ([0, 1], max, ·, 0, 1)
Subsets (P(A), ∪, ∩, ∅, A)
Andreas Maletti (TU Dresden)
The Power of Tree Series Transducers
June 15, 2006
15 / 26
Top-down Tree Series Transducer [Engelfriet et al 02]
Definition
Polynomial top-down tree series transducer (Q, Σ, ∆, A, I , R) where
Q finite set of states
Σ and ∆ input and output ranked alphabet
A = (A, +, ·, 0, 1) semiring
I ⊆ Q set of initial states
R finite set of rules of the form
a
q(σ(x1 , . . . , xk )) → t
where t ∈ T∆ (Q(Xk ))
Andreas Maletti (TU Dresden)
The Power of Tree Series Transducers
June 15, 2006
16 / 26
Properties of TST
Definition
(Q, Σ, ∆, A, I , R) top-down TST
deterministic, if there is at most one rule with a given left hand side
and at most one initial state
linear, if (for every rule) every variable appears at most once in the
right hand side
nondeleting, if (for every rule) variables that occur in the left hand
side also occur in the right hand side
Note
Bottom-up TST process input tree from leaves toward root.
Andreas Maletti (TU Dresden)
The Power of Tree Series Transducers
June 15, 2006
17 / 26
Classes of Transformations
Definition
denotation
class of transformations computed by
substitution
x-TOPε (A)
top-down TST with properties x
ε-subst.
x-TOPo (A)
top-down TST with properties x
o-subst.
x-BOTε (A)
bottom-up TST with properties x
ε-subst.
x-BOTo (A)
bottom-up TST with properties x
o-subst.
In diagram:
x-TOPω (A) abbreviated to xω>
x-BOTω (A) abbreviated to xω⊥
Andreas Maletti (TU Dresden)
The Power of Tree Series Transducers
June 15, 2006
18 / 26
1
Motivation
2
Definition of Tree Series Transducers
3
Results
Andreas Maletti (TU Dresden)
The Power of Tree Series Transducers
June 15, 2006
19 / 26
Hasse Diagram for Deterministic TST
Probability Semiring and Semiring of Natural Numbers
d⊥
ε
d⊥
o
dt⊥
ε
dl⊥
ε
dn⊥
ε
h⊥
ε
dlt⊥
ε
dnt⊥
ε
hl⊥
ε
hn⊥
ε
dn⊥
o
dl⊥
o
dt⊥
o
dnl⊥
=
dnt⊥
o
dlt⊥
o
h=
o
dnlt⊥
=
hn=
o
hl=
o
dnt>
=
dlt>
=
dn>
=
dl>
=
hnl=
=
dnlt>
=
dnl>
=
dt>
=
d>
=
Andreas Maletti (TU Dresden)
The Power of Tree Series Transducers
June 15, 2006
20 / 26
Hasse Diagram for Deterministic TST
Semiring of Subsets
d⊥
=
d>
=
dn⊥
=
dl⊥
=
dt⊥
=
dnl⊥
=
dnt⊥
=
dlt⊥
=
dt>
=
dn>
=
dl>
=
dnt>
=
dlt>
=
dnl>
=
h>
=
h⊥
o
h⊥
ε
hl>
=
dnlt⊥
=
hn=
=
hl⊥
o
dnlt>
=
hl⊥
ε
hnl=
=
Andreas Maletti (TU Dresden)
The Power of Tree Series Transducers
June 15, 2006
21 / 26
Composition of Transformations
Definition
Let
ϕ : TΣ × T∆ → A
ψ : T∆ × TΓ → A
Composition of ϕ and ψ
(ϕ ; ψ) : TΣ × TΓ → A
X
(t, v ) 7→
ϕ(t, u) · ψ(u, v )
u∈T∆
Andreas Maletti (TU Dresden)
The Power of Tree Series Transducers
June 15, 2006
22 / 26
Composition Results
Theorem (see [Kuich 99] and [Engelfriet et al 02])
A commutative semiring
nlp-BOT(A) ; p-BOT(A) = p-BOT(A)
p-BOT(A) ; bdth-BOT(A) = p-BOT(A)
Theorem
A commutative semiring
lp-BOT(A) ; p-BOT(A) = p-BOT(A)
p-BOT(A) ; bd-BOT(A) = p-BOT(A)
bdt-TOP(A) ; lp-TOP(A) ⊆ p-TOP(A)
Andreas Maletti (TU Dresden)
The Power of Tree Series Transducers
June 15, 2006
23 / 26
References
Joost Engelfriet, Zoltán Fülöp, and Heiko Vogler.
Bottom-up and top-down tree series transformations.
J. Autom. Lang. Combin., 7(1):11–70, 2002.
Werner Kuich.
Full abstract families of tree series I.
In Jewels Are Forever, pages 145–156. Springer, 1999.
Werner Kuich.
Tree transducers and formal tree series.
Acta Cybernet., 14(1):135–149, 1999.
Jonathan May and Kevin Knight.
Tiburon: A weighted tree automata toolkit.
In Proc. 11th Int. Conf. Implementation and Application of Automata,
LNCS. Springer, 2006.
to appear.
Andreas Maletti (TU Dresden)
The Power of Tree Series Transducers
June 15, 2006
24 / 26
Publications
Conferences
Myhill-Nerode theorem for sequential transducers over unique GCD-monoids.
Proc. 9th Int. Conf. Implementation and Application of Automata
Relating Tree Series Transducers and Weighted Tree Automata.
Proc. 8th Int. Conf. Developments in Language Theory
Compositions of bottom-up tree series transformations.
Proc. 11th Int. Conf. Automata and Formal Languages
The power of tree series transducers of type I and II.
Proc. 9th Int. Conf. Developments in Language Theory
Hierarchies of tree series transformations—Revisited.
Proc. 10th Int. Conf. Developments in Language Theory
The substitution vanishes.
(with A. Kühnemann)
Proc. 11th Int. Conf. Algebraic Methodology and Software Technology
Does o-substitution preserve recognizability?
Proc. 11th Int. Conf. Implementation and Application of Automata
Andreas Maletti (TU Dresden)
The Power of Tree Series Transducers
June 15, 2006
25 / 26
Publications
Journals
Relating tree series transducers and weighted tree automata.
Int. J. of Foundations of Computer Science 16, 2005
Hasse diagrams for classes of deterministic bottom-up tree-to-tree-series
transformations.
Theoretical Computer Science 339, 2005
Cut sets as recognizable tree languages.
(with B. Borchardt, B. Šešelja, A. Tepavčević, H. Vogler)
Fuzzy Sets and Systems 157, 2006
Incomparability results for classes of polynomial tree series transformations.
(with H. Vogler)
J. Automata, Languages and Combinatorics, to appear, 2006
Bounds for tree automata with polynomial costs.
(with B. Borchardt, Zs. Gazdag, Z. Fülöp)
J. Automata, Languages and Combinatorics, to appear, 2006
Compositions of tree series transformations.
Theoretical Computer Science, to appear, 2006
Andreas Maletti (TU Dresden)
The Power of Tree Series Transducers
June 15, 2006
26 / 26