Binarizing Syntax Trees to Improve Syntax-Based Machine Translation Accuracy
Wei Wang, Kevin Knight and Daniel Marcu
Language Weaver, Inc.
Admiralty Way, Suite 1210
Marina del Rey, CA

Abstract

We show that phrase structures in Penn Treebank style parses are not optimal for syntax-based machine translation. We exploit a series of binarization methods to restructure the Penn Treebank style trees such that syntactified phrases smaller than Penn Treebank constituents can be acquired and exploited in translation. We find that employing the EM algorithm to determine the binarization of a parse tree among a set of alternative binarizations gives us the best translation result.

1 Introduction

Syntax-based translation models (Eisner, 2003; Galley et al., 2006; Marcu et al., 2006) are usually built directly from Penn Treebank (PTB) (Marcus et al., 1993) style parse trees by composing treebank grammar rules. As a result, often no substructures corresponding to partial PTB constituents are extracted to form translation rules. Syntax translation models acquired by composing treebank grammar rules assume that long rewrites are not decomposable into smaller steps. This effectively restricts the generalization power of the induced model.

For example, suppose we have an xrs (Knight and Graehl, 2004) rule R1 in Figure 1 that translates the Chinese phrase RUSSIA MINISTER VIKTOR-CHERNOMYRDIN into an English tree fragment yielding an English phrase. Also suppose that we want to translate a Chinese phrase VIKTOR-CHERNOMYRDIN AND HIS COLLEAGUE into English. What we desire is that if we have another rule R2, as shown in Figure 1, we could somehow compose it with R1 to obtain the desirable translation. We unfortunately cannot do this, because R1 and R2 are not further decomposable and their substructures cannot be re-used. The requirement that all translation rules have exactly one root node does not enable us to use the translation of VIKTOR-CHERNOMYRDIN in any contexts other than those seen in the training corpus.

A solution to this problem is to right-binarize the left-hand-side (LHS) (or English-side) tree of R1 such that we can decompose R1 into R3 and R4, by factoring NNP(viktor) NNP(chernomyrdin) out as R4 according to the word alignments; and to left-binarize the LHS of R2 by introducing a new tree node that collapses the two NNPs, so as to generalize this rule, giving rule R5 and rule R6. We also need to consistently syntactify the root label of R4 and the new frontier label of R6 such that these two rules can be composed. Since labeling is not a concern of this paper, we simply label new nodes with X-bar, where X is the parent label. With all this in place, we can now translate the foreign sentence by composing R6 and R4 in Figure 1.

Binarizing syntax trees for syntax-based machine translation is similar in spirit to generalizing parsing models via markovization (Collins, 1997; Charniak, 2000). But in translation modeling, it is unclear how to effectively markovize the translation rules, especially when the rules are complex like those proposed by Galley et al. (2006). In this paper, we explore the generalization ability of simple binarization methods like left-, right-, and head-binarization, and also their combinations.
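To make the composition mechanics concrete, here is a minimal sketch, in Python, of xrs rules as (English-side tree, source-side string) pairs and of rule composition at a shared frontier nonterminal. The Tree and Rule classes and the compose helper are illustrative assumptions for this paper's setting, not the authors' implementation; NPB stands for the base-NP label of the Collins-style parses used later in the paper.

```python
# Hypothetical sketch: composing two xrs rules at a frontier variable.
from dataclasses import dataclass, field

@dataclass
class Tree:
    label: str                           # e.g. "NPB", "NNP(viktor)", or a variable "x0:NPB-bar"
    children: list = field(default_factory=list)

@dataclass
class Rule:
    lhs: Tree                            # English-side tree fragment
    rhs: list                            # source-side string; variables refer to LHS frontier nodes

def compose(outer: Rule, inner: Rule, var: str) -> Rule:
    """Plug `inner` into `outer` at frontier variable `var`; labels must match."""
    def subst(node: Tree) -> Tree:
        if node.label.startswith(var + ":"):
            assert node.label.split(":")[1] == inner.lhs.label
            return inner.lhs
        return Tree(node.label, [subst(c) for c in node.children])
    rhs = [t for tok in outer.rhs for t in (inner.rhs if tok == var else [tok])]
    return Rule(subst(outer.lhs), rhs)

# R4 and R6 from Figure 1, after binarization:
r4 = Rule(Tree("NPB-bar", [Tree("NNP(viktor)"), Tree("NNP(chernomyrdin)")]), ["V-C"])
r6 = Rule(Tree("NPB", [Tree("x0:NPB-bar"), Tree("CC(and)"), Tree("PRP$(his)"),
                       Tree("NNS(colleague)")]), ["x0", "AND", "HIS", "COLLEAGUE"])
print(compose(r6, r4, "x0").rhs)         # ['V-C', 'AND', 'HIS', 'COLLEAGUE']
```

Composing r6 with r4 yields exactly the translation of VIKTOR-CHERNOMYRDIN AND HIS COLLEAGUE that the unbinarized rules R1 and R2 could not produce.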
Simple binarization methods binarize syntax trees in a consistent fashion (left-, right-, or head-) and thus cannot guarantee that all the substructures can be factored out. For example, right binarization on the LHS of R1 makes available R4, but misses R6 on R2. We therefore introduce a parallel restructuring method: one can binarize both to the left and to the right at the same time, resulting in a binarization forest. We employ the EM (Dempster et al., 1977) algorithm to learn the binarization bias for each tree node from the parallel alternatives. The EM-binarization yields the best translation performance.

[Figure 1: Generalizing translation rules by binarizing trees.]

The rest of the paper is organized as follows. Section 2 describes related research. Section 3 defines the concepts necessary for describing the binarization methods. Section 4 describes the tree binarization methods in detail. Section 5 describes the forest-based rule extraction algorithm, and Section 6 explains how we restructure the trees using the EM algorithm. The last two sections present experiments and conclusions.

2 Related Research

Several researchers (Melamed et al., 2004; Zhang et al., 2006) have already proposed methods for binarizing synchronous grammars in the context of machine translation. Grammar binarization usually maintains an equivalence to the original grammar, such that the binarized grammar generates the same language and assigns the same probability to each string as the original grammar does. Grammar binarization is often employed to make the grammar fit in a CKY parser. In our work, we focus on the binarization of parse trees. Tree binarization generalizes the resulting grammar and changes its probability distribution. In tree binarization, synchronous grammars built from restructured (binarized) training trees still contain non-binary, multi-level rules and thus still require the binarization transformation so as to be employed by a CKY parser.

The translation model we are using in this paper belongs to the xrs formalism (Knight and Graehl, 2004), which has proved successful for machine translation in (Galley et al., 2004; Galley et al., 2006; Marcu et al., 2006).

3 Concepts

We focus on tree-to-string (in the noisy-channel-model sense) translation models. Translation models of this type are typically trained on tuples of a source-language sentence f, a target-language (e.g., English) parse tree π that yields e and translates from f, and the word alignments a between e and f. Such a tuple is called an alignment graph in (Galley et al., 2004). Graph (1) in Figure 2 is such an alignment graph.

[Figure 2: Left, right, and head binarizations. Heads are marked with *s. New nonterminals introduced by binarization are denoted by X-bars.]

A tree node in π is admissible if the f string covered by the node is contiguous and not empty, and if this f string does not align to any e string that is not covered by the node. An xrs rule can be extracted only from an admissible tree node, so that we do not have to deal with discontiguous f spans in decoding (or synchronous parsing). For example, in tree (2) of Figure 2, the new NPB-bar node is not admissible, because the f string that the node covers also aligns to NNP4, which is not covered by the NPB-bar. The NPB-bar node in tree (3), on the other hand, is admissible.
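The admissibility test can be phrased directly in terms of the alignment. Below is a small sketch assuming alignments are given as a set of (e_index, f_index) pairs and a node is identified by the half-open e-span it yields; the function name and the toy alignment are illustrative, not taken from the paper.

```python
# Sketch of the admissibility test: the node's f-span must be non-empty,
# and closed under the alignment (no f word in the span links to an e word
# outside the node's yield).
def is_admissible(align, e_lo, e_hi):
    f_links = [f for (e, f) in align if e_lo <= e < e_hi]
    if not f_links:                      # empty f-span: not admissible
        return False
    f_min, f_max = min(f_links), max(f_links)
    # every link landing inside [f_min, f_max] must originate in the e-span
    return all(e_lo <= e < e_hi for (e, f) in align if f_min <= f <= f_max)

# Toy alignment: e = NNP1 NNP2 NNP3 NNP4, f = RUSSIA MINISTER V-C,
# with V-C aligned to both NNP3 and NNP4 (positions are 0-based).
align = {(0, 0), (1, 1), (2, 2), (3, 2)}
print(is_admissible(align, 0, 3))        # False: f word 2 also links to e word 3
print(is_admissible(align, 2, 4))        # True: NNP3 NNP4 <-> V-C
```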
A set of sibling tree nodes is called factorizable if we can form an admissible new node dominating them. For example, in tree (1) of Figure 2, the sibling nodes NNP2, NNP3 and NNP4 are factorizable, because we can factor them out and form a new node, resulting in tree (3). The sibling tree nodes NNP1, NNP2 and NNP3 are not factorizable. In synchronous parse trees, not all sibling nodes are factorizable, and thus not all sub-phrases can be acquired and syntactified. The main purpose of this paper is to restructure parse trees by factorization such that syntactified sub-phrases can be employed in translation.

4 Binarizing Syntax Trees

We are going to binarize a tree node n that dominates r children n1, ..., nr. Restructuring is performed by introducing new tree nodes that dominate a subset of the children nodes. To avoid over-generalization, we allow ourselves to form only one new node at a time. For example, in Figure 2, we can binarize tree (1) into tree (2), but we are not allowed to form two new nodes, one dominating NNP1 NNP2 and the other dominating NNP3 NNP4. Since labeling is not the concern of this paper, we relabel the newly formed nodes as n-bar.

4.1 Simple binarization methods

The left binarization of node n (i.e., the NPB in tree (1) of Figure 2) factorizes the leftmost r-1 children by forming a new node n-bar (i.e., the NPB-bar in tree (2)) to dominate them, leaving the last child nr untouched; it then makes the new node n-bar the left child of n. The method then recursively left-binarizes the newly formed node n-bar until only two children are left. In Figure 2, we left-binarize tree (1) into (2) and then into (4).

The right binarization of node n factorizes the rightmost r-1 children by forming a new node n-bar (i.e., the NPB-bar in tree (3)) to dominate them, leaving the first child n1 untouched; it then makes the new node n-bar the right child of n. The method then recursively right-binarizes the newly formed node n-bar. In Figure 2, we right-binarize tree (1) into (3) and then into (7).
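A sketch of these two transforms, reusing the Tree class from the earlier sketch; new nodes get the parent's label suffixed with "-bar", following the paper's X-bar convention. Head binarization would additionally consult a head table to pick the direction at each node.

```python
# Sketch: recursive left/right binarization (Section 4.1), on the Tree class above.
def left_binarize(node: Tree) -> Tree:
    """Factor the leftmost r-1 children under a new bar node, recursively."""
    kids = [left_binarize(c) for c in node.children]
    if len(kids) <= 2:
        return Tree(node.label, kids)
    base = node.label.split("-bar")[0]
    return Tree(node.label, [left_binarize(Tree(base + "-bar", kids[:-1])), kids[-1]])

def right_binarize(node: Tree) -> Tree:
    """Factor the rightmost r-1 children under a new bar node, recursively."""
    kids = [right_binarize(c) for c in node.children]
    if len(kids) <= 2:
        return Tree(node.label, kids)
    base = node.label.split("-bar")[0]
    return Tree(node.label, [kids[0], right_binarize(Tree(base + "-bar", kids[1:]))])

t = Tree("NPB", [Tree("NNP1"), Tree("NNP2"), Tree("NNP3"), Tree("NNP4")])
# left_binarize(t)  -> NPB(NPB-bar(NPB-bar(NNP1 NNP2) NNP3) NNP4), tree (4) of Figure 2
# right_binarize(t) -> NPB(NNP1 NPB-bar(NNP2 NPB-bar(NNP3 NNP4))), tree (7) of Figure 2
```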
The head binarization of node n left-binarizes n if the head is the first child, and otherwise right-binarizes n. We prefer right-binarization to left-binarization when both are applicable under the head restriction, because our initial motivation was to generalize the NPB-rooted translation rules. As we will show in the experiments, binarization of other types of phrases contributes to the translation accuracy improvement as well.

Any of these simple binarization methods is easy to implement, but none is capable of giving us all the factorizable sub-phrases. Binarizing all the way to the left, for example, from tree (1) to tree (2) and then to tree (4) in Figure 2, does not enable us to acquire a substructure that yields NNP3 NNP4 and their translational equivalences. To obtain more factorizable sub-phrases, we need to parallel-binarize in both directions.

4.2 Parallel binarization

Simple binarizations transform a parse tree into another single parse tree. Parallel binarization transforms a parse tree into a binarization forest, desirably packed so as to enable dynamic programming when extracting translation rules from it. Borrowing terms from parsing semirings (Goodman, 1999), a packed forest is composed of additive forest nodes (⊕-nodes) and multiplicative forest nodes (⊗-nodes). In the binarization forest, a ⊗-node corresponds to a tree node in the unbinarized tree; this ⊗-node composes several ⊕-nodes, forming a one-level substructure that is observed in the unbinarized tree. A ⊕-node corresponds to the alternative ways of binarizing the same tree node in the unbinarized tree, and it contains one or more ⊗-nodes. The same ⊕-node can appear in more than one place in the packed forest, enabling sharing.

[Figure 3: Packed forest obtained by packing trees (4) and (7) in Figure 2.]

Figure 3 shows the packed forest obtained by packing trees (4) and (7) in Figure 2 via the following parallel binarization algorithm. To parallel-binarize a tree node n that has children n1, ..., nr, we employ the following steps (a code sketch follows the list):

- We recursively parallel-binarize the children nodes n1, ..., nr, producing binarization ⊕-nodes ⊕(n1), ..., ⊕(nr), respectively.
- We right-binarize n, if any contiguous subset of the children n2, ..., nr is factorizable, by introducing an intermediate tree node labeled n-bar. (We factorize only subsets that cover contiguous spans, to avoid introducing discontiguous constituents for practical purposes; in principle, the algorithm works fine without this condition.) We recursively parallel-binarize n-bar to generate a binarization forest node ⊕(n-bar), and form a multiplicative forest node ⊗R as the parent of ⊕(n1) and ⊕(n-bar).
- We left-binarize n if any contiguous subset of n1, ..., n(r-1) is factorizable and if this subset contains n1. Similarly to the above right-binarization, we introduce an intermediate tree node labeled n-bar, recursively parallel-binarize n-bar to generate a binarization forest node ⊕(n-bar), and form a multiplicative forest node ⊗L as the parent of ⊕(n-bar) and ⊕(nr).
- We form an additive node ⊕(n) as the parent of the two already formed multiplicative nodes ⊗L and ⊗R.

The (left and right) binarization conditions consider any subset in order to enable the factorization of small constituents. For example, in tree (1) of Figure 2, although NNP1 NNP2 NNP3 of the NPB are not factorizable, the subset NNP1 NNP2 is factorizable. The binarization from tree (1) to tree (2) serves as a relaying step that lets us factorize NNP1 NNP2 in tree (4).
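Concretely, the procedure can be sketched as follows, again reusing the Tree class; `factorizable` stands in for the alignment-based test of Section 3, and the packing/memoization that shares identical ⊕-nodes across the forest is omitted for brevity.

```python
# Sketch of parallel binarization (Section 4.2): build a forest of additive
# (OR) and multiplicative (AND) nodes over all left/right binarizations.
from dataclasses import dataclass, field

@dataclass
class AndNode:                            # a ⊗-node: one observed one-level structure
    children: list

@dataclass
class OrNode:                             # a ⊕-node: alternative binarizations of a node
    label: str
    alts: list = field(default_factory=list)

def spans(children):
    """All contiguous sub-sequences of length >= 2."""
    return [children[i:j] for i in range(len(children))
            for j in range(i + 2, len(children) + 1)]

def parallel_binarize(node, factorizable):
    kids = [parallel_binarize(c, factorizable) for c in node.children]
    forest = OrNode(node.label)
    if len(kids) <= 2:
        forest.alts.append(AndNode(kids))
        return forest
    base = node.label.split("-bar")[0]
    rest, front = node.children[1:], node.children[:-1]
    if any(factorizable(s) for s in spans(rest)):                       # right-binarize
        bar = parallel_binarize(Tree(base + "-bar", rest), factorizable)
        forest.alts.append(AndNode([kids[0], bar]))
    if any(factorizable(s) for s in spans(front) if s[0] is front[0]):  # left, stricter
        bar = parallel_binarize(Tree(base + "-bar", front), factorizable)
        forest.alts.append(AndNode([bar, kids[-1]]))
    if not forest.alts:                   # assumption: keep the flat expansion if
        forest.alts.append(AndNode(kids)) # neither direction is triggered
    return forest
```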
The left-binarization condition is stricter than the right-binarization condition in order to avoid spurious binarization, i.e., to avoid the same subconstituent being reached via both binarizations.
We could transform tree (1) directly into tree (4) without bothering to generate tree (2). However, skipping tree (2) would create difficulty in applying the EM algorithm to choose a better binarization for each tree node, since tree (4) could then be classified neither as a left binarization nor as a right binarization of the original tree (1): it is the result of the composition of two left-binarizations.

In parallel binarization, nodes are not always binarizable in both directions. For example, we do not need to right-binarize tree (2), because NNP2 NNP3 are not factorizable and thus cannot be used to form sub-phrases. It is still possible to right-binarize tree (2) without affecting the correctness of the parallel binarization algorithm, but doing so would spuriously increase the branching factor of the search during rule extraction, because we would have to expand more tree nodes.

A restricted version of parallel binarization is headed parallel binarization, in which both the left and the right binarization must respect the head propagation property at the same time.

A nice property of parallel binarization is that for any factorizable substructure in the unbinarized tree, we can always find a corresponding admissible ⊕-node in the parallel-binarized packed forest. A leftmost substructure like the lowest NPB-bar subtree in tree (4) of Figure 2 can be made factorizable by several successive left binarizations, resulting in ⊕-node 5 in the packed forest of Figure 3. A substructure in the middle can be factorized by the composition of several left- and right-binarizations. Therefore, after a tree is parallel-binarized, to make the sub-phrases available to the MT system, all we need to do is extract rules from the admissible nodes in the packed forest. Rules that can be extracted from the original unrestructured tree can be extracted from the packed forest as well.

Parallel binarization results in parse forests. Thus translation rules need to be extracted from training data consisting of (e-forest, f, a)-tuples.

5 Extracting translation rules from (e-forest, f, a)-tuples

The algorithm that extracts rules from (e-forest, f, a)-tuples is a natural generalization of the (e-parse, f, a)-based rule extraction algorithm in (Galley et al., 2006). The input to the forest-based algorithm is an (e-forest, f, a)-triple. The output of the algorithm is a derivation forest (Galley et al., 2006) composed of xrs rules. The algorithm recursively traverses the e-forest top-down and extracts rules only at admissible forest nodes. The following procedure transforms the packed e-forest in Figure 3 into the packed synchronous derivation forest in Figure 4.

Condition 1: Suppose we reach an additive e-forest node, e.g., ⊕-node 1 in Figure 3. For each of its children, the ⊗-type e-forest nodes 2 and 11, we go to Condition 2 to recursively extract rules on these two e-forest nodes, generating the multiplicative derivation forest nodes ⊗(NPB(NPB-bar:x0 NNP3(viktor) NNP4(chernomyrdin)) → x0 V-C) and ⊗(NPB(NNP1:x0 NPB-bar(NNP2:x1 NPB-bar:x2)) → x0 x1 x2) in Figure 4. We make these new nodes children of ⊕(NPB) in the derivation forest.
Condition 2: Suppose we reach a multiplicative e-forest node, e.g., ⊗-node 11 in Figure 3. We extract rules rooted at it using the procedure in (Galley et al., 2006), forming multiplicative derivation forest nodes such as ⊗(NPB(NNP1:x0 NPB-bar(NNP2:x1 NPB-bar:x2)) → x0 x1 x2). We then go to Condition 1 to form the derivation forest on the additive frontier e-forest nodes of the newly extracted rules, generating the additive derivation forest nodes ⊕(NNP1), ⊕(NNP2) and ⊕(NPB-bar). We make these nodes the children of node ⊗(NPB(NNP1:x0 NPB-bar(NNP2:x1 NPB-bar:x2)) → x0 x1 x2) in the derivation forest.

This algorithm is a natural extension of the extraction algorithm in (Galley et al., 2006), in the sense that we have an extra condition (Condition 1) to relay rule extraction on additive e-forest nodes.
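The two conditions amount to a pair of mutually recursive procedures over the packed e-forest. In the sketch below, `extract_minimal_rule` abstracts the (Galley et al., 2006) extractor applied at one ⊗-node (returning the rule plus the admissible ⊕-nodes at its frontier), and the DerivOr/DerivAnd classes are illustrative.

```python
# Sketch of forest-based rule extraction (Section 5), conditions 1 and 2.
from dataclasses import dataclass, field

@dataclass
class DerivOr:                       # additive derivation-forest node
    label: str
    alts: list = field(default_factory=list)

@dataclass
class DerivAnd:                      # multiplicative node: a rule + its sub-derivations
    rule: object
    children: list = field(default_factory=list)

def extract_from_or(or_node, extract_minimal_rule, seen=None):
    """Condition 1: relay extraction to each alternative binarization."""
    seen = {} if seen is None else seen
    if id(or_node) in seen:          # shared ⊕-nodes are expanded only once (packing)
        return seen[id(or_node)]
    d = DerivOr(or_node.label)
    seen[id(or_node)] = d
    d.alts = [extract_from_and(a, extract_minimal_rule, seen) for a in or_node.alts]
    return d

def extract_from_and(and_node, extract_minimal_rule, seen):
    """Condition 2: extract a rule rooted here, then recurse on the rule's
    admissible additive frontier nodes."""
    rule, frontier = extract_minimal_rule(and_node)   # abstracted (Galley et al., 2006)
    return DerivAnd(rule, [extract_from_or(f, extract_minimal_rule, seen)
                           for f in frontier])
```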
[Figure 4: Derivation forest.]

It is worthwhile to eliminate the spuriously ambiguous rules that are introduced by the parallel binarization. For example, we may extract the following two rules:

- A(A-bar(B:x0 C:x1) D:x2) → x1 x0 x2
- A(B:x0 A-bar(C:x1 D:x2)) → x1 x0 x2

These two rules, however, are not really distinct. They both converge to the following rule if we delete the auxiliary A-bar nodes:

- A(B:x0 C:x1 D:x2) → x1 x0 x2

The forest-based rule extraction algorithm produces much larger grammars than the tree-based one, making it difficult to scale to very large training data. From a 50M-word Chinese-to-English parallel corpus, we can extract more than 300 million translation rules, while the tree-based rule extraction algorithm gives approximately 100 million. However, the restructured trees from the simple binarization methods are not guaranteed to be the best trees for syntax-based machine translation. What we desire is a binarization method that still produces single parse trees, but is able to mix left binarization and right binarization in the same tree. In the following, we use the EM algorithm to learn the desirable binarization on the forest of binarization alternatives proposed by the parallel binarization algorithm.

6 Learning how to binarize via the EM algorithm

The basic idea of applying the EM algorithm to choose a restructuring is as follows. We perform a set {β} of binarization operations on a parse tree τ. Each binarization β is a sequence of binarizations on the necessary (i.e., factorizable) nodes in τ, in preorder. Each binarization β results in a restructured tree τ_β. We extract rules from (τ_β, f, a), generating a translation model consisting of parameters (i.e., rule probabilities) θ_β. Our aim is to obtain the binarization β that gives the best likelihood of the restructured training data consisting of (τ_β, f, a)-tuples. That is,

β_EM = argmax_β p(τ_β, f, a | θ_β)    (1)

In practice, we cannot enumerate the exponentially many binarized trees for a given e-parse. We therefore use the packed forest to store all the binarizations that operate on an e-parse in a compact way, and then use the inside-outside algorithm (Lari and Young, 1990; Knight and Graehl, 2004) for model estimation.

The probability p(τ_β, f, a) of a (τ_β, f, a)-tuple is what the basic syntax-based translation model is concerned with. It can be computed by aggregating the rule probabilities p(r) in each derivation ω in the set Ω of all derivations (Galley et al., 2004; Marcu et al., 2006). That is,

p(τ_β, f, a) = Σ_{ω ∈ Ω} Π_{r ∈ ω} p(r)    (2)

Since it is well known that applying EM with tree fragments of different sizes causes overfitting (Johnson, 2002), and since it is also known that syntax MT models with larger composed rules in the mix significantly outperform rules that minimally explain the training data (minimal rules) in translation accuracy (Galley et al., 2006), we decompose p(τ_β, f, a) using minimal rules while running the EM algorithm, but, after the EM restructuring is finished, we build the final translation model using composed rules for evaluation.
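Equation (2) is exactly an inside score over the packed derivation forest: sum at ⊕-nodes, product at ⊗-nodes. Below is a sketch, reusing DerivOr/DerivAnd from the extraction sketch, with `theta` a dictionary of rule probabilities; the outside pass and the expected-count re-estimation that complete EM are omitted.

```python
# Sketch: inside score implementing equation (2), plus the Viterbi variant
# used to pick the final binarization after EM has converged.
def inside(node, theta):
    if isinstance(node, DerivOr):                      # ⊕: sum over alternatives
        return sum(inside(a, theta) for a in node.alts)
    score = theta[node.rule]                           # ⊗: rule prob times children
    for child in node.children:
        score *= inside(child, theta)
    return score

def viterbi(node, theta):
    """Same recursion with max in place of sum: scores the best derivation."""
    if isinstance(node, DerivOr):
        return max(viterbi(a, theta) for a in node.alts)
    score = theta[node.rule]
    for child in node.children:
        score *= viterbi(child, theta)
    return score
```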
[Figure 5: Using the EM algorithm to choose restructuring.]

Figure 5 shows the actual pipeline that we use for EM binarization. We first generate a packed e-forest via parallel binarization. We then extract minimal translation rules from the (e-forest, f, a)-tuples, producing synchronous derivation forests. We run the inside-outside algorithm on the derivation forests until convergence. We obtain the Viterbi derivations and project the English parses from the derivations. Finally, we extract composed rules using the (e-tree, f, a)-based rule extraction algorithm of Galley et al. (2006). This procedure corresponds to the path in the pipeline.

7 Experiments

We carried out a series of experiments to compare the performance of different binarization methods, in terms of BLEU, on Chinese-to-English translation tasks.

7.1 Experimental setup

Our bitext consists of 16M words, all in the mainland-news domain. Our development set is a 925-line subset of the 993-line NIST02 evaluation set; we removed long sentences from the NIST02 evaluation set to speed up discriminative training. The test set is the full 919-line NIST03 evaluation set.

We used a bottom-up, CKY-style decoder that works with binary xrs rules obtained via a synchronous binarization procedure (Zhang et al., 2006). The decoder prunes hypotheses using the strategies described in (Chiang, 2007). The parse trees on the English side of the bitext were generated with a parser (Soricut, 2004) implementing the Collins parsing models (Collins, 1997). We used the EM procedure described in (Knight and Graehl, 2004) to perform the inside-outside algorithm on the synchronous derivation forests and to generate the Viterbi derivation forest.

We used the rule extractor described in (Galley et al., 2006) to extract rules from (e-parse, f, a)-tuples, but with an important modification: new nodes introduced by binarization are not counted when computing the rule size limit unless they appear as rule roots. The motivation is that binarization deepens the parses and increases the number of tree nodes. In (Galley et al., 2006), a composed rule is extracted only if the number of internal nodes it contains does not exceed a limit (i.e., 4), similar to the phrase length limit in phrase-based systems. This means that rules extracted from the restructured trees would be smaller than those from the unrestructured trees, if the X-bar nodes are deleted; and as shown in (Galley et al., 2006), smaller rules lose context and thus give lower translation performance. Ignoring X-bar nodes when computing rule sizes preserves the unrestructured rules in the resulting translation model and adds the substructures as bonuses.

7.2 Experiment results

Table 1 shows the BLEU scores of mixed-case and detokenized translations from the different systems. All the binarization methods improve on the baseline system, which does not apply any binarization. The EM-binarization performs best among all the restructuring methods, yielding a 1.0 BLEU point improvement. We also computed bootstrap p-values (Riezler and Maxwell, 2005) for the pairwise BLEU comparison between the baseline system and each system trained from binarized trees.
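For the significance test, one common bootstrap variant resamples the test set with replacement and counts how often the comparison reverses; the sketch below assumes some corpus-level `corpus_bleu(hyps, refs)` scorer and may differ in detail from the exact protocol of (Riezler and Maxwell, 2005).

```python
# Sketch: paired bootstrap p-value for a corpus-BLEU comparison.
import random

def bootstrap_p(base_hyps, test_hyps, refs, corpus_bleu, n_samples=1000):
    """Fraction of resampled test sets on which the tested system
    fails to beat the baseline (a one-sided p-value estimate)."""
    n, reversals = len(refs), 0
    for _ in range(n_samples):
        idx = [random.randrange(n) for _ in range(n)]   # sample with replacement
        a = corpus_bleu([base_hyps[i] for i in idx], [refs[i] for i in idx])
        b = corpus_bleu([test_hyps[i] for i in idx], [refs[i] for i in idx])
        if b <= a:
            reversals += 1
    return reversals / n_samples
```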
The significance test shows that the EM-binarization result is statistically significantly better than the baseline system (p < 0.005), even though the baseline is already quite strong. To the best of our knowledge, this is the highest BLEU score reported on this test set to date.

As also shown in Table 1, the grammars trained from the binarized training trees are almost twice the size of the grammar obtained with no binarization. The extra rules are the substructures factored out by these binarization methods. How many more substructures (or translation rules) can be acquired is partially determined by how many more admissible nodes each binarization method can factorize, since rules are extractable only from admissible tree nodes. According to Table 1, the binarization methods significantly increase the number of admissible nodes in the training trees. The EM binarization makes available the largest number of admissible nodes, and thus results in the most rules.
EXPERIMENT           NIST03-BLEU        # RULES   # ADMISSIBLE NODES IN TRAINING
no-bin               —                  —         7,995,569
left binarization    — (p = 0.047)      114.0M    10,463,148
right binarization   — (p = 0.044)      113.0M    10,413,194
head binarization    — (p = 0.086)      113.8M    10,534,339
EM binarization      — (p = —)          115.6M    10,658,859

Table 1: Translation performance, grammar size, and number of admissible nodes versus binarization algorithm. BLEU scores are for mixed-case and detokenized translations, as is usual for NIST MT evaluations.

nonterminal   left-binarization   right-binarization
NP            96.97%              3.03%
NP-C          97.49%              2.51%
NPB           0.25%               99.75%
VP            93.90%              6.10%
PP            83.75%              16.25%
ADJP          87.83%              12.17%
ADVP          82.74%              17.26%
S             85.91%              14.09%
S-C           18.88%              81.12%
SBAR          96.69%              3.31%
QP            86.40%              13.60%
PRN           85.18%              14.82%
WHNP          97.93%              2.07%
NX            100%                0
SINV          87.78%              12.22%
PRT           100%                0
SQ            93.53%              6.47%
CONJP         18.08%              81.92%

Table 2: Binarization bias learned by EM.

The EM binarization factorizes more admissible nodes because it mixes both left and right binarizations in the same tree.

We computed the binarization biases learned by the EM algorithm for each nonterminal from the binarization forests of the headed parallel binarizations of the training trees, obtaining the statistics in Table 2. (Of course, the binarization bias chosen by the left-/right-binarization methods would be 100% deterministic.) One noticeable message from Table 2 is that most of the categories are actually biased toward left-binarization, although the motivating example in our introduction is an NPB, which needed right binarization. The main reason might be that the head sub-constituents of most categories tend to be on the left; but according to the performance comparison between head binarization and EM binarization, head binarization does not suffice, because we still need to choose between left and right binarization when both are head binarizations.

8 Conclusions

In this paper, we not only studied the impact of simple tree binarization algorithms on the performance of end-to-end syntax-based MT, but also proposed binarization methods that mix more than one simple binarization within the same parse tree. Whether to binarize a tree node to the left or to the right was learned by employing the EM algorithm on a set of alternative binarizations and choosing the Viterbi one. The EM binarization method is informed by word alignments, so that unnecessary new tree nodes are not blindly introduced. To the best of our knowledge, our research is the first work that generalizes a syntax-based translation model by restructuring and achieves a significant improvement over a strong baseline. Our work differs from traditional work on binarization of synchronous grammars in that we are not concerned with the equivalence of the binarized grammar to the original grammar; rather, we intend to generalize the original grammar, via restructuring of the training parse trees, to improve translation performance.

Acknowledgments

The authors would like to thank David Chiang, Bryant Huang, and the anonymous reviewers for their valuable feedback.

References

E. Charniak. 2000. A maximum-entropy-inspired parser. In Proceedings of the Human Language Technology Conference of the North American Chapter of the Association for Computational Linguistics, Seattle, May.

David Chiang. 2007. Hierarchical phrase-based translation. Computational Linguistics, 33(2).

Michael Collins. 1997. Three generative, lexicalized models for statistical parsing. In Proceedings of the 35th Annual Meeting of the Association for Computational Linguistics (ACL), pages 16-23, Madrid, Spain, July.
A. P. Dempster, N. M. Laird, and D. B. Rubin. 1977. Maximum likelihood from incomplete data via the EM algorithm. Journal of the Royal Statistical Society, 39(1):1-38.

Jason Eisner. 2003. Learning non-isomorphic tree mappings for machine translation. In Proceedings of the 41st Annual Meeting of the Association for Computational Linguistics (ACL), Sapporo, July.

M. Galley, M. Hopkins, K. Knight, and D. Marcu. 2004. What's in a translation rule? In Proceedings of the Human Language Technology Conference and the North American Association for Computational Linguistics (HLT-NAACL), Boston, Massachusetts.

M. Galley, J. Graehl, K. Knight, D. Marcu, S. DeNeefe, W. Wang, and I. Thayer. 2006. Scalable inference and training of context-rich syntactic models. In Proceedings of the 44th Annual Meeting of the Association for Computational Linguistics (ACL).

Joshua Goodman. 1999. Semiring parsing. Computational Linguistics, 25(4).

M. Johnson. 2002. The DOP estimation method is biased and inconsistent. Computational Linguistics, 28(1).

K. Knight and J. Graehl. 2004. Training tree transducers. In Proceedings of NAACL-HLT.

K. Lari and S. Young. 1990. The estimation of stochastic context-free grammars using the inside-outside algorithm. Computer Speech and Language.

Daniel Marcu, Wei Wang, Abdessamad Echihabi, and Kevin Knight. 2006. SPMT: Statistical machine translation with syntactified target language phrases. In Proceedings of EMNLP-2006, Sydney, Australia.

M. Marcus, B. Santorini, and M. Marcinkiewicz. 1993. Building a large annotated corpus of English: The Penn Treebank. Computational Linguistics, 19(2).

I. Dan Melamed, Giorgio Satta, and Benjamin Wellington. 2004. Generalized multitext grammars. In Proceedings of the 42nd Annual Meeting of the Association for Computational Linguistics (ACL), Barcelona, Spain.

Stefan Riezler and John T. Maxwell. 2005. On some pitfalls in automatic evaluation and significance testing for MT. In Proceedings of the ACL Workshop on Intrinsic and Extrinsic Evaluation Measures for MT and/or Summarization.

Radu Soricut. 2004. A reimplementation of Collins's parsing models. Technical report, Information Sciences Institute, Department of Computer Science, University of Southern California.

Hao Zhang, Liang Huang, Daniel Gildea, and Kevin Knight. 2006. Synchronous binarization for machine translation. In Proceedings of HLT-NAACL.