Generating Chinese Classical Poems with Statistical Machine Translation Models
|
|
|
- Maryann Heath
- 9 years ago
- Views:
Transcription
1 Proceedings of the Twenty-Sixth AAA Conference on Artificial ntelligence Generating Chinese Classical Poems with Statistical Machine Translation Models Jing He* Ming Zhou Long Jiang Tsinghua University Microsoft Research Asia Microsoft Research Asia Beijing, China, Beijing, China, Beijing, China, Abstract This paper describes a statistical approach to generation of Chinese classical poetry and proposes a novel method to automatically evaluate poems. The system accepts a set of keywords representing the writing intents from a writer and generates sentences one by one to form a completed poem. A statistical machine translation (SMT) system is applied to generate new sentences, given the sentences generated previously. For each line of sentence a specific model specially trained for that line is used, as opposed to using a single model for all sentences. To enhance the coherence of sentences on every line, a coherence model using mutual information is applied to select candidates with better consistency with previous sentences. n addition, we demonstrate the effectiveness of the BLEU metric for evaluation with a novel method of generating diverse references. ntroduction This paper presents a novel approach to generating Chinese classical poetry using the quatrain form as a test bed. Chinese classical poetry is an important cultural heritage with over 2,000 years of history. There are many genres of classical Chinese poetry but quatrain ( ) and lüshi ( ) are the most popular. A quatrain is a poem consisting of four lines and a lüshi is a poem of eight lines. Each line of a quatrain or a lüshi contains five or seven characters. Strict tonal pattern, rhyme scheme and structural constraints are required on the quatrain and lüshi as explained below. a) Tonal pattern n the traditional Chinese language, every character consists of one syllable and has one tone which can be either Ping (level tone), or Ze (downward tone). The Copyright 2012, Association for the Advancement of Artificial ntelligence ( All rights reserved. *This work was done when the first author was a visiting student at Microsoft Research Asia. *This work was supported in part by the National Basic Research Program of China Grants 2011CBA00300, 2011CBA00301, and the National Natural Science Foundation of China Grants , , general principle for the tonal pattern in a Chinese classical poem is that these two kinds of tones should be interleaved in each sentence. There are four common tonal patterns for the 5-char and 7-char quatrain (Wang, 2002). We show one of the tonal patterns widely used for a 5-char quatrain, where + indicates the Ze tone, - indicates the Ping tone, and * indicates that either tone is acceptable for the position. * * * b) Rhyme scheme n Chinese poetry, rhyming characters have the same ending vowel. The rhyming constraint for a Chinese quatrain is that the ending characters of the second and the fourth sentences should rhyme. c) Structural constraint The structure of a Chinese quatrain often follows the beginning, continuation, transition, summary template which means that the first sentence starts a topic, the second sentence continues the topic, the third sentence expresses something new on the starting topic, and the fourth sentence expresses or implies a general conclusion. Below is an example of 5-char quatrain. ( ) white sunlight along hill fade ( ) Yellow River into sea flow ( ) wish exhaust thousand mile eyesight ( ) More up one story tower This quatrain, entitled On the Stork Tower was written by Zhihuan Wang, a famous poet of the Tang Dynasty. The following is the English translation. 1650
2 As daylight fades along the hill, The Yellow River flows into the sea. Seeking a thousand mile view, mount another story of the Tower. The quatrain describes the author s feeling when he climbed the Stork Tower beside the Yellow River. As we can see, it fully satisfies the above-mentioned constraints of the quatrain form. n this paper, we propose a statistical approach to generating Chinese classical poetry with the following steps: 1. Given a set of keywords provided by a writer as the description of his/her intent, we use a template-based model to generate candidates for the first sentence of the poem and apply a poetry language model to select the best one. 2. We then use a phrase-based statistical machine translation (SMT) model to generate the remaining sentences one by one. The SMT model takes as input the current sentence as the source language and outputs the next sentence as the target language. Our contribution includes: 1. As every sentence of a poem plays a certain function in the semantic structure of a poem, sentence positionsensitive SMT models are employed, with each model generating a sentence with a particular semantic function. 2. Furthermore, information from all previously generated sentences is considered by the model when generating the next sentence, in order to ensure high coherence between sentences. 3. The BLEU metric is applied to the evaluation of the generated poems, and the method of generating references is introduced. The BLEU score of our system shows high correlation with the scores given by human experts. Related Work As a famous example for automatic Chinese poetry generation, the Daoxiang poem generator ( is one of several popular web sites providing a poem generation service. t uses the manually made linguistic templatebased approach. n the Daoxiang system, a user needs to input a title and select a rhyming template. Then, the system acquires from a manually created table a list of phrases or words related to the keywords in the given title. The system then randomly selects a few words or phrases from the list and inserts these into the user-selected rhyming template to generate lines and finally form the poem. The poems generated do satisfy the tonal and rhyming constraints, but every sentence generated is not natural, and the entire poem has difficulty delivering a centralized meaning that logically corresponds with the user s intent as reflected by the title. n the area of computer-assisted Chinese poetry generation, which is different with automated poetry generation that this paper focuses, Luo has developed a tool that provides the tonal templates for genres of Chinese poetry and a dictionary of the tone of Chinese characters. ( n a study of Chinese couplet generation, which could be considered a narrowed case of Chinese poetry generation, (Jiang and Zhou, 2008) achieve promising results with a statistical machine translation model for generating the second sentence given the first one of a couplet. But no simple extension of this approach can accomplish strong coherence among four lines while meeting global structural constraints, such as with beginning, continuation, transition, summary templates. This paper intends to solve it. We also study other systems that generate English poems and Spanish poems (Oliveria, 2009), but their methods are quite different from ours. Overview of the Proposed Approach As a simplified expression, a poem s topic can be represented by a few keywords. After the author has formed the outline of his or her thoughts, he/she chooses the genre of the poem, for instance, he selects quatrain. Then he needs to select a tonal pattern. Next the chosen tonal template is filled in with qualified characters to complete a line. Then the author continues to write the next line until the entire poem is finished. This is an iterative process in which the author can always change ideas, or try another tonal pattern, or swap the sequence of lines or change characters in a line. Our generation system follows these phases of human poem writing. The system accepts the users intent concerning the content and format and generates sentences one by one to create a poem. The following information is used to express a user s intent: 1) Several keywords that a user inputs to describe the topic of the poem. For example, spring, mountaineering, and climb can be used to express the topic of climbing a mountain in spring. 2) The length of each line. The author can select five or seven characters. 3) The tonal pattern. To simplify the experiments, we constrain the keywords that users may enter to describe the topics to the poetic phrase taxonomy (Liu 1735) ShiXueHanYing ( ), a dictionary used for teaching classical poetry to children. n this taxonomy, 41,218 phrases (34,290 unique) of length ranging from 1 to 5 characters have been classified into 1,016 clusters. A phrase can be classified into multiple clusters. Each cluster is assigned a concept such as spring, mountaineering, etc. As a result, the cluster of spring would contain those phrases widely 1651
3 used in Chinese poems depicting the spring season. 1,016 concepts cover most of the common topics in ancient Chinese poems. n our experiments, after the user inputs the abovementioned information, we use the template-based method to generate the first sentence and then use a statistical machine translation model to generate the other three sentences one-by-one. Template-based Generation of First Sentence After the user selects the tonal pattern and several keywords, an intuitive way to generate the target poem is to find some phrases related to the selected keywords and assemble them into four sentences according to the tonal pattern. Since we have constrained the user to select keywords from the cluster names in the taxonomy of ShiXueHanYing, to get candidate phrases related to the user selected keywords, all phrases included in the corresponding clusters are selected. Then, for generation of each sentence, we build a phrase lattice by putting every candidate phrase into every position of the tonal pattern if the phrase satisfies the tonal constraint for the position. The tone of each character in the phrase is obtained using a tone and rhyming dictionary of Chinese characters, called Pingshui Rhyme. (Named in Chinese, which is a tone and rhyme table for poetry writing compiled by Yuan Liu in the Song Dynasty of China.) To search for the best sentences in each lattice, we introduce a language model that scores all paths in the lattice. Here, a character-based trigram language model trained with Katz back-off from the training data is used. Training details are covered in Section 6. The Forward- Viterbi-Backward-A* algorithm is used to find the N-Best candidate sentences to facilitate the user s selection working in the interactive model. Statistical Quatrain Generation with SMT models (Jiang and Zhou, 2008) proposed a statistical machine translation (SMT) method to generate the second sentence given the first sentence of a Chinese couplet (which can be considered a special kind of two-sentence poetry). nspired by this, we applied a similar SMT-based approach to generate the next sentences given the previous sentences of a quatrain. When generating the second sentence, we regard the first one as the source language in the MT task and regard the second one as the target language sentence. This approach continues to generate the third and the fourth sentences in the same way. Using an SMT model for quatrain generation has several advantages. The statistical learning based method can easily learn the knowledge of poem generation by training over a large poetry corpus. The direct use of this SMT approach, however, faces the difficulties of assuring the coherence between all generated sentences and the constraints of semantic structure over a whole poem. To handle these challenges, firstly, because the SMT model relies on an input sentence to generate the next sentence and thus it cannot generate the first sentence with user s keywords, we use the template-based approach to generate the first sentence. Secondly, to maintain a meaningful semantic structure throughout the poem, we use three different SMT models, to generate the 2nd sentence, the 3rd sentence, and the 4th sentence, respectively, rather than using one single model for all these three sentences. These are thus position-sensitive models. Thirdly, to improve the coherence among the sentences, we incorporate a coherence model using the keywords in all previous sentences into the SMT framework to rank the current sentence candidates so that the current sentence preserves a strong association with the previous sentences. Basic SMT Models Specifically, given a sentence of a quatrain F denoted as F { f 1, f 2,..., fn} (i = 1, 2, 3), our objective is to seek the next sentence where fi and si are Chinese characters, such that S is maximized. Following (Och and Ney, 2002), we use a log-linear model as follows. S* arg max S S arg max S M i loghi( Where hi( are feature functions and M is the number of feature functions. n our design, characters are used instead of words as translation units to form phrases because in ancient Chinese, most words consist of only one character and Chinese word segmentation may introduce unexpected errors. To apply the features, S and F are segmented into phrases s...s and 1 f... f, respectively. We 1 assume a uniform distribution over all possible segmentations. 1 fi si) 2 si fi) 3 pw( fi si) 4 pw( si fi) 5( S) h ( Phrase translation model h ( nverted phrase translation model h ( Lexical weight h ( nverted lexical weight h Language model Table 1. Features Used in the SMT Model. (1) 1652
4 Following (Jiang and Zhou, 2008), the five features listed in Table 1 are selected in our model. This is the basic model for the generation of the next sentence. As we explained before, however, we have to make several important enhancements to improve semantic structure and coherence. n the following subsections, we explain each of the extensions in detail. Position-sensitive SMT Models As introduced above, the functions of the four sentences in a Chinese quatrain can be roughly summarized as beginning, continuation, transition, and summary respectively. Therefore the relationship between adjacent sentences at different positions of a quatrain will differ. A naïve translation model may not handle such dependencies well. n our method, we design three specific translation models for generating the second, third, and fourth sentences respectively, where the translation models actually includes the first four feature functions listed in Table 1. Each model is trained with the sentence pairs at that position. For instance, to train the model for the second sentence generation, we only use the sentence pairs of the first sentence and second sentence in the poem corpus. To alleviate data sparseness, each model is interpolated with a general model trained with all sentence pairs regardless of the positions in a poem. When using the translation probabilities in decoding, we empirically assign higher weights 1 to the probabilities from the specific model and lower weights (1-1 ) to those from the general model. TM = 1 TM s + (1-1 )TM b (2) Coherence Models The models used in Formula (1) only consider the information between adjacent sentences of a quatrain. Consequently, the generated four sentences do not have strong meaningful association to each other. n a good quatrain, however, all four sentences should be coherent in order to deliver a centralized meaning. Coherence is also considered useful in text summarization for handling macro-level relations between clauses or sentences (Mani et al., 1998). n order to keep the next sentence coherent with all the previously generated sentences, the Mutual nformation (M) score is added as the sixth feature h6( to the SMT model, which measures the association between the next sentence and the sentences already generated before. Let Set-A be the characters appearing in the next sentence to be generated and Set-B be the characters in all previously generated sentences. All mutual information values between any two characters in Set-A and Set-B are summed in the coherence model: si, sj) h6( M ( si, sj) log (3) si) sj) i, j i, j Where S i and S j indicate characters in Set-A and Set-B respectively, and the parameters S i, S j ), S i ), and S j ) indicate the probabilities of S i and S j co-occurring, S i occurring, and S j occurring in a poem, respectively. Decoding After the first sentence of the target quatrain is obtained with the template-based approach, the remaining three sentences are generated one by one with the proposed SMT-based approach. For each sentence, we use a phrasebased decoder similar to that used by (Koehn et al., 2003) to generate an N-best list of candidates. Because there is no word reordering operation in our next sentence generation, our decoder is a monotonic decoder. n addition, poetic sentences are often shorter than typical source sentences in the machine translation task, so our decoder is more efficient than traditional translation decoders. Moreover, translation options are pruned if they violate the author-specified tonal pattern. According to the rhyming constraints of the Chinese quatrain, the ending character of the fourth sentence should follow the same rhyme of the ending character of the second sentence. n order to satisfy this constraint, when decoding for the fourth sentence, we remove those translation options at the last position of the sentence whose ending character does not rhyme with that of the second sentence. Note that the process of our poetry generation can be run in an interactive mode, which means that for each of the four sentences, the N-best candidates can be presented for user-selection. The system will use the user-selected sentence as input to continue to generate the N-best candidates of the next sentence. With this mode, the user can control the generation such that the final quatrain best conveys his or her leading idea. Model Training To estimate the weights in formulas (1), (2), and (4), we use the Minimum Error Rate Training (MERT) algorithm, which is widely used for phrase-based SMT model training (Och, 2003). The training data and criteria (BLEU) for MERT will be explained in Section 7. The training data for the translation and language models are discussed below. Data for Translation Model Training We downloaded from the nternet the <Tang Poems>, <Song Poems>, <Ming Poems>, <Qing Poems>, and <Tai Poems>, which amount to more than 3,500,000 sentences. From one quatrain, we extracted three sentence pairs (firstsecond, second-third, and third-fourth) and from an eightsentence lüshi, we get seven pairs. Altogether, we constructed more than 1,200,000 sentence pairs as training data for our translation models. The smoothing details of the general translation models are shown in Section
5 Data for Language Model Training The data for training the language model includes two parts: One is the corpus of poems mentioned above, which contains more than 3,500,000 sentences, and the other is a corpus of ancient articles obtained from the nternet, including,, etc, which contains about 12,000,000 sentences, and serves as the complementary data. Using the BLEU evaluation, we trained two character-based trigram models on the two parts of data, denoted by p1(s) and p2(s), which stand for the poem language model and article language model, respectively. Then we linearly combined the two models to get the final one as follows, where the weights are automatically trained by the Minimum Error Rate Training (MERT) algorithm, See Section 7.4. s) = 1 p 1 (s) + (1-1 )p 2 (s) (4) Evaluation We use BLEU (Papineni et al., 2002), which is widely used for automatic evaluation of machine translation systems. First we will explain the method of building references for BLEU. Second, we verified the effectiveness of BLEU by computing the correlation with human evaluation. Finally, our systems with the proposed position-sensitive SMT models and coherence model are evaluated with BLEU metric. To verify our method, we conducted experiments over 5 different systems. Following (Papineni et al., 2002), we use 5 systems of different levels for parallel experiments, to exhibit the proper usage of BLEU metric. So first, we built two simple systems with only 1/50 and 1/10 of a fraction of the training data, called Sys-1/50 and Sys-1/10 respectively. These two systems use the base SMT approach as Sys-Basic. The third one, called Sys-Basic, is trained with all sentence pairs, rather than using position-sensitive models trained with specific sentence pairs at certain positions, and a coherence model. Then, we evaluated the effectiveness of two advanced systems obtained by respectively adding the position-sensitive models and the coherence model to the baseline SMT System with the whole data set. The two advanced systems are called Sys-Position that means Position-sensitive models and Sys-Coherence that denotes Coherence model. estimate the weights in formulas (1), (2), and (4), we use the Minimum Error Rate Training (MERT) algorithm, BLEU Metric The BLEU metric is widely used for automatic evaluation of machine translation systems. t ranges from 0 to 1 and a higher BLEU score stands for better translation quality. n the work of couplet generation with an SMT approach (Jiang and Zhou, 2008), they use BLEU for automatically evaluating generated couplets. We think it s also useful for evaluating poem-generating systems as couplet is a special type of poem. Here we use 1-gram for BLEU because in ancient Chinese most words consist of only one character. Build Reference for BLEU The biggest difficulty in using BLEU to evaluate the poems is to get the reference sentences. n order to compute BLEU, we need references made by humans. n (Jiang and Zhou, 2008), given the first sentence of a couplet, they collect a set of second sentences written by humans and then compare the n-gram precision between the next sentences generated by the machine and humans. They use an online forum to collect references for a certain first line. n the online forum, someone challenges the other users by giving a first line of the couplet and users give their own answers for the next line, which are collected as references. Different from couplets, in the poetry generation, given the same key words and the first sentence, the poems generated by humans can be quite diverse, thus a numerous number of next sentences with different writing styles and characters are required to ensure the evaluation quality. Due to there not being a large enough number of poets, an automatic reference selecting method is used based on the following idea: f two sentences share similar keywords, the next sentences can be references for each other. For example, and share keywords and, the next sentence of is while the next one of is. We see that and also share the similar contents and emotions. Thus can be the reference of. We refer to "ShiXueHanYing", the taxonomy mentioned in the previous sections. Given a first sentence S1, we extract several keywords, say keywords A, B and C in the taxonomy, and generate a keyword set consisting of both the extracted keywords from the first sentence and the related keywords in the same directory in the taxonomy, say {A, A 1, A 2, B, B 1,, C, C 1, C 2, C 3,.}, which represents the meaning of the first sentence. Then if another sentence S2 has keywords A1, B1 and C3, S1 and S2 are similar ones. The next sentences of S1 and S2 are references for each other. n this way, a numerous number of next sentences with similar keyword sets of the first sentences can be selected from about 30,000 famous classical Chinese poems, such as <Tang Poems>. With this method, we obtained a testing set of 2050 poems with 6150 sentences pairs. Each poem has 3 sentence pairs to be evaluated, the first-second, second-third, and thirdfourth pairs. Results are shown in the following sections. 1654
6 BLEU vs. Human Evaluation To the best of our knowledge, there is no previous work about poetry evaluation with the BLEU metric. n order to prove its soundness, we need to conduct human evaluations on poems and compare them with the BLEU results. Score/ Fluency Rhyme Coherence Meaning Criteria 3 Totally fluent Meet all Good Good Fluent Meet More or Has some 2 most less meaning Not fluent Not meet Not good not 1 meaningful Table 2 The criteria of sentence evaluation A clear criterion is necessary for human evaluation. n the books on poem generation, such as (Wang, 2002), the criteria for evaluating poems are discussed. Based on these books, we formed four simple criteria: Fluency, Rhyme, Coherence, and Meaning, as shown in Table 2, which the human annotators can easily follow. The annotator only needs to assign different scores according to the four criteria. After that, the score of each sentence is calculated by summing up the four separate scores. According to the obtained sentence-score, a sentence s quality is classified into three grades: A, B and C, with A standing for the best quality, B the middle and C the worst. Grades A, B and C are given if the sentence-score is at least 10, in range [7,9], and in range [4,6], respectively. Then the system-scores of different systems can be calculated by summing up all grades by letting A=3, B=2 and C=1. For example, the Basic SMT has 11 sentences with Grade A, 26 sentences with Grade B and 23 sentences with Grade C, thus its system-score is 3*11+2*26+1*23 = 108. The human evaluation are done over 20 5-char quatrains and 20 7-char quatrains with 120 sentence pairs because its time cost is much larger than automatic evaluation. We asked two human judges who are experts in traditional Chinese poems to give scores to the 20 5-char quatrains and 20 7-char quatrains. About 80% of the scores of the testing set are the same as the two judges. We got an average score for the non-consistent ones. Systems 7 char quatrain 5 char quatrain A B C Score A B C Score Sys 1/ Sys 1/ Sys Basic (whole data) +Sys Position +Sys Coherence Table 3 Human evaluation of different systems Systems 7 char quatrain 5 char quatrain Average BLEU Average BLEU Sys 1/ Sys 1/ Sys Basic(whole data) Sys Position Sys Coherence Table 4 BLEU evaluation of different systems We evaluated the five systems, three systems have the same model but different training data and two advanced systems are added new features, to see whether BLEU is well-correlated with human judgments. BLEU evaluation is conducted on the same data set of 40 quatrains. The high correlation coefficients of 0.95 in the 7-char quatrain and 0.97 for the 5-char quatrain indicate that BLEU can track human judgment in poetry generation as well as in machine translation. To illustrate our result, we show a linear regression of the human evaluation scores as a function of the BLEU score over 7-char and 5-char quatrains; see Figures 1 and 2. Figure 1 BLEU predicts human judgment in 7 char quatrains Figure 2 BLEU predicts human judgment in 5 char quatrain BLEU Results of Our Systems BLEU is proved to be reasonable for poem evaluation. n addition for the evaluation, another usage of BLEU is to consider it as the criteria for MERT training for weights of different features in the SMT approach and weights of different translation models and language models in our advanced approach in Section 5 and Section 6. We use MERT training to train the weights in formulas (1) (2) and (4). Our training data set for the BLEU metric contains 2000 classical Chinese poems different from the testing set chosen from <Tang Poems>, half of which are 7-char quatrains and other half are 5-char. Our testing data set for the BLEU metric contains 2050 classical Chinese quatrain poems (with 6150 sentence pairs) that are randomly chosen from the most famous Chinese 1655
7 poetry anthology <Tang Poems> except the ones in the training set. Half of them are 5-char quatrains and the other half are 7-char quatrains. Each poem has 3 sentence pairs to be tested and evaluated, namely the first-second, secondthird, and third-fourth pairs. From Tables 5 and 6, we can see that adding the position-sensitive translation models increases the BLEU score from to for the 5- char quatrain and from to for the 7-char quatrain, respectively. With further investigation, we find that the poems generated by adding the position-sensitive translation models have better semantic structure than the basic SMT approach. Moreover, we added both position-sensitive models and the coherence model to the basic system. We observe that the BLEU score of a 7-char quatrain increases from to while that of a 5-char quatrain increases from to 0.141, which shows that the coherence model works well. Human evaluations are done over a small set of 40 samples from testing data set and the scores are increasing with the advanced models added. See Table 7. Systems Average Sys Basic Sys Position Sys Coherence Table 5 The BLEU scores of 7 char quatrain Cases Average Sys Basic Sys Position Sys Coherence Table 6 The BLEU scores of 5 char quatrain Systems 7 char quatrain 5 char quatrain A B C Score A B C Score Sys Basic Sys Position Sys Coherence Table 7 Human evaluation over 40 sample poems As an illustration of the quality of the generated quatrains, a 5-char quatrain generated by the advanced SMT approach accepting three keywords (spring), (lute), (drunk) is shown below. ( ) With two eyes, watching the autumn river ( ) With one hand, playing the springtime lute ( ) With song done and lute exhausted ( ) drunkenly fall into a dream Conclusion and Future Work Poetry generation is a difficult problem in the field of text generation. We propose a novel approach combining a statistical machine translation model with an ancient poetic phrase taxonomy to generate the Chinese quatrain. According to the evaluation of the generated poems, our method achieves impressive results. As the SMT model we applied is data driven, our method is language independent which means it can be extended to poetry-generation in other languages if the poetry data is readily available at enough scale. The application system can also be improved to be interactive with human in the poem-generating process. We have many issues worth further investigation. Both our template based generation of the first sentence in Section 4 and the generalization of references in Section 7.2 are based on the taxonomy <ShiXueHanYing>. As a poem dictionary collected by human experts in Qing Dynasty, its coverage is very limited and many words are not used now. As a future exploration, we want to reinforce this taxonomy by automatic mining synonyms and associated words, and by learning new concepts from large corpus to improve the current keyword set of <ShiXueHanYing>. We are also interested in extending our methods to English poem generalization. References Engelmore, R., and Morgan, A. eds Blackboard Systems. Reading, Mass.: Addison Wesley. Long Jiang, Ming Zhou. Generating Chinese Couplets using a Statistical MT Approach. n: The 22nd nternational Conference on Computational Linguistics, Manchester, England, August Philipp Koehn, Franz Josef Och, and Daniel Marcu. Statistical Phrase Based Translation. HLT/NAACL Wenwei Liu. ShiXueHanYing. ( ), Mani,., E. Bloedorn, and B. Gates. Using Cohesion and Coherence Models for Text Summarization. n Working Notes of the AAA 98 Spring Symposium on ntelligent Text Summarization, Stanford, CA, Franz Josef Och, Hermann Ney Discriminative Training and Maximum Entropy Models for Statistical Machine Translation. n "Proc. of the 40th Annual Meeting of the Association for Computational Linguistics" pp , Philadelphia, PA, July Franz Josef Och Minimum Error Rate Training for Statistical Machine Translation. n "ACL 2003: Proc. of the 41st Annual Meeting of the Association for Computational Linguistics", Japan, Sapporo, July H. G. Oliveira, Automatic Generation of Poetry: an Overview. n: 1st Seminar of Art, Music, Creativity and Artificial ntelligence, K. Papineni, S. Roukos, T. Ward and W. J. Zhu. BLEU: a Method for automatic evaluation of machine translation. n Proc. of the 40th Meeting of the Association for Computational Linguistics, Li Wang. A Summary of Rhyming Constraints of Chinese Poems ( ). Beijing Press. August,
The XMU Phrase-Based Statistical Machine Translation System for IWSLT 2006
The XMU Phrase-Based Statistical Machine Translation System for IWSLT 2006 Yidong Chen, Xiaodong Shi Institute of Artificial Intelligence Xiamen University P. R. China November 28, 2006 - Kyoto 13:46 1
THUTR: A Translation Retrieval System
THUTR: A Translation Retrieval System Chunyang Liu, Qi Liu, Yang Liu, and Maosong Sun Department of Computer Science and Technology State Key Lab on Intelligent Technology and Systems National Lab for
Automatic Mining of Internet Translation Reference Knowledge Based on Multiple Search Engines
, 22-24 October, 2014, San Francisco, USA Automatic Mining of Internet Translation Reference Knowledge Based on Multiple Search Engines Baosheng Yin, Wei Wang, Ruixue Lu, Yang Yang Abstract With the increasing
An Iteratively-Trained Segmentation-Free Phrase Translation Model for Statistical Machine Translation
An Iteratively-Trained Segmentation-Free Phrase Translation Model for Statistical Machine Translation Robert C. Moore Chris Quirk Microsoft Research Redmond, WA 98052, USA {bobmoore,chrisq}@microsoft.com
SYSTRAN Chinese-English and English-Chinese Hybrid Machine Translation Systems for CWMT2011 SYSTRAN 混 合 策 略 汉 英 和 英 汉 机 器 翻 译 系 CWMT2011 技 术 报 告
SYSTRAN Chinese-English and English-Chinese Hybrid Machine Translation Systems for CWMT2011 Jin Yang and Satoshi Enoue SYSTRAN Software, Inc. 4444 Eastgate Mall, Suite 310 San Diego, CA 92121, USA E-mail:
Turker-Assisted Paraphrasing for English-Arabic Machine Translation
Turker-Assisted Paraphrasing for English-Arabic Machine Translation Michael Denkowski and Hassan Al-Haj and Alon Lavie Language Technologies Institute School of Computer Science Carnegie Mellon University
Adaptive Development Data Selection for Log-linear Model in Statistical Machine Translation
Adaptive Development Data Selection for Log-linear Model in Statistical Machine Translation Mu Li Microsoft Research Asia [email protected] Dongdong Zhang Microsoft Research Asia [email protected]
The multilayer sentiment analysis model based on Random forest Wei Liu1, Jie Zhang2
2nd International Conference on Advances in Mechanical Engineering and Industrial Informatics (AMEII 2016) The multilayer sentiment analysis model based on Random forest Wei Liu1, Jie Zhang2 1 School of
Dublin City University at CLEF 2004: Experiments with the ImageCLEF St Andrew s Collection
Dublin City University at CLEF 2004: Experiments with the ImageCLEF St Andrew s Collection Gareth J. F. Jones, Declan Groves, Anna Khasin, Adenike Lam-Adesina, Bart Mellebeek. Andy Way School of Computing,
A New Input Method for Human Translators: Integrating Machine Translation Effectively and Imperceptibly
Proceedings of the Twenty-Fourth International Joint Conference on Artificial Intelligence (IJCAI 2015) A New Input Method for Human Translators: Integrating Machine Translation Effectively and Imperceptibly
Topics in Computational Linguistics. Learning to Paraphrase: An Unsupervised Approach Using Multiple-Sequence Alignment
Topics in Computational Linguistics Learning to Paraphrase: An Unsupervised Approach Using Multiple-Sequence Alignment Regina Barzilay and Lillian Lee Presented By: Mohammad Saif Department of Computer
Statistical Machine Translation
Statistical Machine Translation Some of the content of this lecture is taken from previous lectures and presentations given by Philipp Koehn and Andy Way. Dr. Jennifer Foster National Centre for Language
Discourse Markers in English Writing
Discourse Markers in English Writing Li FENG Abstract Many devices, such as reference, substitution, ellipsis, and discourse marker, contribute to a discourse s cohesion and coherence. This paper focuses
Virginia English Standards of Learning Grade 8
A Correlation of Prentice Hall Writing Coach 2012 To the Virginia English Standards of Learning A Correlation of, 2012, Introduction This document demonstrates how, 2012, meets the objectives of the. Correlation
SYSTRAN 混 合 策 略 汉 英 和 英 汉 机 器 翻 译 系 统
SYSTRAN Chinese-English and English-Chinese Hybrid Machine Translation Systems Jin Yang, Satoshi Enoue Jean Senellart, Tristan Croiset SYSTRAN Software, Inc. SYSTRAN SA 9333 Genesee Ave. Suite PL1 La Grande
Automatic Speech Recognition and Hybrid Machine Translation for High-Quality Closed-Captioning and Subtitling for Video Broadcast
Automatic Speech Recognition and Hybrid Machine Translation for High-Quality Closed-Captioning and Subtitling for Video Broadcast Hassan Sawaf Science Applications International Corporation (SAIC) 7990
7-2 Speech-to-Speech Translation System Field Experiments in All Over Japan
7-2 Speech-to-Speech Translation System Field Experiments in All Over Japan We explain field experiments conducted during the 2009 fiscal year in five areas of Japan. We also show the experiments of evaluation
Genetic Algorithm-based Multi-Word Automatic Language Translation
Recent Advances in Intelligent Information Systems ISBN 978-83-60434-59-8, pages 751 760 Genetic Algorithm-based Multi-Word Automatic Language Translation Ali Zogheib IT-Universitetet i Goteborg - Department
Appraise: an Open-Source Toolkit for Manual Evaluation of MT Output
Appraise: an Open-Source Toolkit for Manual Evaluation of MT Output Christian Federmann Language Technology Lab, German Research Center for Artificial Intelligence, Stuhlsatzenhausweg 3, D-66123 Saarbrücken,
Hybrid Machine Translation Guided by a Rule Based System
Hybrid Machine Translation Guided by a Rule Based System Cristina España-Bonet, Gorka Labaka, Arantza Díaz de Ilarraza, Lluís Màrquez Kepa Sarasola Universitat Politècnica de Catalunya University of the
Why Evaluation? Machine Translation. Evaluation. Evaluation Metrics. Ten Translations of a Chinese Sentence. How good is a given system?
Why Evaluation? How good is a given system? Machine Translation Evaluation Which one is the best system for our purpose? How much did we improve our system? How can we tune our system to become better?
Systematic Comparison of Professional and Crowdsourced Reference Translations for Machine Translation
Systematic Comparison of Professional and Crowdsourced Reference Translations for Machine Translation Rabih Zbib, Gretchen Markiewicz, Spyros Matsoukas, Richard Schwartz, John Makhoul Raytheon BBN Technologies
Machine Translation. Why Evaluation? Evaluation. Ten Translations of a Chinese Sentence. Evaluation Metrics. But MT evaluation is a di cult problem!
Why Evaluation? How good is a given system? Which one is the best system for our purpose? How much did we improve our system? How can we tune our system to become better? But MT evaluation is a di cult
Grade 4 Writing Curriculum Map
Curriculum Map BOE Approval November 2013 CURRICULUM MAP WRITING GRADE 4 SEPTEMBER OCTOBER NOVEMBER DECEMBER JANUARY Creating a Buzz About : It s All About Me! Tell Me A Story Realistic Fiction Tell Me
Micro blogs Oriented Word Segmentation System
Micro blogs Oriented Word Segmentation System Yijia Liu, Meishan Zhang, Wanxiang Che, Ting Liu, Yihe Deng Research Center for Social Computing and Information Retrieval Harbin Institute of Technology,
SEARCH ENGINE OPTIMIZATION USING D-DICTIONARY
SEARCH ENGINE OPTIMIZATION USING D-DICTIONARY G.Evangelin Jenifer #1, Mrs.J.Jaya Sherin *2 # PG Scholar, Department of Electronics and Communication Engineering(Communication and Networking), CSI Institute
Statistical Machine Translation: IBM Models 1 and 2
Statistical Machine Translation: IBM Models 1 and 2 Michael Collins 1 Introduction The next few lectures of the course will be focused on machine translation, and in particular on statistical machine translation
Language Modeling. Chapter 1. 1.1 Introduction
Chapter 1 Language Modeling (Course notes for NLP by Michael Collins, Columbia University) 1.1 Introduction In this chapter we will consider the the problem of constructing a language model from a set
Scenario: Optimization of Conference Schedule.
MINI PROJECT 1 Scenario: Optimization of Conference Schedule. A conference has n papers accepted. Our job is to organize them in a best possible schedule. The schedule has p parallel sessions at a given
Extraction of Chinese Compound Words An Experimental Study on a Very Large Corpus
Extraction of Chinese Compound Words An Experimental Study on a Very Large Corpus Jian Zhang Department of Computer Science and Technology of Tsinghua University, China [email protected]
Statistical Machine Translation Lecture 4. Beyond IBM Model 1 to Phrase-Based Models
p. Statistical Machine Translation Lecture 4 Beyond IBM Model 1 to Phrase-Based Models Stephen Clark based on slides by Philipp Koehn p. Model 2 p Introduces more realistic assumption for the alignment
Collecting Polish German Parallel Corpora in the Internet
Proceedings of the International Multiconference on ISSN 1896 7094 Computer Science and Information Technology, pp. 285 292 2007 PIPS Collecting Polish German Parallel Corpora in the Internet Monika Rosińska
Big Data Analytics of Multi-Relationship Online Social Network Based on Multi-Subnet Composited Complex Network
, pp.273-284 http://dx.doi.org/10.14257/ijdta.2015.8.5.24 Big Data Analytics of Multi-Relationship Online Social Network Based on Multi-Subnet Composited Complex Network Gengxin Sun 1, Sheng Bin 2 and
Building a Web-based parallel corpus and filtering out machinetranslated
Building a Web-based parallel corpus and filtering out machinetranslated text Alexandra Antonova, Alexey Misyurev Yandex 16, Leo Tolstoy St., Moscow, Russia {antonova, misyurev}@yandex-team.ru Abstract
Fault Analysis in Software with the Data Interaction of Classes
, pp.189-196 http://dx.doi.org/10.14257/ijsia.2015.9.9.17 Fault Analysis in Software with the Data Interaction of Classes Yan Xiaobo 1 and Wang Yichen 2 1 Science & Technology on Reliability & Environmental
The Prague Bulletin of Mathematical Linguistics NUMBER 96 OCTOBER 2011 49 58. Ncode: an Open Source Bilingual N-gram SMT Toolkit
The Prague Bulletin of Mathematical Linguistics NUMBER 96 OCTOBER 2011 49 58 Ncode: an Open Source Bilingual N-gram SMT Toolkit Josep M. Crego a, François Yvon ab, José B. Mariño c c a LIMSI-CNRS, BP 133,
Cross-Lingual Concern Analysis from Multilingual Weblog Articles
Cross-Lingual Concern Analysis from Multilingual Weblog Articles Tomohiro Fukuhara RACE (Research into Artifacts), The University of Tokyo 5-1-5 Kashiwanoha, Kashiwa, Chiba JAPAN http://www.race.u-tokyo.ac.jp/~fukuhara/
Interpreting areading Scaled Scores for Instruction
Interpreting areading Scaled Scores for Instruction Individual scaled scores do not have natural meaning associated to them. The descriptions below provide information for how each scaled score range should
AK + ASD Writing Grade Level Expectations For Grades 3-6
Revised ASD June 2004 AK + ASD Writing For Grades 3-6 The first row of each table includes a heading that summarizes the performance standards, and the second row includes the complete performance standards.
CS 533: Natural Language. Word Prediction
CS 533: Natural Language Processing Lecture 03 N-Gram Models and Algorithms CS 533: Natural Language Processing Lecture 01 1 Word Prediction Suppose you read the following sequence of words: Sue swallowed
Facebook Friend Suggestion Eytan Daniyalzade and Tim Lipus
Facebook Friend Suggestion Eytan Daniyalzade and Tim Lipus 1. Introduction Facebook is a social networking website with an open platform that enables developers to extract and utilize user information
Common Core Progress English Language Arts
[ SADLIER Common Core Progress English Language Arts Aligned to the [ Florida Next Generation GRADE 6 Sunshine State (Common Core) Standards for English Language Arts Contents 2 Strand: Reading Standards
Convergence of Translation Memory and Statistical Machine Translation
Convergence of Translation Memory and Statistical Machine Translation Philipp Koehn and Jean Senellart 4 November 2010 Progress in Translation Automation 1 Translation Memory (TM) translators store past
KINDGERGARTEN. Listen to a story for a particular reason
KINDGERGARTEN READING FOUNDATIONAL SKILLS Print Concepts Follow words from left to right in a text Follow words from top to bottom in a text Know when to turn the page in a book Show spaces between words
Machine Translation. Agenda
Agenda Introduction to Machine Translation Data-driven statistical machine translation Translation models Parallel corpora Document-, sentence-, word-alignment Phrase-based translation MT decoding algorithm
D36. Core Analysis Frame: Poetry. Examine Content. Examine Form and Structure. (continued on page D37)
Core Analysis Frame: Poetry D36 The questions on this analysis frame will help you achieve a basic understanding of any poem you read. For more advanced, in-depth analysis of each element, use the following
DIFFICULTIES AND SOME PROBLEMS IN TRANSLATING LEGAL DOCUMENTS
DIFFICULTIES AND SOME PROBLEMS IN TRANSLATING LEGAL DOCUMENTS Ivanka Sakareva Translation of legal documents bears its own inherent difficulties. First we should note that this type of translation is burdened
Chapter 5. Phrase-based models. Statistical Machine Translation
Chapter 5 Phrase-based models Statistical Machine Translation Motivation Word-Based Models translate words as atomic units Phrase-Based Models translate phrases as atomic units Advantages: many-to-many
A Neural Network and Web-Based Decision Support System for Forex Forecasting and Trading
A Neural Network and Web-Based Decision Support System for Forex Forecasting and Trading K.K. Lai 1, Lean Yu 2,3, and Shouyang Wang 2,4 1 Department of Management Sciences, City University of Hong Kong,
Chinese-Japanese Machine Translation Exploiting Chinese Characters
Chinese-Japanese Machine Translation Exploiting Chinese Characters CHENHUI CHU, TOSHIAKI NAKAZAWA, DAISUKE KAWAHARA, and SADAO KUROHASHI, Kyoto University The Chinese and Japanese languages share Chinese
How to Design and Interpret a Multiple-Choice-Question Test: A Probabilistic Approach*
Int. J. Engng Ed. Vol. 22, No. 6, pp. 1281±1286, 2006 0949-149X/91 $3.00+0.00 Printed in Great Britain. # 2006 TEMPUS Publications. How to Design and Interpret a Multiple-Choice-Question Test: A Probabilistic
Making Reading Content Comprehensible for Intermediate Language Learners. Colin Dalton. University of Houston-Downtown, United States
Making Reading Content Comprehensible for Intermediate Language Learners Colin Dalton University of Houston-Downtown, United States 0135 The European Conference on Language Learning 2013 Official Conference
The Combination Forecasting Model of Auto Sales Based on Seasonal Index and RBF Neural Network
, pp.67-76 http://dx.doi.org/10.14257/ijdta.2016.9.1.06 The Combination Forecasting Model of Auto Sales Based on Seasonal Index and RBF Neural Network Lihua Yang and Baolin Li* School of Economics and
Students will know Vocabulary: claims evidence reasons relevant accurate phrases/clauses credible source (inc. oral) formal style clarify
Sixth Grade Writing : Text Types and Purposes Essential Questions: 1. How do writers select the genre of writing for a specific purpose and audience? 2. How do essential components of the writing process
Wikipedia and Web document based Query Translation and Expansion for Cross-language IR
Wikipedia and Web document based Query Translation and Expansion for Cross-language IR Ling-Xiang Tang 1, Andrew Trotman 2, Shlomo Geva 1, Yue Xu 1 1Faculty of Science and Technology, Queensland University
Method of Fault Detection in Cloud Computing Systems
, pp.205-212 http://dx.doi.org/10.14257/ijgdc.2014.7.3.21 Method of Fault Detection in Cloud Computing Systems Ying Jiang, Jie Huang, Jiaman Ding and Yingli Liu Yunnan Key Lab of Computer Technology Application,
Units of Study 9th Grade
Units of Study 9th Grade First Semester Theme: The Journey Second Semester Theme: Choices The Big Ideas in English Language Arts that drive instruction: Independent thinkers construct meaning through language.
Network Machine Learning Research Group. Intended status: Informational October 19, 2015 Expires: April 21, 2016
Network Machine Learning Research Group S. Jiang Internet-Draft Huawei Technologies Co., Ltd Intended status: Informational October 19, 2015 Expires: April 21, 2016 Abstract Network Machine Learning draft-jiang-nmlrg-network-machine-learning-00
Overview of MT techniques. Malek Boualem (FT)
Overview of MT techniques Malek Boualem (FT) This section presents an standard overview of general aspects related to machine translation with a description of different techniques: bilingual, transfer,
Predict the Popularity of YouTube Videos Using Early View Data
000 001 002 003 004 005 006 007 008 009 010 011 012 013 014 015 016 017 018 019 020 021 022 023 024 025 026 027 028 029 030 031 032 033 034 035 036 037 038 039 040 041 042 043 044 045 046 047 048 049 050
Tagging with Hidden Markov Models
Tagging with Hidden Markov Models Michael Collins 1 Tagging Problems In many NLP problems, we would like to model pairs of sequences. Part-of-speech (POS) tagging is perhaps the earliest, and most famous,
Research on the UHF RFID Channel Coding Technology based on Simulink
Vol. 6, No. 7, 015 Research on the UHF RFID Channel Coding Technology based on Simulink Changzhi Wang Shanghai 0160, China Zhicai Shi* Shanghai 0160, China Dai Jian Shanghai 0160, China Li Meng Shanghai
Sentiment analysis on news articles using Natural Language Processing and Machine Learning Approach.
Sentiment analysis on news articles using Natural Language Processing and Machine Learning Approach. Pranali Chilekar 1, Swati Ubale 2, Pragati Sonkambale 3, Reema Panarkar 4, Gopal Upadhye 5 1 2 3 4 5
BE CREATIVE AND DIFFERENT! FEEL FREE TO USE YOUR IMAGINATION WHEN CREATING YOUR ASSIGNMENTS. GROUP PRESENTATION DUE
Short Story Project This is your chance to help the class do well on literature quizzes. You will present the short story assigned to you in a meaningful way. Please make sure that you cover the basic
SIXTH GRADE UNIT 1. Reading: Literature
Reading: Literature Writing: Narrative RL.6.1 RL.6.2 RL.6.3 RL.6.4 RL.6.5 RL.6.6 RL.6.7 W.6.3 SIXTH GRADE UNIT 1 Key Ideas and Details Cite textual evidence to support analysis of what the text says explicitly
1 o Semestre 2007/2008
Departamento de Engenharia Informática Instituto Superior Técnico 1 o Semestre 2007/2008 Outline 1 2 3 4 5 Outline 1 2 3 4 5 Exploiting Text How is text exploited? Two main directions Extraction Extraction
Tibetan-Chinese Bilingual Sentences Alignment Method based on Multiple Features
, pp.273-280 http://dx.doi.org/10.14257/ijdta.2015.8.4.27 Tibetan-Chinese Bilingual Sentences Alignment Method based on Multiple Features Lirong Qiu School of Information Engineering, MinzuUniversity of
A Joint Sequence Translation Model with Integrated Reordering
A Joint Sequence Translation Model with Integrated Reordering Nadir Durrani, Helmut Schmid and Alexander Fraser Institute for Natural Language Processing University of Stuttgart Introduction Generation
Words Their Way TM. Word Study in Action. Correlated to: North Carolina STANDARD COURSE OF STUDY Language Arts for Third Grade
TM Word Study in Action Correlated to: North Carolina STANDARD COURSE OF STUDY Language Arts for Third Grade For More Information Contact Debbie Owens, Sales Representative 1-800-435-3499 Ext. 8092 www.pearsonlearning.com
Improving MT System Using Extracted Parallel Fragments of Text from Comparable Corpora
mproving MT System Using Extracted Parallel Fragments of Text from Comparable Corpora Rajdeep Gupta, Santanu Pal, Sivaji Bandyopadhyay Department of Computer Science & Engineering Jadavpur University Kolata
Search and Information Retrieval
Search and Information Retrieval Search on the Web 1 is a daily activity for many people throughout the world Search and communication are most popular uses of the computer Applications involving search
CONCEPTUAL MODEL OF MULTI-AGENT BUSINESS COLLABORATION BASED ON CLOUD WORKFLOW
CONCEPTUAL MODEL OF MULTI-AGENT BUSINESS COLLABORATION BASED ON CLOUD WORKFLOW 1 XINQIN GAO, 2 MINGSHUN YANG, 3 YONG LIU, 4 XIAOLI HOU School of Mechanical and Precision Instrument Engineering, Xi'an University
A Comparative Study on Sentiment Classification and Ranking on Product Reviews
A Comparative Study on Sentiment Classification and Ranking on Product Reviews C.EMELDA Research Scholar, PG and Research Department of Computer Science, Nehru Memorial College, Putthanampatti, Bharathidasan
WRITING A CRITICAL ARTICLE REVIEW
WRITING A CRITICAL ARTICLE REVIEW A critical article review briefly describes the content of an article and, more importantly, provides an in-depth analysis and evaluation of its ideas and purpose. The
Using Use Cases for requirements capture. Pete McBreen. 1998 McBreen.Consulting
Using Use Cases for requirements capture Pete McBreen 1998 McBreen.Consulting [email protected] All rights reserved. You have permission to copy and distribute the document as long as you make no changes
Minnesota K-12 Academic Standards in Language Arts Curriculum and Assessment Alignment Form Rewards Intermediate Grades 4-6
Minnesota K-12 Academic Standards in Language Arts Curriculum and Assessment Alignment Form Rewards Intermediate Grades 4-6 4 I. READING AND LITERATURE A. Word Recognition, Analysis, and Fluency The student
FUZZY CLUSTERING ANALYSIS OF DATA MINING: APPLICATION TO AN ACCIDENT MINING SYSTEM
International Journal of Innovative Computing, Information and Control ICIC International c 0 ISSN 34-48 Volume 8, Number 8, August 0 pp. 4 FUZZY CLUSTERING ANALYSIS OF DATA MINING: APPLICATION TO AN ACCIDENT
Phrase-Based MT. Machine Translation Lecture 7. Instructor: Chris Callison-Burch TAs: Mitchell Stern, Justin Chiu. Website: mt-class.
Phrase-Based MT Machine Translation Lecture 7 Instructor: Chris Callison-Burch TAs: Mitchell Stern, Justin Chiu Website: mt-class.org/penn Translational Equivalence Er hat die Prüfung bestanden, jedoch
A Load Balancing Algorithm based on the Variation Trend of Entropy in Homogeneous Cluster
, pp.11-20 http://dx.doi.org/10.14257/ ijgdc.2014.7.2.02 A Load Balancing Algorithm based on the Variation Trend of Entropy in Homogeneous Cluster Kehe Wu 1, Long Chen 2, Shichao Ye 2 and Yi Li 2 1 Beijing
An Automated Model Based Approach to Test Web Application Using Ontology
An Automated Model Based Approach to Test Web Application Using Ontology Hamideh Hajiabadi, Mohsen Kahani [email protected], [email protected] Computer Engineering Department, Ferdowsi
Tibetan Number Identification Based on Classification of Number Components in Tibetan Word Segmentation
Tibetan Number Identification Based on Classification of Number Components in Tibetan Word Segmentation Huidan Liu Institute of Software, Chinese Academy of Sciences, Graduate University of the [email protected]
Applied Mathematical Sciences, Vol. 7, 2013, no. 112, 5591-5597 HIKARI Ltd, www.m-hikari.com http://dx.doi.org/10.12988/ams.2013.
Applied Mathematical Sciences, Vol. 7, 2013, no. 112, 5591-5597 HIKARI Ltd, www.m-hikari.com http://dx.doi.org/10.12988/ams.2013.38457 Accuracy Rate of Predictive Models in Credit Screening Anirut Suebsing
Reliable and Cost-Effective PoS-Tagging
Reliable and Cost-Effective PoS-Tagging Yu-Fang Tsai Keh-Jiann Chen Institute of Information Science, Academia Sinica Nanang, Taipei, Taiwan 5 eddie,[email protected] Abstract In order to achieve
Adaptive Context-sensitive Analysis for JavaScript
Adaptive Context-sensitive Analysis for JavaScript Shiyi Wei and Barbara G. Ryder Department of Computer Science Virginia Tech Blacksburg, VA, USA {wei, ryder}@cs.vt.edu Abstract Context sensitivity is
Data Quality Mining: Employing Classifiers for Assuring consistent Datasets
Data Quality Mining: Employing Classifiers for Assuring consistent Datasets Fabian Grüning Carl von Ossietzky Universität Oldenburg, Germany, [email protected] Abstract: Independent
Subspace Analysis and Optimization for AAM Based Face Alignment
Subspace Analysis and Optimization for AAM Based Face Alignment Ming Zhao Chun Chen College of Computer Science Zhejiang University Hangzhou, 310027, P.R.China [email protected] Stan Z. Li Microsoft
ASSIGNMENT 4 PREDICTIVE MODELING AND GAINS CHARTS
DATABASE MARKETING Fall 2015, max 24 credits Dead line 15.10. ASSIGNMENT 4 PREDICTIVE MODELING AND GAINS CHARTS PART A Gains chart with excel Prepare a gains chart from the data in \\work\courses\e\27\e20100\ass4b.xls.
Indiana Department of Education
GRADE 1 READING Guiding Principle: Students read a wide range of fiction, nonfiction, classic, and contemporary works, to build an understanding of texts, of themselves, and of the cultures of the United
Query Recommendation employing Query Logs in Search Optimization
1917 Query Recommendation employing Query Logs in Search Optimization Neha Singh Department of Computer Science, Shri Siddhi Vinayak Group of Institutions, Bareilly Email: [email protected] Dr Manish
The Design and Improvement of a Software Project Management System Based on CMMI
Intelligent Information Management, 2012, 4, 330-337 http://dx.doi.org/10.4236/iim.2012.46037 Published Online November 2012 (http://www.scirp.org/journal/iim) The Design and Improvement of a Software
