Multilingual Aligned Corpora From Movie Subtitles
|
|
|
- Dylan Gaines
- 10 years ago
- Views:
Transcription
1 Multilingual Aligned Corpora From Movie Subtitles Mathieu Mangeot, Emmanuel Giguet To cite this version: Mathieu Mangeot, Emmanuel Giguet. Multilingual Aligned Corpora From Movie Subtitles <hal > HAL Id: hal Submitted on 2 Apr 2014 HAL is a multi-disciplinary open access archive for the deposit and dissemination of scientific research documents, whether they are published or not. The documents may come from teaching and research institutions in France or abroad, or from public or private research centers. L archive ouverte pluridisciplinaire HAL, est destinée au dépôt et à la diffusion de documents scientifiques de niveau recherche, publiés ou non, émanant des établissements d enseignement et de recherche français ou étrangers, des laboratoires publics ou privés.
2 Multilingual Aligned Corpora From Movie Subtitles Mathieu Mangeot Condillac-LISTIC Université de Savoie F Le Bourget du Lac Emmanuel Giguet GREYC Université de Caen BP 5186 F Caen Cedex Abstract This paper describes a methodology for building aligned multilingual corpora form movie subtitles found on the Web. The subtitles have specific formats and encodings. In a first step, we convert them to our multilingual subtitle format based on XML. In a second step, we align the subtitle sentences with the time used to display them on the screen. We implemented the tool Jimaku in order to semiautomatically perform these steps. The last step consists in aligning the sentences at the sub-sentence level and to index the corpus for contextual lookup. For this step, we use the WIMS platform, result of previous research on text collections management. 1 Introduction Nowadays, lots of Natural Language Processing works use multilingual aligned corpora. For example, they can help to build a terminological database or enrich a translation memory. In this paper, we tackle the building of such a resource with an original approach: the use of movie subtitles. As mentioned by (Simões, 2004), we use a positive side effect of DivX movie piracy on the Internet. It allows us to find an impressive amount of freely available movie subtitles free of rights in many languages. With this method, one can build quickly and with low cost multilingual aligned corpora by relying on a specific criterion of this data: the temporal indexing of subtitle lines. Furthermore, it allows one to align language pairs for which the opportunities to find other aligned corpora are very weak. One of the first planned use of these corpora is the building of multilingual examples for the Papillon dictionary with a pivot structure (Mangeot et al., 2004). We first present some sample of corpora in their original format as well as in our target XML representation then we detail the temporal alignment procedure. The next sections are dedicated to the presentation of the tools: Jimaku for the conversion and temporal alignment and WIMS platform (Giguet, 2005a) for aligning the sub-sentences and putting them into context with a concordance program. 2 Presentation of the corpora 2.1 The source: Websites for exchanging subtitles The growing number of Websites for exchanging movie subtitles followed naturally the increasing rate of DivX downloads. These sites are all based on the same principle: they offer a search in a subtitle database. The query of a subtitle is usually based on two criteria: the movie title and the language subtitle. Usually, the users have to sign in for downloading the subtitle matching their request. They can also upload missing subtitles on the server. The subtitle websites are most often specialized. Some by movie genre and most of them by language or language group. The majority of the subtitles found on these sites are written by groups of enthusiastic amateurs that begin their work as soon as the movie is available on the Internet. They are available freely but a more detailed study should be driven to check the legal aspects, regarding author copyrights and intellectual property. 2.2 The subtitle formats Lets present first three samples of subtitles of the movie "Ace Ventura: Pet Detective" in their original format (Figures 1, 2 and 3) but after conversion in UTF-8. The format of Figure 1 is the following. The time when the subtitle line display is started is on the first line. The time when the subtitle line display is ended is on the second line between brackets. The third line as well as the following lines until a new line between brackets represent the subtitle text that has to be displayed. Then, the following subtitle begins with a new line between brackets. If the text is on several lines, it will be displayed on several lines on the screen. This format is the SubViewer version2. The original encoding of the file is Shift-Jis (Japanese encoding). The SubRip format seen on Figure 2 is different. The subtitle line number is on the first line and begins by 1 (the first subtitle). The starting and ending time of the display of the subtitle are on the second line, separated by an arrow. The third line, as well as the following lines until an empty one represent the subtitle text that has to be displayed. THe following subtitle begins with a new line with its number only. the format of Figure 3 is the MicroDVD one. Every-
3 [00:01:58.920] [00:02:01.200] [00:02:09.720] [00:02:13.080] [00:02:13.720] [00:02:14.800] [00:02:15.040] [00:02:19.720] [00:02:22.760] [00:02:24.120] Figure 1: Sample of Japanese subtitle of the movie "Ace Ventura : Pet Detective" in SubViewer 2 format 3 00:01:58,932 > 00:02:01,207 Πάμε στην πόλη. 4 00:02:15,052 > 00:02:19,728 Απ' την ΗDS, κύριε. Τι κάνετε; Έχω ένα πακέτο για σας. 5 00:02:22,772 > 00:02:24,125 Ακούγεται σπασμένο. Figure 2: Sample of Greek subtitle of the movie "Ace Ventura : Pet Detective" in SubRip format thing is on the same line: he starting and ending time of the display of the subtitle are rounded curly brackets and the text follows. The line breaks are represented by the special character " ". The next line is the next subtitle. { 1600}{ 1670} Dobra obramba! { 2983}{ 3077} Gremo proti srediču mesta! { 3353}{ 3380} Kaj hoče? { 3386}{ 3540} HDS, gospod. Kako ste kaj to popoldne? Ureduček potem. Paket imam za vas. { 3579}{ 3613} Slii se zlomljeno. Figure 3: Sample of Slovene subtitle of the movie "Ace Ventura : Pet Detective" in MicroDVD format 3 Standardization of the formats: XML-Jimaku 3.1 A Large Variety of Formats To our knowledge, there is no standard for encoding movie subtitles. THus, there is a large variety of formats. Every subtitle writing tool propose its format: SubViewer 1 and 2 (extension.sub), SubStation Alpha (.sst), SubRip (.srt), SAMI (.smi), etc. Furthermore, some movie viewers also propose their own format (Quicktime for example). In order to use these subtitles in a multilingual alignment perspective, we chose to convert them into a common format. We thus had to precise the data that need to be standardized. We particularly focused on temporal indexing, subtitles rendering and characters encodings : Temporal indexing the subtitles are usually indexed on the time in milliseconds from the beginning of the movie video track. But some formats like MicroDVD index the subtitles with the video frame number. This information has to be taken into account because we need to convert them in order to align them on the time with the others. Rendering Most of the formats do not add rendering information on the subtitle text that has to be displayed. In some formats, the line breaks are represented by a special character (" " for MicroDVD). Some formats propose rendering for some parts of text, mainly bold and italics. THese informatoins are usually represented by SGMLlike tags (<b> and <i>). Character encodings Despite the creation of Unicode in 1992, we did not find any format that uses by default an encoding of the Unicode table like UTF-8. Furthermore, there is no link between the format of a subtitle and its encoding. One must note the file encoding when downloading it from a subtitle Website if this type of information is available, or use an encoding guesser. 3.2 The XML-Jimaku Format We chose a common format named Jimaku, based on XML and Unicode. It can represent all the existing encodings. The root element of the format is jimaku. It contains the elements titles (for storing the titles in their multilingual diversity), style (for linking rendering information), head (for the usual meta-information) and subtitles (for storing subtitles with temporal indexing). The subtitles element contains a list of st elements. The element st represents a subtitle. It contains the elements start and end for the temporal indexing of the subtitle and a list of text elements with, for each an attribute xml:lang that indicates the language of the subtitle at the format ISO The DTD of the Jimaku format is available at the
4 following address: services/dml/jimaku.dtd 3.3 A Jimaku Subtitle Example The Figure 4 is a sample of a multilingual subtitle of the movie "Ace Ventura: Pet Detective" in Jimaku format. It has been obtained from the conversion and alignment of previous subtitles. <jimaku version="1.0" ov="en" languages="de,el,en,ja,nl,sl,sv"> <titles> <title xml:lang="en">ace Ventura - Pet Detective</title> </titles> <subtitles> [...] <st n="10"> <start>00:02:26.682</start> <end>00:02:27.722</end> <txt xml:lang="de">was?</txt> <txt xml:lang="en">what do you want?</txt> <txt xml:lang="ja"> </txt> <txt xml:lang="sl">kaj hoče?</txt> </st> <st n="11"> <start>00:02:28.002</start> <end>00:02:32.682</end> <txt xml:lang="de">hds, Sir. Wie geht es lhnen?<br/>also gut. Ein Paket fr Sie.</txt> <txt xml:lang="el">απ' την ΗDS, κύριε. Τι κάνετε;<br/>έχω ένα πακέτο για σας.</txt> <txt xml:lang="en">hds, sir. How are you this afternoon?<br/>alrighty. I have a package for you.</txt> <txt xml:lang="ja"> <br/> </txt> <txt xml:lang="nl">- Wat moet je?<br/>- HDS. Alles goed? Mooi zo.</txt> <txt xml:lang="sl">hds, gospod. Kako ste kaj to popoldne? Ureduek potem. Paket imam za vas.</txt> <txt xml:lang="sv">hds, sir. Hur str det till i dag?<br/>jag har ett paket t er.</txt> </st> [...] </subtitles> Figure 4: Sample of multilingual subtitle of the movie "Ace Ventura : Pet Detective" in XML-Jimaku format 4 Subtitle Alignment 4.1 Manual Alignment Even if they are created for the same movie, the different subtitles have very often a time shift between each others. There are several reasons for that: the video track used to create the subtitle is not always the same, the videos do not start at the same time, etc... Furthermore, the conversion between formats and particularly between frame and time indexed formats can generate time shifts because of the frames per second rate of the video. Before the subtitle alignments, it is thus necessary to reset the subtitles alignment in order to set them on the same video track (hypothetical). The easiest method consists in aligning a subtitle line at the beginning of each file and another one at the end. Then, the computation is propagated on all other subtitle lines. 4.2 Automatic Alignment on the Time The alignment of several subtitles on the time measured between the beginning of the movie and the display of a subtitle on the screen is the core step of our methodology. Some subtitles are built from copies of other existing subtitles. Every line is translated in a new language and the time is not changed. These subtitles do not cause any particular problem because they are already aligned. We just gather the lines that have the same time. The previous case do not occurs very often. Most of the time, every subtitle file is built independently fro the others directly from the original video track. As a result, the times are never identical at the milli-second level. A third case also occurs frequently: when the subtitle lines are indexed on the frame numbers in the original format, they must be first converted into time indexing. Here also, the time cannot perfectly match. In order to handle the last two cases, the subtitle line alignments is done on the time with a variation (delta t). We fixed empirically this delta t at 500 milliseconds. Above, the risk to align un-corresponding subtitle lines thus that are not translations the ones from the others is too important. 5 Subtitle Management with Jimaku Tool 5.1 Presentation of the Tool The Jimaku tool (subtitle in Japanese) has been designed and implemented by Mathieu Mangeot and David Thevenin on a common original idea. the first version was launched in for the moment, this tool is not downloadable. Nevertheless, interested people can contact us. The first aim of this tool was to be able to reset the alignment of a subtitle on its video track. The format
5 and encoding conversion functionalities where useful from the beginning. 5.2 Format Management Because of the large amount of formats, we had to adapt the structure of our tool in order to facilitate the management of a new format. For that purpose, a programmer just has to implement an interface (abstract java class) that can read from the input this new format and convert them into the Jimaku internal representation format and also to write to the output in this format. The formats already handled are the following: Jimaku multilingual format, based on XML, indexed on the time, extension.jmk MicroDVD monolingual format, indexed on the frame, extension.sub QuickTime monolingual format, indexed on the time, extension.txt SAMI multilingual format, based on HTML, indexed on the time, extension.smi SubRip monolingual format, indexed on the time, extension.srt shift can be done either in milliseconds, either by indicating the new time of a subtitle line, or in frame numbers. The stretch can be done either from 24 to 25 frames per second, or the contrary. The manual resetting is done by indicating the time of a subtitle line at the beginning of the movie and another one at the end. 5.5 Temporal Alignment Functions The temporal alignment of two files is launched when an external file is imported into an already opened one. The latter must be in Jimaku format because the result of the import is a multilingual subtitle file. During the import, the user has to specify the location of the file to import as well as its encoding and language. Then, the manual alignment interface appears (such as Figure 6). the subtitles must be aligned on two lines preferably far the one from the other. This is why the interface shows a window with the first subtitle lines and another window with the last ones. According to our experience, it is easier to align the subtitle lines containing proper nouns because in an unknown language it is easier to identify them than other words. The figure 6 shows an alignment first on "HDS" and then on "Mr. Ace Ventura". SubViewer 1 and 2 monolingual format, indexed on the time, extension.sub SubStation Alpha monolingual format, indexed on the time, extension.ssa When a subtitle file is opened into Jimaku, the format is automatically detected and the subtitles are converted into the internal representation format. The subtitles can also be edited and the alignment modified. The "Save" command save the changes in the original format. The "Export" command allows the user to convert the subtitle file into another format and write a result file in output. 5.3 Encoding Detection The encoding and file languages are almost never specified in the subtitle file contents. If the information has not been obtained elsewhere, the user has to test several encodings in order to find the correct original encoding of the file. The Jimaku tool provides a function to change the encoding on the fly. Note: the Jimaku format is always encoded in UTF-8, (furthermore, XML specifies the encoding of the files) thus there is no need to specify the encoding. 5.4 Temporal Resetting Functions As explained previously, it is often necessary to reset the alignment of a subtitle with the corresponding video track. the temporal resetting functions of Jimaku are the following: shifting all the subtitles by a period t, "stretching" the subtitles and manual resetting. The Figure 5: Manual subtitle alignment with Jimaku 6 Subtitle Management in WIMS Platform The multilingual subtitle files produced with Jimaku are aligned at the subtitle line level thanks to the temporal criterion. They are then imported into the WIMS platform. This platform manages the presentation of the documents, their diffusion, and also offers the opportunity to apply several linguistic analysis. WMS integrates, among others, a sub-sentence alignment module that we adapted in order to analyze the Jimaku files. The XML structure of Jimaku files is close to the TMX standard already handled by our alignment module. We wrote an XSLT stylesheet in order to convert the Jimaku files into the TMX format. The sub-sentence alignment method included in WIMS has been already published in several papers (Giguet, 2005b; Giguet and Apidianaki, 2005). We will thus present it only briefly in this paper. Lets just
6 the subtitle alignment with a maximum precision and minimum recall. The next step consists in improving again the Jimaku original alignment technique with the WIMS textual alignment ones. Figure 6: Subtitle presentation in WIMS stress the fact that the method is endogenous: it does not use any other resource apart from the available corpus. This approach can handle transparently poorly represented languages or specialized fields texts (medical for example). References Emmanuel Giguet and Marianna Apidianaki Alignement d unités textuelles de taille variable. In Journée Internationale de la Linguistique de Corpus, Lorient, France, septembre. Emmanuel Giguet. 2005a. Modélisation de l'activité expérimentale du chercheur en traitement des langues sur corpus multilingues. In Journée «Articuler les traitements sur corpus», organisée par Benoît Habert, Serge Heiden et André Salem, Paris, France, 12 février. Emmanuel Giguet. 2005b. Multi-grained alignment of parallel texts with endogenous resources. In Recent Advances in Natural Language Processing (RANLP) International Workshop New Trends in Machine Translations, Borovets, Bulgaria, 24 September. Mathieu Mangeot, Gilles Serasset, and Mathieu Lafourcade Construction collaborative d'une base lexicale multilingue. Traitement Automatique des Langues, 44(2): , February. Alberto Simões Parallel corpora word alignment and applications. Technical report, Escola de Engenharia - Universidade do Minho, Minho, Portugal. Figure 7: Subtitle alignment in WIMS 7 Conclusion We presented a low cost methodology for quickly building multilingual corpora covering an important amount of languages. The major disadvantage of these corpora is that they are limited to the movie subtitles. Nevertheless, we think that they can be very useful as we plan to demonstrate with the building of multilingual examples for Papillon dictionary. Our methodology can be improved mainly at the subtitle line alignment. The next planned step is to automatically calculate the delta t that would optimize
Mobility management and vertical handover decision making in heterogeneous wireless networks
Mobility management and vertical handover decision making in heterogeneous wireless networks Mariem Zekri To cite this version: Mariem Zekri. Mobility management and vertical handover decision making in
ibalance-abf: a Smartphone-Based Audio-Biofeedback Balance System
ibalance-abf: a Smartphone-Based Audio-Biofeedback Balance System Céline Franco, Anthony Fleury, Pierre-Yves Guméry, Bruno Diot, Jacques Demongeot, Nicolas Vuillerme To cite this version: Céline Franco,
A graph based framework for the definition of tools dealing with sparse and irregular distributed data-structures
A graph based framework for the definition of tools dealing with sparse and irregular distributed data-structures Serge Chaumette, Jean-Michel Lepine, Franck Rubi To cite this version: Serge Chaumette,
A usage coverage based approach for assessing product family design
A usage coverage based approach for assessing product family design Jiliang Wang To cite this version: Jiliang Wang. A usage coverage based approach for assessing product family design. Other. Ecole Centrale
Faut-il des cyberarchivistes, et quel doit être leur profil professionnel?
Faut-il des cyberarchivistes, et quel doit être leur profil professionnel? Jean-Daniel Zeller To cite this version: Jean-Daniel Zeller. Faut-il des cyberarchivistes, et quel doit être leur profil professionnel?.
QASM: a Q&A Social Media System Based on Social Semantics
QASM: a Q&A Social Media System Based on Social Semantics Zide Meng, Fabien Gandon, Catherine Faron-Zucker To cite this version: Zide Meng, Fabien Gandon, Catherine Faron-Zucker. QASM: a Q&A Social Media
FP-Hadoop: Efficient Execution of Parallel Jobs Over Skewed Data
FP-Hadoop: Efficient Execution of Parallel Jobs Over Skewed Data Miguel Liroz-Gistau, Reza Akbarinia, Patrick Valduriez To cite this version: Miguel Liroz-Gistau, Reza Akbarinia, Patrick Valduriez. FP-Hadoop:
Discussion on the paper Hypotheses testing by convex optimization by A. Goldenschluger, A. Juditsky and A. Nemirovski.
Discussion on the paper Hypotheses testing by convex optimization by A. Goldenschluger, A. Juditsky and A. Nemirovski. Fabienne Comte, Celine Duval, Valentine Genon-Catalot To cite this version: Fabienne
Aligning subjective tests using a low cost common set
Aligning subjective tests using a low cost common set Yohann Pitrey, Ulrich Engelke, Marcus Barkowsky, Romuald Pépion, Patrick Le Callet To cite this version: Yohann Pitrey, Ulrich Engelke, Marcus Barkowsky,
Additional mechanisms for rewriting on-the-fly SPARQL queries proxy
Additional mechanisms for rewriting on-the-fly SPARQL queries proxy Arthur Vaisse-Lesteven, Bruno Grilhères To cite this version: Arthur Vaisse-Lesteven, Bruno Grilhères. Additional mechanisms for rewriting
Study on Cloud Service Mode of Agricultural Information Institutions
Study on Cloud Service Mode of Agricultural Information Institutions Xiaorong Yang, Nengfu Xie, Dan Wang, Lihua Jiang To cite this version: Xiaorong Yang, Nengfu Xie, Dan Wang, Lihua Jiang. Study on Cloud
Online vehicle routing and scheduling with continuous vehicle tracking
Online vehicle routing and scheduling with continuous vehicle tracking Jean Respen, Nicolas Zufferey, Jean-Yves Potvin To cite this version: Jean Respen, Nicolas Zufferey, Jean-Yves Potvin. Online vehicle
Expanding Renewable Energy by Implementing Demand Response
Expanding Renewable Energy by Implementing Demand Response Stéphanie Bouckaert, Vincent Mazauric, Nadia Maïzi To cite this version: Stéphanie Bouckaert, Vincent Mazauric, Nadia Maïzi. Expanding Renewable
Towards Unified Tag Data Translation for the Internet of Things
Towards Unified Tag Data Translation for the Internet of Things Loïc Schmidt, Nathalie Mitton, David Simplot-Ryl To cite this version: Loïc Schmidt, Nathalie Mitton, David Simplot-Ryl. Towards Unified
ANIMATED PHASE PORTRAITS OF NONLINEAR AND CHAOTIC DYNAMICAL SYSTEMS
ANIMATED PHASE PORTRAITS OF NONLINEAR AND CHAOTIC DYNAMICAL SYSTEMS Jean-Marc Ginoux To cite this version: Jean-Marc Ginoux. ANIMATED PHASE PORTRAITS OF NONLINEAR AND CHAOTIC DYNAMICAL SYSTEMS. A.H. Siddiqi,
An update on acoustics designs for HVAC (Engineering)
An update on acoustics designs for HVAC (Engineering) Ken MARRIOTT To cite this version: Ken MARRIOTT. An update on acoustics designs for HVAC (Engineering). Société Française d Acoustique. Acoustics 2012,
VR4D: An Immersive and Collaborative Experience to Improve the Interior Design Process
VR4D: An Immersive and Collaborative Experience to Improve the Interior Design Process Amine Chellali, Frederic Jourdan, Cédric Dumas To cite this version: Amine Chellali, Frederic Jourdan, Cédric Dumas.
Overview of model-building strategies in population PK/PD analyses: 2002-2004 literature survey.
Overview of model-building strategies in population PK/PD analyses: 2002-2004 literature survey. Céline Dartois, Karl Brendel, Emmanuelle Comets, Céline Laffont, Christian Laveille, Brigitte Tranchand,
Use of tabletop exercise in industrial training disaster.
Use of tabletop exercise in industrial training disaster. Alexis Descatha, Thomas Loeb, François Dolveck, Nathalie-Sybille Goddet, Valerie Poirier, Michel Baer To cite this version: Alexis Descatha, Thomas
Global Identity Management of Virtual Machines Based on Remote Secure Elements
Global Identity Management of Virtual Machines Based on Remote Secure Elements Hassane Aissaoui, P. Urien, Guy Pujolle To cite this version: Hassane Aissaoui, P. Urien, Guy Pujolle. Global Identity Management
SELECTIVELY ABSORBING COATINGS
SELECTIVELY ABSORBING COATINGS J. Vuletin, P. Kuli ik, M. Bosanac To cite this version: J. Vuletin, P. Kuli ik, M. Bosanac. SELECTIVELY ABSORBING COATINGS. Journal de Physique Colloques, 1981, 42 (C1),
ANALYSIS OF SNOEK-KOSTER (H) RELAXATION IN IRON
ANALYSIS OF SNOEK-KOSTER (H) RELAXATION IN IRON J. San Juan, G. Fantozzi, M. No, C. Esnouf, F. Vanoni To cite this version: J. San Juan, G. Fantozzi, M. No, C. Esnouf, F. Vanoni. ANALYSIS OF SNOEK-KOSTER
Information Technology Education in the Sri Lankan School System: Challenges and Perspectives
Information Technology Education in the Sri Lankan School System: Challenges and Perspectives Chandima H. De Silva To cite this version: Chandima H. De Silva. Information Technology Education in the Sri
A model driven approach for bridging ILOG Rule Language and RIF
A model driven approach for bridging ILOG Rule Language and RIF Valerio Cosentino, Marcos Didonet del Fabro, Adil El Ghali To cite this version: Valerio Cosentino, Marcos Didonet del Fabro, Adil El Ghali.
Managing Risks at Runtime in VoIP Networks and Services
Managing Risks at Runtime in VoIP Networks and Services Oussema Dabbebi, Remi Badonnel, Olivier Festor To cite this version: Oussema Dabbebi, Remi Badonnel, Olivier Festor. Managing Risks at Runtime in
Territorial Intelligence and Innovation for the Socio-Ecological Transition
Territorial Intelligence and Innovation for the Socio-Ecological Transition Jean-Jacques Girardot, Evelyne Brunau To cite this version: Jean-Jacques Girardot, Evelyne Brunau. Territorial Intelligence and
Minkowski Sum of Polytopes Defined by Their Vertices
Minkowski Sum of Polytopes Defined by Their Vertices Vincent Delos, Denis Teissandier To cite this version: Vincent Delos, Denis Teissandier. Minkowski Sum of Polytopes Defined by Their Vertices. Journal
Cobi: Communitysourcing Large-Scale Conference Scheduling
Cobi: Communitysourcing Large-Scale Conference Scheduling Haoqi Zhang, Paul André, Lydia Chilton, Juho Kim, Steven Dow, Robert Miller, Wendy E. Mackay, Michel Beaudouin-Lafon To cite this version: Haoqi
Cracks detection by a moving photothermal probe
Cracks detection by a moving photothermal probe J. Bodnar, M. Egée, C. Menu, R. Besnard, A. Le Blanc, M. Pigeon, J. Sellier To cite this version: J. Bodnar, M. Egée, C. Menu, R. Besnard, A. Le Blanc, et
GDS Resource Record: Generalization of the Delegation Signer Model
GDS Resource Record: Generalization of the Delegation Signer Model Gilles Guette, Bernard Cousin, David Fort To cite this version: Gilles Guette, Bernard Cousin, David Fort. GDS Resource Record: Generalization
Advantages and disadvantages of e-learning at the technical university
Advantages and disadvantages of e-learning at the technical university Olga Sheypak, Galina Artyushina, Anna Artyushina To cite this version: Olga Sheypak, Galina Artyushina, Anna Artyushina. Advantages
What Development for Bioenergy in Asia: A Long-term Analysis of the Effects of Policy Instruments using TIAM-FR model
What Development for Bioenergy in Asia: A Long-term Analysis of the Effects of Policy Instruments using TIAM-FR model Seungwoo Kang, Sandrine Selosse, Nadia Maïzi To cite this version: Seungwoo Kang, Sandrine
IntroClassJava: A Benchmark of 297 Small and Buggy Java Programs
IntroClassJava: A Benchmark of 297 Small and Buggy Java Programs Thomas Durieux, Martin Monperrus To cite this version: Thomas Durieux, Martin Monperrus. IntroClassJava: A Benchmark of 297 Small and Buggy
Terminology Extraction from Log Files
Terminology Extraction from Log Files Hassan Saneifar, Stéphane Bonniol, Anne Laurent, Pascal Poncelet, Mathieu Roche To cite this version: Hassan Saneifar, Stéphane Bonniol, Anne Laurent, Pascal Poncelet,
Towards Collaborative Learning via Shared Artefacts over the Grid
Towards Collaborative Learning via Shared Artefacts over the Grid Cornelia Boldyreff, Phyo Kyaw, Janet Lavery, David Nutter, Stephen Rank To cite this version: Cornelia Boldyreff, Phyo Kyaw, Janet Lavery,
DEM modeling of penetration test in static and dynamic conditions
DEM modeling of penetration test in static and dynamic conditions Quoc Anh Tran, Bastien Chevalier, Pierre Breul To cite this version: Quoc Anh Tran, Bastien Chevalier, Pierre Breul. DEM modeling of penetration
Florin Paun. To cite this version: HAL Id: halshs-00628978 https://halshs.archives-ouvertes.fr/halshs-00628978
Demand Readiness Level (DRL), a new tool to hybridize Market Pull and Technology Push approaches. Introspective analysis of the new trends in Technology Transfer practices. Florin Paun To cite this version:
Methodology of organizational learning in risk management : Development of a collective memory for sanitary alerts
Methodology of organizational learning in risk management : Development of a collective memory for sanitary alerts Wim Van Wassenhove, Jean-Luc Wybo To cite this version: Wim Van Wassenhove, Jean-Luc Wybo.
Optimization results for a generalized coupon collector problem
Optimization results for a generalized coupon collector problem Emmanuelle Anceaume, Yann Busnel, Ernst Schulte-Geers, Bruno Sericola To cite this version: Emmanuelle Anceaume, Yann Busnel, Ernst Schulte-Geers,
Running an HCI Experiment in Multiple Parallel Universes
Running an HCI Experiment in Multiple Parallel Universes,, To cite this version:,,. Running an HCI Experiment in Multiple Parallel Universes. CHI 14 Extended Abstracts on Human Factors in Computing Systems.
Performance Evaluation of Encryption Algorithms Key Length Size on Web Browsers
Performance Evaluation of Encryption Algorithms Key Length Size on Web Browsers Syed Zulkarnain Syed Idrus, Syed Alwee Aljunid, Salina Mohd Asi, Suhizaz Sudin To cite this version: Syed Zulkarnain Syed
ISO9001 Certification in UK Organisations A comparative study of motivations and impacts.
ISO9001 Certification in UK Organisations A comparative study of motivations and impacts. Scott McCrosson, Michele Cano, Eileen O Neill, Abdessamad Kobi To cite this version: Scott McCrosson, Michele Cano,
An integrated planning-simulation-architecture approach for logistics sharing management: A case study in Northern Thailand and Southern China
An integrated planning-simulation-architecture approach for logistics sharing management: A case study in Northern Thailand and Southern China Pree Thiengburanathum, Jesus Gonzalez-Feliu, Yacine Ouzrout,
How to translate VisualPlace
Translation tips 1 How to translate VisualPlace The international language support in VisualPlace is based on the Rosette library. There are three sections in this guide. It starts with instructions for
An Automatic Reversible Transformation from Composite to Visitor in Java
An Automatic Reversible Transformation from Composite to Visitor in Java Akram To cite this version: Akram. An Automatic Reversible Transformation from Composite to Visitor in Java. CIEL 2012, P. Collet,
Improved Method for Parallel AES-GCM Cores Using FPGAs
Improved Method for Parallel -GCM Cores Using FPGAs Karim Moussa Ali Abdellatif, Roselyne Chotin-Avot, abib Mehrez To cite this version: Karim Moussa Ali Abdellatif, Roselyne Chotin-Avot, abib Mehrez.
Business intelligence systems and user s parameters: an application to a documents database
Business intelligence systems and user s parameters: an application to a documents database Babajide Afolabi, Odile Thiery To cite this version: Babajide Afolabi, Odile Thiery. Business intelligence systems
New implementions of predictive alternate analog/rf test with augmented model redundancy
New implementions of predictive alternate analog/rf test with augmented model redundancy Haithem Ayari, Florence Azais, Serge Bernard, Mariane Comte, Vincent Kerzerho, Michel Renovell To cite this version:
The truck scheduling problem at cross-docking terminals
The truck scheduling problem at cross-docking terminals Lotte Berghman,, Roel Leus, Pierre Lopez To cite this version: Lotte Berghman,, Roel Leus, Pierre Lopez. The truck scheduling problem at cross-docking
Question template for interviews
Question template for interviews This interview template creates a framework for the interviews. The template should not be considered too restrictive. If an interview reveals information not covered by
Partial and Dynamic reconfiguration of FPGAs: a top down design methodology for an automatic implementation
Partial and Dynamic reconfiguration of FPGAs: a top down design methodology for an automatic implementation Florent Berthelot, Fabienne Nouvel, Dominique Houzet To cite this version: Florent Berthelot,
XTM for Language Service Providers Explained
XTM for Language Service Providers Explained 1. Introduction There is a new generation of Computer Assisted Translation (CAT) tools available based on the latest Web 2.0 technology. These systems are more
Virtual plants in high school informatics L-systems
Virtual plants in high school informatics L-systems Janka Majherov To cite this version: Janka Majherov. Virtual plants in high school informatics L-systems. Michael E. Auer. Conference ICL2007, September
Crowdsourcing and digitization
Crowdsourcing and digitization Mathieu Andro To cite this version: Mathieu Andro. Crowdsourcing and digitization. The Ebooks on Demand Conference 2014 (Innsbruck University, April 11th 2014), Apr 2014,
Distance education engineering
Distance education engineering Olivier Marty To cite this version: Olivier Marty. Distance education engineering. Working paper. 2012. HAL Id: halshs-00776581 https://halshs.archives-ouvertes.fr/halshs-00776581
Hinky: Defending Against Text-based Message Spam on Smartphones
Hinky: Defending Against Text-based Message Spam on Smartphones Abdelkader Lahmadi, Laurent Delosière, Olivier Festor To cite this version: Abdelkader Lahmadi, Laurent Delosière, Olivier Festor. Hinky:
EBU STL Viewer User s Guide
EBU STL Viewer User s Guide EBU STL VIEWER USER S GUIDE 2 LEGAL NOTICE The information in this manual is furnished for informational use only. No part of this manual may be reproduced or transmitted in
Novel Client Booking System in KLCC Twin Tower Bridge
Novel Client Booking System in KLCC Twin Tower Bridge Hossein Ameri Mahabadi, Reza Ameri To cite this version: Hossein Ameri Mahabadi, Reza Ameri. Novel Client Booking System in KLCC Twin Tower Bridge.
Application-Aware Protection in DWDM Optical Networks
Application-Aware Protection in DWDM Optical Networks Hamza Drid, Bernard Cousin, Nasir Ghani To cite this version: Hamza Drid, Bernard Cousin, Nasir Ghani. Application-Aware Protection in DWDM Optical
Heterogeneous PLC-RF networking for LLNs
Heterogeneous PLC-RF networking for LLNs Cedric Chauvenet, Bernard Tourancheau To cite this version: Cedric Chauvenet, Bernard Tourancheau. Heterogeneous PLC-RF networking for LLNs. CFIP 2011 - Colloque
Design and architecture of an online system for Vocational Education and Training
Design and architecture of an online system for Vocational Education and Training Christos Bouras, Eri Giannaka, Thrasyvoulos Tsiatsos To cite this version: Christos Bouras, Eri Giannaka, Thrasyvoulos
SIMS AND SCANNING ION MICROSCOPY
SIMS AND SCANNING ION MICROSCOPY G. Allen, I. Brown To cite this version: G. Allen, I. Brown. SIMS AND SCANNING ION MICROSCOPY. Journal de Physique Colloques, 1989, 50 (C2), pp.c2-121-c2-125. .
How To Design A Distributed Content Management Framework For Digital Museum Exhibitions
Distributed Content Management Framework for Digital Museum Exhibitions Samuel Cruz-Lara, Bai-Hsuen Chen, Jen-Shin Hong To cite this version: Samuel Cruz-Lara, Bai-Hsuen Chen, Jen-Shin Hong. Distributed
One Report, Many Languages: Using SAS Visual Analytics to Localize Your Reports
Technical Paper One Report, Many Languages: Using SAS Visual Analytics to Localize Your Reports Will Ballard and Elizabeth Bales One Report, Many Languages: Using SAS Visual Analytics to Localize Your
Surgical Tools Recognition and Pupil Segmentation for Cataract Surgical Process Modeling
Surgical Tools Recognition and Pupil Segmentation for Cataract Surgical Process Modeling David Bouget, Florent Lalys, Pierre Jannin To cite this version: David Bouget, Florent Lalys, Pierre Jannin. Surgical
Online free translation services
[Translating and the Computer 24: proceedings of the International Conference 21-22 November 2002, London (Aslib, 2002)] Online free translation services Thei Zervaki [email protected] Introduction
Proposal for the configuration of multi-domain network monitoring architecture
Proposal for the configuration of multi-domain network monitoring architecture Aymen Belghith, Bernard Cousin, Samer Lahoud, Siwar Ben Adj Said To cite this version: Aymen Belghith, Bernard Cousin, Samer
A modeling approach for locating logistics platforms for fast parcels delivery in urban areas
A modeling approach for locating logistics platforms for fast parcels delivery in urban areas Olivier Guyon, Nabil Absi, Dominique Feillet, Thierry Garaix To cite this version: Olivier Guyon, Nabil Absi,
Conceptual Workflow for Complex Data Integration using AXML
Conceptual Workflow for Complex Data Integration using AXML Rashed Salem, Omar Boussaid, Jérôme Darmont To cite this version: Rashed Salem, Omar Boussaid, Jérôme Darmont. Conceptual Workflow for Complex
Understanding Big Data Spectral Clustering
Understanding Big Data Spectral Clustering Romain Couillet, Florent Benaych-Georges To cite this version: Romain Couillet, Florent Benaych-Georges Understanding Big Data Spectral Clustering 205 IEEE 6th
TRANSLATIONS FOR A WORKING WORLD. 2. Translate files in their source format. 1. Localize thoroughly
4 Website Localization Tip Sheet 1. Localize thoroughly For your international audience, the content you publish should be in keeping with both your global marketing strategy and your local campaigns.
P2Prec: A Social-Based P2P Recommendation System
P2Prec: A Social-Based P2P Recommendation System Fady Draidi, Esther Pacitti, Didier Parigot, Guillaume Verger To cite this version: Fady Draidi, Esther Pacitti, Didier Parigot, Guillaume Verger. P2Prec:
Undulators and wigglers for the new generation of synchrotron sources
Undulators and wigglers for the new generation of synchrotron sources P. Elleaume To cite this version: P. Elleaume. Undulators and wigglers for the new generation of synchrotron sources. Journal de Physique
Good Practices as a Quality-Oriented Modeling Assistant
Good Practices as a Quality-Oriented Modeling Assistant Vincent Le Gloahec, Régis Fleurquin, Salah Sadou To cite this version: Vincent Le Gloahec, Régis Fleurquin, Salah Sadou. Good Practices as a Quality-Oriented
Flauncher and DVMS Deploying and Scheduling Thousands of Virtual Machines on Hundreds of Nodes Distributed Geographically
Flauncher and Deploying and Scheduling Thousands of Virtual Machines on Hundreds of Nodes Distributed Geographically Daniel Balouek, Adrien Lèbre, Flavien Quesnel To cite this version: Daniel Balouek,
CNC Kompetenzzenter mit der multimedialen Software
CNC Kompetenzzenter mit der multimedialen Software LLP-LdV-TOI-2009-EE-001 1 Information sur le projet Titre: Code Projet: Année: 2009 Type de Projet: Statut: Pays: Accroche marketing: Résumé: CNC Kompetenzzenter
ControVol: A Framework for Controlled Schema Evolution in NoSQL Application Development
ControVol: A Framework for Controlled Schema Evolution in NoSQL Application Development Stefanie Scherzinger, Thomas Cerqueus, Eduardo Cunha de Almeida To cite this version: Stefanie Scherzinger, Thomas
Your single-source partner for corporate product communication. Transit NXT Evolution. from Service Pack 0 to Service Pack 8
Transit NXT Evolution from Service Pack 0 to Service Pack 8 April 2009: Transit NXT Service Pack 0 (Version 4.0.0.671) Additional versions of DTP programs supported: InDesign CS3 and FrameMaker 9 Additional
Multilateral Privacy in Clouds: Requirements for Use in Industry
Multilateral Privacy in Clouds: Requirements for Use in Industry Ina Schiering, Markus Hansen To cite this version: Ina Schiering, Markus Hansen. Multilateral Privacy in Clouds: Requirements for Use in
