Najeh HAJLAOUI Postdoctoral Research Fellow at university of Wolverhampton United Kingdom Mobile: + 33 (0)6 62 20 66 49 Mel : Najeh.Hajlaoui@gmail.com Web site: http://www-clips.imag.fr/geta/user/najeh.hajlaoui/ DOCTOR IN INFORMATICS Experience in teaching computer science Experience in Natural Language Processing and Machine Translation Sense of teamwork Quadrilingual (Arabic, French, English, Italian) EXPERIENCES & EDUCATION 01/2011-today Postdoctoral research fellow at university of Wolverhampton United Kingdom «statistical machine translation Arabic -English». 12/2009-11/2010 Postdoc at France Telecom R&D Lannion France Acquisition of adequate terminology by multilevel extension of applicative corpus. July 2010 Visiting researcher at NII National Institute of Informatics at Tokyo in Japan systematic experimentation of the usefulness of statistical machine translations systems. 2007-2009 Associate Lecturer at Jean Monnet University, ISEAG «Institut Supérieur d'economie, d'administration et de Gestion» at Saint Etienne. Researcher at LHC Laboratoire Hubert Curien at Saint Etienne supervised classification of documents based on inter-documents links. 2003-2009 Researcher at LIG lab (Laboratoire d Informatique de Grenoble), GETALP group (Groupe d Etude en Traduction/Traitement Automatique des Langues et de la Parole) at Grenoble. Postdoc «End to End evaluation of linguistic porting of Management Content systems» Ph.D in computer science Multilingualization of e-commerce systems handling spontaneous utterances in natural language at UJF (Université Joseph Fourier, Grenoble 1) under supervision of Professor Christian BOITET. Ph.D selected and published by European University Editions. 2001-2002 Master in "information systems" at Joseph Fourier university (Grenoble 1), with honors. Joint European Diploma MATIS (Management And Technology of Information Systems) at Geneva university and Lausanne university in Switzerland, UJF (Université Joseph Fourier) and UdS (Université de Savoie) in France, with honors. Master student at CSI (Conception des Circuits Intégrés / Integrated Circuits Design) laboratory at Grenoble. 10 months research project 1/7
under the supervision of Professor Gabrielle SAUCIER: «Exchange of virtual electronic components via Intranet / Internet». 1996-2001 Undergraduate education, BSc in Computer Science, Science University, Monastir Tunisia, with honors. PFE «Projet de Fin d Etudes» internship: 5 months at SITEX company (Société Industrielle de TEXtiles), Sousse, Tunisia, on production management assisted by computer. Summer Internship: 2 months at ATS Advanced Technical of informatics Systems company at Sidi Bouzid - Tunisia. 1989-1996 Higher education, Baccalaureate of mathematics at institute of Lessouda Sidi-Bouzid Tunisia. RESEARCH ACTIVITY Centers of interest Domains: NLP Natural Language Processing, MT Machine Translation, Multilinguisation, Localisation Multilinguisation of content management systems treating spontaneous utterances in natural language. Generation of real corpora (natural corpora) for tests and SMT building. Automatic Processing of sub-languages. Automatic Translation (statistical) of sub-languages. Management of multilingual corpora. Classification of documents. Acquisition of adequate corpus and terminology for an application need. Keywords: spontaneous utterances, natural language, e-commerce, linguistic porting, sub-language, content extraction, statistical translation, acquisition of corpus, analyse of corpus, acquisition of terminology Supervision 9 students: 2009-2010 : Prem-Kumar CASIMIR, Master 2 computer science «Evaluation acquisition of terminology tools in a multimedia application», Université Pierre et Marie Curie Jussieu, Paris VI. 2008-2009 : Xiang YIN, TER «Evaluation task of content management system», UJF «Université Joseph Fourier». 2007-2008: Sana HAJLAOUI, Imen YOUSFI, PFE Information retrieval in a numeric, multilingual and multimedia library. ISIMS «Institut Supérieur d Informatique et de Multimédia de Sfax». 2007-2008: Houssem ZAIR, TER internal and external evaluation of linguistic porting of a content processing system, UJF «Université Joseph Fourier». 2004-2005: Yan WANG, TER evaluation of aids for translators, locally and on the Web, and a first study of UNL, UJF «Université Joseph Fourier». 2/7
2003-2004: Mélanie PINEAU, TER Comparison of translation results (automatic or not), UJF «Université Joseph Fourier». 2002-2003: Song ZHICHAO, TER Translation of a corpus of parallel sentences, UJF «Université Joseph Fourier». 2002-2003: Kamel AMIROU, TER System for helping human translation assisted by machine, UJF «Université Joseph Fourier». Publications 1 article of book, 2 articles in internationals journals (1 accepted and 1 under submission), 16 papers in internationals conferences (COLING x 2, EGC x 3, IBIMA x 2, LREC, TOTh x 2, NLDB, MEDES ) Article of book Hajlaoui, N. and C. Boitet (2003), A "pivot" XML-based architecture for multilingual, multiversion documents: parallel monolingual documents aligned through a central correspondence descriptor and possible use of UNL. Convergences 03, Alexandria, 2-6 December 2003. 8p. (http://www.cicling.org/2005/unl-book/) Internationals journals Hajlaoui, N. (2010). Évaluation d une extension multiniveau de corpus applicatif. Revue RNTI. Numéro spécial sur EvalECD (Évaluation des méthodes en Extraction et la Gestion des Connaissances). 19 p. (waiting notification). Hajlaoui, N. (2010). Multilevel Extension of applicative corpus. Journal of CIBIMA (Communications of the 15th International Business Information Management Association). 11 p. Internationals conferences Hajlaoui, N. (2010). Système de recherche de musique adaptable à la perception de chaque utilisateur. Proc. EGC- 2011_Demos. Démonstration de logiciels dans la 11ème Conférence Internationale Francophone sur l'extraction et la Gestion des Connaissances. Brest, France. 25-28 janvier 2011. 3 p. Hajlaoui, N. (2010). Extraction d information à partir des SMS. Proc. EGC-2011 (11ème Conférence Internationale Francophone sur l'extraction et la Gestion des Connaissances). Atelier SoS'2011 (Sources ouvertes et services). Brest, France. 25-28-janvier 2011. 10 p. Hajlaoui, N. (2010). Acquisition de terminologie adéquate par extension multiniveau de corpus. Proc. EGC-2011. (11ème Conférence Internationale Francophone sur l'extraction et la Gestion des Connaissances) Atelier QDC'2011 (Qualité des Données et des Connaissances). Brest, France. 25-28-janvier 2011. 12 p. Hajlaoui, N. (2010). Extension multiniveau de corpus applicatif. Proc. IBIMA (15th International Business Information 3/7
Management Association) Conference. Caire, Egypt. 6-7 Novembre 2010. 11 p. Hajlaoui, N. and C. Boitet (2010). Multilinguization and Personalization of NL-based Systems. Proc. COLING Workshop CLIA (Cross Lingual Information Access). Chine, août 23-28, 2010. 9 p. Hajlaoui, N. (2009). End to end evaluation of Impression based Music retrieval service. Proc. ACM MEDES'09 (The international ACM Conference on Management of Emergent Digital EcoSystems). Lyon, France. October 27-30, 2009. 5 p. Hajlaoui, N. and C. Boitet (2009). Portage linguistique de l arabe et vers l arabe. Proc. Infol@ngues III 2009 Conférence sur l informatique et les langues. Tabarka, Tunisie. 20-21 février 2009. 10 p. Hajlaoui, N. and C. Boitet (2008). TA statistique à petits corpus pour des petits sous-langages. Proc. TOTh 2008 Conférence sur la Terminologie & Ontologie : Théories et Applications. Annecy, France. 5-6 juin 2008. 15 p. Hajlaoui, N., D. M. Daoud, et al. (2008). Methods for porting NL-based restricted e-commerce systems into other languages. Proc. LREC 2008 (Language Resources and Evaluation Conference). Marrakech, Maroc. june 26-31, 2008. 7 p. Hajlaoui, N. (2007), Multilinguïsation de services de gestion de contenu. Proc. IC (Ingénierie des Connaissances) dans le cadre de la plate-forme AFIA (Association Française pour l Intelligence Artificielle), France, Grenoble 2-6 juillet 2007. Hajlaoui, N. (2007), Localizing content management application for spontaneous textual utterances in natural language, Proc. NLDB (12th international conference on Applications of Natural Language to Information Systems) France, Paris 27-29 juin 2007. Hajlaoui, N. and C. Boitet (2007). Portage linguistique d'applications de gestion de contenu. Proc. TOTh 2007 Conférence sur la Terminologie & Ontologie : Théories et Applications. Annecy, France. 1 juin 2007. 13 p. Hajlaoui, N. (2006). Recherche et production de corpus de messages pour la multilinguisation de sites de e-commerce en SMS, initialement en arabe. Proc. IBIMA (6th International Business Management Information Association) Conference. Bonn, Allemagne. 5-6 juin 2006. 11 p. Hajlaoui, N. (2006), PolyphraZ, un outil pour la gestion de corpus multilingues parallèles à travers le web, CALS (Colloque international d Albi Langages et Signification), France, Albi, 10-14 juillet 2006. Hajlaoui, N. and C. Boitet (2004). PolyphraZ: a tool for the quantitaive and subjective evaluation of parallel corpora. Proc. IWSLT 2004: International Workshop on Spoken Language Translation. Kyoto, Japan. September 30 October 01, 2004. 8p. Hajlaoui, N. and C. Boitet (2004), PolyphraZ : a tool for the management of parallel corpora. Proc. COLING Workshop MLR2004 (on Multilingual Linguistic Resources). Geneva. August 28th 2004. 8p. 4/7
Member of scientific committee Evaluation of 1 paper for the 15 th IBIMA conference. Evaluation of 2 papers (co-lecturer with Edmond Lassalle) for CORIA 2010 conference. Evaluation of 1 paper for the COSI 2010 conference (Colloque sur l Optimisation et les Systèmes d Information), Evaluation of 2 papers for LREC 2010 conference, Semitics languages workshop. Evaluation of 4 papers for TALN 2010 conference. TEACHING: 636 H TD 2009-2010 Lecturer at the private school Sup2i at Grenoble: 16 H of courses. Module real-time and embedded systems : 16 H courses. 2008-2009 Associate Lecturer at Jean Monnet University, Saint Etienne : 96 h TD (tutorials) Course and TD on tools and software, Licence 1 Economie et gestion : 96 h TD 2007-2008 Associate Lecturer at Jean Monnet University, Saint Etienne: 96 h TD Course and TD of information system and database, Master 1 Economie et Management : 48 h TD. Course and TD web technologies, tools and languages : license AES «Administration Economique et Sociale : 48 h TD. 2003-2007 Attached teacher at UPMF (Université Pierre Mendès France, Grenoble 2): 428 h TD Initiation to informatics, office software, basic notions of network and database, DEUG psychology : 146 h TD = (56 h TD in 02/03 + 90 h TD in 03/04) Design and definition of a course programme in informatics and documentation : DEUG sociology (96 h C/TD = (32 h TD in 02/03 + 64 h TD in 03/04)), L1 sociology (120 h C/TD in 04/05) Animation of work on management of dynamic Web applications and databases (XHTML/CSS/PHP5/Postgres), Licence Professionnelle SIL «Systèmes Informatiques et Logiciels» at IUT2 (18 h TD) E-learning: C2I «Certificat Informatique et Internet» at UPMF (48 h TD) 5/7
COLLECTIVE/COMMUNITY ACTIVITIES 2004-2007 Elective function : member representing by election all PhD students on the CLIPS laboratory called now LIG. 2003-2004 Responsible of project proposals: in the framework of an internal project at GETA, TraCorpeX (Traduction de Corpus d exemples), preparation and submission of two project proposals in the framework of the French-Tunisian cooperation to the CMCU (Comité Mixte de Coopération Universitaire). 2003-Today Elected member of the CA (Conseil d Administration / board) of the Champollion association, a non-profit association supporting research on MT and NLP. SCIENTIFIC COMPETENCES INFORMATICS COMPETENCES Systems: Windows, Linux, Unix, Mac Os X. Environments: Visual BASIC 6, Visual C++ 6, AMC*Designer 6, Developer 2000. SGBD: ORACLE 7-8, MYSQL, ACCESS. POSTGRES Languages: C/C++, SQL, XML, HTML,XHTML, Xpath, CSS, PHP, Java, Java script, Visual Basic, Prolog, Perl, TclTk Architecture: Enhydra/Postgres, PHP/MySQL/Postgres Methods: UML, MERISE Specialised Language for Linguistic Processing (SLP) : EnCo (EnConversion environment), et DeCo (Deconversion environment) : two SLP diffused by UNL «Universal Networking Language» centre of Tokyo). LANGUAGES Arabic: mother tongue French: bilingual English: read (good), speak (average), write (average) Italian: read (little), speak (little), write (little) COMPLEMENTARY INFORMATIONS CIVIL STATUS Date of birth: 01/21/1978 Nationality: Tunisian, French Driving license: category B Civil status: single INTEREST CENTRES Sport: football, swimming Travel New technologies 6/7
PROJECTS SHORT-TERM GOALS MEDIUM-TERM GOALS LONG-TERM GOALS Finish my actual research fellow. Refine the linguistic porting technologies on which I worked during my PhD. Do further research on the types of sublanguages for which it is possible to develop efficient (useful) statistical MT systems in the context of applications extracting content from spontaneous utterances. Join some establishment offering an adequate research and/or development environment. 7/7