Pattern Recognition and Human Language Technology Group (Grupo de Reconocimiento de Formas y Tecnologías de la Percepción) Institut Tecnològic d Informàtica Departament de Sistemes Informàtics i Computació Universitat Politècnica de València http://prhlt.iti.es/ December 2005
1981 Universitat de València: RFIA group BRIEF HISTORY 1986 Universitat Politècnica de València: RFIA group, DSIC department European projects: ROARS, SPIN, EuTrans-I Spanish projects: ALBAYZIN, TRACOM 1997 PRHLT group derives from RFIA group and integrates in ITI European projects: EuTrans-II, TT2 Spanish projects: Pattern Recognition and Computer Vision: TAR, ATRAM, TYRIG Machine Translation: TAVAL, SISHITRA, TEFATE, Dialogue Systems: BASURDE, DIHANA
MAIN RESEARCH AREAS Pattern Recognition Handwritten Character Recognition Biometrics Computer Vision Language Translation Speech Processing Machine Learning Dialogue Systems
PEOPLE Academic degree 12 Ph. D. 19 Ph. D. Students Academic position 14 Professors and assistants (DSIC-UPV, DC-UPV, DI-UCLM) 8 Research contracts 8 Fellowships
RECENT EUROPEAN PROJECTS Projects EUTRANS: Example based language TRANslation Systems. ESPRIT Open Long Term Research. ACCION 30268. 2 Phases.1996-2000. ITI, Zeres, Fundaziones Ugo Bordoni, Aachen University. TT2: TransType2-Computer-Assisted Translation. IST Programme. IST-2001-32091. 2002-2005. ATOS, ITI, Aachen University, Celer, RALI, Xerox Co., Gamma. Acciones Integradas España-Alemania. 2000-2002: ITI, Aachen University España-Portugal. 2002-2004: ITI, INESC ID/IST Lisbon.
RECENT SPANISH PROJECTS (CICYT) EXTRA: Extensiones del sistema de traducción de texto y habla en dominios restringidos aprendible con ejemplos. 1997-1999. UJI, DSIC-UPV. BASURDE: Desarrollo de un sistema de diálogo para habla espontánea en un dominio semáticamente restringido. 1998-2001. UPC, DSIC-UPV, LSI-UPC, EHU-UPV, UZ, UJI. TAVAL: Traductor automático bidireccional entre castellano y valenciano. 2000-2001. ITI. SISHITRA: Sistemas híbridos para la traducción valenciano-castellano a partir de voz y texto. 2001-2003. ITI, LSI-UA. TAR: Técnicas avanzadas en reconocimiento de formas y sus aplicaciones en procesos industriales y comerciales. 2001-2003. ITI, LSI-UA, LSI-UJI. DIHANA: Sistema de Diálogo para el Acceso a la Información mediante habla espontánea en diferentes entornos. CICYT. 2003-2005. DSIC-UPV, EHU-UPV, UZ. ATRAM: Aplicación de Técnicas de Reconocimiento de Formas para el Análisis Morfológico del Pie y Fabricación del Calzado. CICYT. 2001-2004. IBV, DSIC-UPV. ITEFTE: Inferencia de traductores de estados finitos para la traducción automática y la ayuda a la traducción en tareas específicas. 2003-2005. LSI-UA, DSIC-UPV.
RECENT PROJECTS WITH COMPANIES AMETRA, ADUR Software Productions S. Co. Pick-by-Voice, RUMBO Sistemas S.L. A classification system by voice, Teismaderas S.A. Biometrics, Advanced Software Technologies S.A. Opinion poll by phone, ODEC. Optical processing of documents, ODEC.
PUBLICATIONS Proceedings of conferences: International Conference on Pattern Recognition, International Conference on Acoustic, Speech and Signal Processing, Conference on Computational Linguistics,... Journals: IEEE Transactions Pattern Analysis and Machine Intelligence, Pattern Recognition, Machine Learning Journal, Computational Linguistics, IEEE Transactions on Acoustic, Speech and Signal Processing, Computer Speech and Language, IEEE Transactions on Speech and Audio Processing, IEEE Transactions on Systems, Man and Cybernetics,...
LINKS European groups Lehrstuhl für Informatik VI. RWTH Aachen - University of Technology. Germany. (H. Ney) Equipe Universitaire de Recherche en Informatique de Saint Etienne (EURISE). Université de Saint Etienne-Jean Monnet. France. (C. de la Higuera) Institute for Systems and Computer Engineering. Spoken Language Systems Lab (L 2 F). Lisboa. Portugal. (I. Trancoso) Spanish groups Grupo de Reconocimiento Automático del Habla. Universidad del País Vasco Grupo de Aprendizaje Computacional, Reconocimiento Automático y Traducción del Habla. Universitat Jaume I. Grupo de Reconocimiento de Formas e Inteligencia Artificial. Universidad de Alicante Grup de Teoria del Senyal. Universitat Politécnica de Catalunya Centro Politécnico Superior. Universidad de Zaragoza
METHODOLOGIES MODELS Hidden markov models Stochastic finite-state transducers Statistical alignment models and phrase-based models (Local) Feature vectors and (weighted) distances TRAINING Statistical estimation (E-M algorithms) Grammatical inference techniques Clustering SEARCH Viterbi algorithm (+ N-best + Word graphs) Stack-decoding algorithm (+ N-best) K-nearest neighbor classifiers
BASIC PROTOTYPES ATROS: Speech recognition, speech translation and handwritten character recognition. PBSMT: Machine translation. TT2: Computer-assisted translation. LFC: Face recognition, speaker recognition, computer vision....
MACHINE TRANSLATION SYSTEMS SISHITRA: A knowledge-based Spanish-to-Catalan translator http://prhltdemos.iti.es/ taval/ (Access) Statistical translators http://dcomgp05.gnd.upv.es/webtrans.debug/trad TEFATE: Spanish-to-Catalan (Unrestricted task) AMETRA-METEO: Spanish-to-Basque (Meteorological News) AMETRA-DFB: Spanish-to-Basque (Administrative task) TT2-EU: English-to-Spanish (European Union Bulletin)
COMPUTER-ASSISTED TRANSLATION Translation of printer manuals English-Spanish English-German English-French Weather reports Spanish-Basque Administrative proceedings Spanish-Basque Spanish-Catalan
SPEECH-TO-SPEECH TRANSLATION(Access) Spanish-to-English tourist task http://prhltdemos.iti.upv.es/demo/spanish_demo.html Italian-to-English tourist task http://prhltdemos.iti.upv.es/demo/italian_demo.html Catalan-to-English tourist task http://prhltdemos.iti.upv.es/demo/valcat_demo.html Spanish-to-Basque tourist task http://prhltdemos.iti.upv.es/demo/ametra_demo.html Portuguese-to-English tourist task
SPEECH RECOGNITION A speech understanding prototype http://prhlt.iti.es/demos/demo_speechunderstand/index.htm Automatic voice-driven telephone exchange http://prhlt.iti.es/demos/demo_exchange/index.htm Speech dialogue with an information system http://physionet.cps.unizar.es/ eduardo/investigacion/voz/ tic98-0423.html
HANDWRITTEN CHARACTER RECOGNITION Off-line handwritten character recognition An example http://prhlt.iti.es/demos/demo_htr/index.htm On-line handwritten character recognition
BIOMETRICS Automatic face recognition Preprocessing Training & search Speaker verification
OTHER CLASSIFICATION TASKS Plain text classification Handwritten text classification Chromosome classification Prostate ultrasonography pattern analysis Breast cancer detection in digitized mammograms 3D foot shape characterisation and footwear fitting prediction
Thank you for your attention