Motif Knowledge Base Based on Deductive Object-Oriented Database Language

Size: px
Start display at page:

Download "Motif Knowledge Base Based on Deductive Object-Oriented Database Language"

Transcription

1 - 10 Motif Knowledge Base Based on Deductive Object-Oriented Database Language MAKOTO HIROSAWAt REIKO TANAKAt MASATO ISHIKAWA tex-icot, presently with Hitachi System Development Laboratory 1099 Ohzenji, Asao, Kawasaki, 215, JAPAN 1 tinformation and Mathematical Science Laboratory, Inc. (IMS) presently at ICOT Institute for New Generation Computer Technology (ICOT) Mita, Minato, 108 JAPAN Abstract The representation of biological concepts in a knowldge base are important to a machine or a non-specialist of biology to understand and analyze genetic information. In our previous study, we studied the representation of biological knowledge and the representation of biological knowledge related to motif of protein with the goal of discovering new motifs. In this paper, firstly, the requirements for the representation of biological knowledge are listed. Then, solutions to these requirements are stated. Finally, representation of bioloigal knowledge on motif in the Deductive Object-Oriented Language, QUIXOTE, is shown. The knowledge base includes Prosite, a representative motif database, as the basis of the knowledge base.

2 11 1 Introduction The analysis of the genes of living organisms is essential technology to the deciphering of biological phenomena. Today, with increasing advances in bio-technology, the genes that have been identified but which have yet to be analyzed are rapidly increasing. To enable the automatic analysis of massive genes to extract biological information from them, the introduction of knowledge-based analysis is necessary, in addition to the development of highspeed computers and fast analysis algorithms. This is partly because the quality of analysis without biological knowledge is not high enough and partly because the time required for the analysis can be reduced remarkably by the introduction of biological knowledge. Nowadays, data about genetic information is stored in databases. But, these databases have been constructed with little consideration to their application to knowledg engineering. So, it is difficult to use these databases to create a system that enables high-level knowledge processing. To do this, the effective representation of biological knowledge is necessary. The representation of biological knowledge has been studied by several researchers [HTanaka 1993][Kazic et al. 1993]. They succeeded in representing simple, small amounts of knowledge. But, the representation can not necessarily be extended to the representation of practical problems. We selected motif discovery as a practical problem to study the representation of biological knowledge. There were two reasons for our selection. One was that the level of complexity of biological knowledge about motif is adequate for the study of the representation of biological knowledge. Its complexity is low enough for non-specialists of biology to study the representation and thus to prevent us from becoming overwhelmed by unfamiliar sciencẹ Still, its complexity is high enough to cover the essential concepts of biology. We thought that the understanding gained through this study would also help us in the representation of other areas of biological knowledge. The second reason for our selection was our familiarity with the motif discovery problem. We have already developed a motif discovery system [Hirosawa et al. 1993]. But, the representation in the system was not particularly elaborate. The logic programming language, KL1, in which the system itself is written, was used to express biological knowledgẹ But, because KL1 is essentially a programming language rather than knowledge representation language, its effectiveness of representation is limited. This time we opted to use DOOD (Deductive Object-Oriented Database) language QUIXOTE [Yasukawa et al.1992] to represent biological knowledge. We thought that the Object-Oriented feature of QUIXOTE would be useful for the representation and that its deductive feature is suitable for interfacing with an inference machine. 2 Biological Knowledge Base In the first subsection, the requirements for biological knowledge are described and the solutions satisfying them, using the Deductive Object-Oriented Knowledge base, are stated briefly. Then, after overviewing of our Biological Knowledge Base, the knowledge base using QUIXOTE is described. 2.1 Requirements for a Biological Knowledge Base If we want a knowledge base to be useful for both an inference machine and non-specialists of biology, there are several requirements that the biological knowledge base must satisfy. The

3 - 12 requirements and their solutions are listed below. Requirement 1 Knowledges of different reliabilities must co-exist in the knowledge base. But knowledges must be processed differently according to their reliability. Solution 1 This can be realized by exploiting the module concept of QUIXOTE. We divide knowledge into modules according to the reliability of the knowledge. When we want to discover a small but reliable motif, we should consult only those modules whose reliabilities are high. When we want to discover many, by sacrificing of reliability, we should also consults those modules whose reliabilities are low. Requirement 2 The system must manage public knowledge bases and these knowledge bases belonging to researchers differently. Also, the system must manage knowledge bases belonging to individuals differently. Solution 2 This, too, can be realized by exploiting the module concept of QUIXOTE. Suppose there are three modules, Public*, Userl and User2. When userl wants to consult biological knowledge, he or she should consult Public* and Userl. When User2 wants consult biological knowledge, he or she should consult Public* and User2. Requirement 3 In biology, Person 1 may regard a motif C as being a motif of enzyme A, while Person 2 regards that motif as being a motif having function B. This often happens in biology. This wuold appear to be a contradiction, but actually is not. This arises because enzyme A has function B, and a portion of the protein around motif C has function B. This happens because of the different interests of different researchers. In this case, Person 1 is interested in the class of the discovered protein and Person 2 is interested in its function. Solution 3 This kind of complexity cannot be represented by conventional hierarchical representation. But, it can easily be represented in multiple inheritance in Object Oriented feature of QUIXOTE, if the relation between biological concepts is untangled and properly described. To make the description, not only knowledge of, DOOD but also knowledge of biology is required. We have the both. Requirement 4 Biologists use their biological knowledge to a understand database. But, a machine or a non-biologist has difficulty in understanding a database, or they cannot understand it perfectly because of their for the lack of knowledge. For example, they would find essentially impossible to understand the information described in the comment in the database which is written in a natural language. Because the information in comment is sometimes extremely important, this kind of information must be understandable by both by the machine and non-specialists. Solution 4 This can not be done automatically, It can be done only by reading and understanding the database and representing the knowledge. But this dirty and time-consuming job is facilitated if concept of Object-oriented is in our mind. 2.2 Overview of Biological Knowledge Base We constructed our Biological Knowledge Base using QUIXOTE. The Biological Knowledge Base contains knowledge on motif knowledge and its related knowledge.

4 2.2.1 Representation using the module feature corresponding to Requirement 1 and 2 and their solutions There are three modules in the knowledge base, Prosite*, User and Discovery. The system consults Prosite* and User when it atempts to discover motifs, and it registers any discovered motifs in Discovery. Separate handling the motifs are realized by module feature of QUIXOTE. In the system, Prosite* is more reliable than User. In Prosite*, motifs in Prosite [Bairoch 1991], a representative database of motif, and related knowledge are represented in a hierarchical and multiple inheritance scheme. With the use of the multiple inheritance scheme, biological concepts are more properly represented (In the original Prosite motifs are represented just in a hierarchical scheme). User is a module of Biological Knowledge Base in which motifs collected by user are stored. In some cases, motifs in User may be less reliable than those in Prosite*. To represent biological concepts in User and Discovery, the same scheme as that used in Prosite* is used. When users want to deduce small but reliable information from the knowledge base, they should select only the Prosite* module. When, however, they want to deduce many information at the expense of reliability, they should select User module as well. Figure Representation of complex biological concept corresponding to Requirement 3 and and their solutions To illustrate the representation of complex concepts in the biological Knowledge Base more in detail, a portion of the knowledge in Prosite* is visualized in Figure 1. In the figure, concepts and motifs belonging to kinasegroup and domain-general are shown. KinaseGroup is a concept related to a class of protein and domain-general is related to the functions of proteins. Because the two concepts are note independent, there is cross-linking between them. KinaseGroup, which transfers phosphate, is classified into four groups, one of which contains protein-kinase. Furthermore, protein-kinase is classified into tyrosine kinase(ty-kinase) and serine/threonine kinase(s-th-kinase). The three classes (protein kinase, tyosine kinase and serine/threonine kinase) didn't exist in the original Prosite. We introduce these three classes because, although they are not necessary for biologists to understand knowledge base but are

5 14- necessary for non-specialists and machine to understand it. But, introduction of the concepts is can be appeared canceled. If users don't want the introduced these concepts of class (ex. tyrocine kinase), the system handles knowledge as if these classes did not exist. This can be realized by using method of QUIXOTE. In some class of protein, the corresponding motif is registereḍ Protein kinase has a motif named protein-kinase-general. Its pattern is "[LIV]-G-x-G-x-[FYM]-[SG]-x-V". Here, expression [SG] signifies S or G; 'x' signifies any amino acid. Tyrosine kinase has a motif named protein-kinase-signatures-tyrosine, while serine/threonine kinase has a motif named proteinkinases-serine/threonine. As a motif of tyrosine kinase or serine/threonine kinase, we can refer to the motif belonging to protein kinase in addition to its own motif. This can be done by using method of QUIXOTE. Also as a motif of protein kinase we can refer to the motif named "ATP/GTP-binding site motif A (P-loop)" belonging to binding-domain by using of method of QUIXOTE. In the original Prosite, the information about cross-linking was described in natural language at as a comment. We read the implicit information and expressed it as explicit information to enable readability by ourselves and the machine. 2.3 Description by DOOD The description of knowledge by QUIXOTE (which is illustrated in Figure 1) is shown in Figure 2 (class structure) and in Figure 3 (entries of motif). In Figure 2, A >= B means that B is a sub-concept of A. Figure 2 There are four motif entries in Figure 3. Among them, the upper three belong to Prosite* and the lowest belongs to User. The specification of module is done by the specification before the double semi-colon. We can specify Prosite* or User or Discovery as modulẹ In the top entry (protein kinase) there is the attribute, othermotif. The attribute enables the system to also refer to the motif belonging to another class (binding-domain). The procedure for referencing is also written using method of QUIXOTE. This multiple inheritance is one of the features of DOOD. Protein kinase doesn't appear to have attribute value of acceptor in Figure 3. But it actually does have attribute value, becuase alcohol-group is inherited as the value from its upper class ec There are two sub-groups of protein kinase, tyrosine kinase and serine/threonine kinase, which have tyrosine and serine or threonine (expressed by acceptor = {serine, threonine} ) as their attribute values. In this case, the inherited attribute + value is overwritten. From a biological point of view, the attribute value becomes more precise because tyrosine, serine and threonine are members of alcohol group. This is another feature of DOOD. Using this information, the acceptor of protein can be detremined more precisely if we find a motif that belongs to higher hierarchy. The identification as tyrosine kinase or serine/threonine

6 -15 Figure 3 kinase gives us more information than identification as protein kinase, the upper concept of the two. 3 Discussion Knowledge Base The Biological Knowledge Base in the system satisfies the four conditions stated in Section 2.1. Requirement 1, the management of multiple levels of reliability, is satisfied by the module feature of QUIXOTE. Requirement 2, the management of multiple knowledge bases having differents owner, doesn't correspond to the system. But, it too can be satisfied by the module feature of QUIXOTE. Requirement 3, the correct representation of biological concepts, is satisfied by multiple inheritance, the overwriting of inheritance and the use of method, which are not special features of QUIXOTE but which are standard features of Deductive Object-Oriented Database. Requirement 4 is satisfied by our endeavor to biology. Motif Discovery Motif discovery using the previous representation [Hirosawa et al.1993a] and that using the current representation [Hirosawa et al.1993b] are compared. The motif discovered using the current biological knowledge base system is the same as these discovered motif using the previous biological knowledge base. The discovered motif is that of protein kinase. The difference is the time required for the discovery. The previous system couldn't notice the motif as being a motif of protein kinase, ATP/GTP-binding site motif A (P-loop), belonging to the binding domain, whereas the present system notices the motif with the help of multiple inheritance. The information of the motif makes the time taken of the present system lower than that of the previous system. This proves that proper representation of knowledge can contribute to the speed-up of the genetic analysis

7 16- Others The use of the module feature of QUIXOTE, used to satisfy Requirement 1 and 2, is not indispensable to our system. And, it can be substituted by another means of description without the feature. But, we think that the module feature will prove useful when the size of the biological knowledge base is increased. 4 Conclusion Biological knowledge base on motif was constructed by the use of DOOD (Deductive Object- Oriented Database) language QUIXOTE. In the biological knowledge base, entangle concepts of biology were described with the multiple inference and non-monotonic features of DOOD (precisely 00 feature). Because knowledge of motif comprises an essential aspect of biological knowledge, we think the framework of representation in this research is also applicable to the representation of biological knowledge of other domains Also, the D (Deductive) feature of DOOD is also suitable for interfacing with the inference module of the system. Consequently, DOOD is suitable for representing a biological knowledge base that is a part of genetic analysis system. 5 Acknowledgment The authors thanks Prof. Fuchi, ex-director of ICOT, for introducing us to knowledg engineering and providing us with the research environment, and also Dr. Uchida, director of ICOT, introducing us to genetic information processing. We also thanks to the QUIXOTE groups for at ICOT, especially H. Tanaka, for their assistance. Finally, the first author gives special thanks to N. Mizuhara and M. Kunimoto at the System development Lab. of Hitachi Ltd. for providing him with the computer environment needed to write the paper. References [Bairoch 1991] Bairoch, A. Prosite : A dictionary of protein site and pattern : User manual Release 7.00, May [Hirosawa et al. 1993a] Hirosawa, M. Hoshiwa, M. Ishikawa, M. Protein Multiple Sequence Alignment using Knowledge. Proceedings of the Twenty-Sixth Annual Hawaii International Conference on System Sciences,Vol.1 pp ,1993. [Hirosawa et al. 1993b] Hirosawa, M., Tanaka, R., Ishikawa, M. Application of Deductive Object-Oriented Knowledge Base to Genetic information Processing. Proceedings of the Interantional Symposium Next Generation Database Systems Their Applications,1993. [HTakaka 1993] Tanaka, H. A Private Knowledge Base for Molecular Biological Research. Proceedings of the Twenty-Sixth Annual Hawaii International Conference on System Sciences,Vol.1 pp ,1993. [Kazic 1993] Kazic, T. Proceedings of the Twenty-Sixth Annual Hawaii International Conference on System Sciences,Vol.1 pp ,1993. [Yasukawa et al. 1992] H.Yasukawa et al. Objects, Properties, and Modules in Quixote. Proc. of FGCS92, pp y89-112, 1992

Bio-Informatics Lectures. A Short Introduction

Bio-Informatics Lectures. A Short Introduction Bio-Informatics Lectures A Short Introduction The History of Bioinformatics Sanger Sequencing PCR in presence of fluorescent, chain-terminating dideoxynucleotides Massively Parallel Sequencing Massively

More information

RETRIEVING SEQUENCE INFORMATION. Nucleotide sequence databases. Database search. Sequence alignment and comparison

RETRIEVING SEQUENCE INFORMATION. Nucleotide sequence databases. Database search. Sequence alignment and comparison RETRIEVING SEQUENCE INFORMATION Nucleotide sequence databases Database search Sequence alignment and comparison Biological sequence databases Originally just a storage place for sequences. Currently the

More information

Effect of Using Neural Networks in GA-Based School Timetabling

Effect of Using Neural Networks in GA-Based School Timetabling Effect of Using Neural Networks in GA-Based School Timetabling JANIS ZUTERS Department of Computer Science University of Latvia Raina bulv. 19, Riga, LV-1050 LATVIA janis.zuters@lu.lv Abstract: - The school

More information

Replication Study Guide

Replication Study Guide Replication Study Guide This study guide is a written version of the material you have seen presented in the replication unit. Self-reproduction is a function of life that human-engineered systems have

More information

ARTIFICIAL INTELLIGENCE METHODS IN EARLY MANUFACTURING TIME ESTIMATION

ARTIFICIAL INTELLIGENCE METHODS IN EARLY MANUFACTURING TIME ESTIMATION 1 ARTIFICIAL INTELLIGENCE METHODS IN EARLY MANUFACTURING TIME ESTIMATION B. Mikó PhD, Z-Form Tool Manufacturing and Application Ltd H-1082. Budapest, Asztalos S. u 4. Tel: (1) 477 1016, e-mail: miko@manuf.bme.hu

More information

Course Outline. 1. COURSE INFORMATION Session Offered Winter 2012 Course Name Biochemistry

Course Outline. 1. COURSE INFORMATION Session Offered Winter 2012 Course Name Biochemistry Course Outline 1. COURSE INFORMATION Session Offered Winter 2012 Course Name Biochemistry Course Code BIOTECH 2BC3 Program Name Biotechnology Calendar Description Biochemistry and biotechnology; amino

More information

Genetomic Promototypes

Genetomic Promototypes Genetomic Promototypes Mirkó Palla and Dana Pe er Department of Mechanical Engineering Clarkson University Potsdam, New York and Department of Genetics Harvard Medical School 77 Avenue Louis Pasteur Boston,

More information

Unraveling protein networks with Power Graph Analysis

Unraveling protein networks with Power Graph Analysis Unraveling protein networks with Power Graph Analysis PLoS Computational Biology, 2008 Loic Royer Matthias Reimann Bill Andreopoulos Michael Schroeder Schroeder Group Bioinformatics 1 Complex Networks

More information

Web-Based Genomic Information Integration with Gene Ontology

Web-Based Genomic Information Integration with Gene Ontology Web-Based Genomic Information Integration with Gene Ontology Kai Xu 1 IMAGEN group, National ICT Australia, Sydney, Australia, kai.xu@nicta.com.au Abstract. Despite the dramatic growth of online genomic

More information

Tudumi: Information Visualization System for Monitoring and Auditing Computer Logs

Tudumi: Information Visualization System for Monitoring and Auditing Computer Logs Tudumi: Information Visualization System for Monitoring and Auditing Computer Logs Tetsuji Takada Satellite Venture Business Lab. University of Electro-Communications zetaka@computer.org Hideki Koike Graduate

More information

Johannes Sametinger. C. Doppler Laboratory for Software Engineering Johannes Kepler University of Linz A-4040 Linz, Austria

Johannes Sametinger. C. Doppler Laboratory for Software Engineering Johannes Kepler University of Linz A-4040 Linz, Austria OBJECT-ORIENTED DOCUMENTATION C. Doppler Laboratory for Software Engineering Johannes Kepler University of Linz A-4040 Linz, Austria Abstract Object-oriented programming improves the reusability of software

More information

REGULATIONS FOR THE DEGREE OF BACHELOR OF SCIENCE IN BIOINFORMATICS (BSc[BioInf])

REGULATIONS FOR THE DEGREE OF BACHELOR OF SCIENCE IN BIOINFORMATICS (BSc[BioInf]) 820 REGULATIONS FOR THE DEGREE OF BACHELOR OF SCIENCE IN BIOINFORMATICS (BSc[BioInf]) (See also General Regulations) BMS1 Admission to the Degree To be eligible for admission to the degree of Bachelor

More information

Co-developing Model for User Participation in Web Application Development

Co-developing Model for User Participation in Web Application Development Co-developing Model for User Participation in Web Application Development Tae Yoneda, Kouhei Mitsui, Jun Sasaki, Yutaka Funyu Iwate Prefectural University, Japan 2005.9.30 Contents 2 Background and Object

More information

Subject Area(s) Biology. Associated Unit Engineering Nature: DNA Visualization and Manipulation. Associated Lesson Imaging the DNA Structure

Subject Area(s) Biology. Associated Unit Engineering Nature: DNA Visualization and Manipulation. Associated Lesson Imaging the DNA Structure Subject Area(s) Biology Associated Unit Engineering Nature: DNA Visualization and Manipulation Associated Lesson Imaging the DNA Structure Activity Title Inside the DNA Header Image 1 ADA Description:

More information

Introduction to Genome Annotation

Introduction to Genome Annotation Introduction to Genome Annotation AGCGTGGTAGCGCGAGTTTGCGAGCTAGCTAGGCTCCGGATGCGA CCAGCTTTGATAGATGAATATAGTGTGCGCGACTAGCTGTGTGTT GAATATATAGTGTGTCTCTCGATATGTAGTCTGGATCTAGTGTTG GTGTAGATGGAGATCGCGTAGCGTGGTAGCGCGAGTTTGCGAGCT

More information

Semantic Search in Portals using Ontologies

Semantic Search in Portals using Ontologies Semantic Search in Portals using Ontologies Wallace Anacleto Pinheiro Ana Maria de C. Moura Military Institute of Engineering - IME/RJ Department of Computer Engineering - Rio de Janeiro - Brazil [awallace,anamoura]@de9.ime.eb.br

More information

LAB 3: Introduction to Domain Modeling and Class Diagram

LAB 3: Introduction to Domain Modeling and Class Diagram LAB 3: Introduction to Domain Modeling and Class Diagram OBJECTIVES Use the UML notation to represent classes and their properties. Perform domain analysis to develop domain class models. Model the structural

More information

An Overview of Knowledge Discovery Database and Data mining Techniques

An Overview of Knowledge Discovery Database and Data mining Techniques An Overview of Knowledge Discovery Database and Data mining Techniques Priyadharsini.C 1, Dr. Antony Selvadoss Thanamani 2 M.Phil, Department of Computer Science, NGM College, Pollachi, Coimbatore, Tamilnadu,

More information

How To Develop Software

How To Develop Software Software Engineering Prof. N.L. Sarda Computer Science & Engineering Indian Institute of Technology, Bombay Lecture-4 Overview of Phases (Part - II) We studied the problem definition phase, with which

More information

Translation Study Guide

Translation Study Guide Translation Study Guide This study guide is a written version of the material you have seen presented in the replication unit. In translation, the cell uses the genetic information contained in mrna to

More information

Gene Expression Macro Version 1.1

Gene Expression Macro Version 1.1 Gene Expression Macro Version 1.1 Instructions Rev B 1 Bio-Rad Gene Expression Macro Users Guide 2004 Bio-Rad Laboratories Table of Contents: Introduction..................................... 3 Opening

More information

Guide for Bioinformatics Project Module 3

Guide for Bioinformatics Project Module 3 Structure- Based Evidence and Multiple Sequence Alignment In this module we will revisit some topics we started to look at while performing our BLAST search and looking at the CDD database in the first

More information

Chapter 23. Database Security. Security Issues. Database Security

Chapter 23. Database Security. Security Issues. Database Security Chapter 23 Database Security Security Issues Legal and ethical issues Policy issues System-related issues The need to identify multiple security levels 2 Database Security A DBMS typically includes a database

More information

Checking Access to Protected Members in the Java Virtual Machine

Checking Access to Protected Members in the Java Virtual Machine Checking Access to Protected Members in the Java Virtual Machine Alessandro Coglio Kestrel Institute 3260 Hillview Avenue, Palo Alto, CA 94304, USA Ph. +1-650-493-6871 Fax +1-650-424-1807 http://www.kestrel.edu/

More information

Data mining and official statistics

Data mining and official statistics Quinta Conferenza Nazionale di Statistica Data mining and official statistics Gilbert Saporta président de la Société française de statistique 5@ S Roma 15, 16, 17 novembre 2000 Palazzo dei Congressi Piazzale

More information

The sequence of bases on the mrna is a code that determines the sequence of amino acids in the polypeptide being synthesized:

The sequence of bases on the mrna is a code that determines the sequence of amino acids in the polypeptide being synthesized: Module 3F Protein Synthesis So far in this unit, we have examined: How genes are transmitted from one generation to the next Where genes are located What genes are made of How genes are replicated How

More information

Protein Analysis and WEKA Data Mining

Protein Analysis and WEKA Data Mining GenMiner: a Data Mining Tool for Protein Analysis Gerasimos Hatzidamianos 1, Sotiris Diplaris 1,2, Ioannis Athanasiadis 1,2, Pericles A. Mitkas 1,2 1 Department of Electrical and Computer Engineering Aristotle

More information

Information Visualization WS 2013/14 11 Visual Analytics

Information Visualization WS 2013/14 11 Visual Analytics 1 11.1 Definitions and Motivation Lot of research and papers in this emerging field: Visual Analytics: Scope and Challenges of Keim et al. Illuminating the path of Thomas and Cook 2 11.1 Definitions and

More information

Nagarjuna College Of

Nagarjuna College Of Nagarjuna College Of Information Technology (Bachelor in Information Management) TRIBHUVAN UNIVERSITY Project Report on World s successful data mining and data warehousing projects Submitted By: Submitted

More information

Using Generalized Forecasts for Online Currency Conversion

Using Generalized Forecasts for Online Currency Conversion Using Generalized Forecasts for Online Currency Conversion Kazuo Iwama and Kouki Yonezawa School of Informatics Kyoto University Kyoto 606-8501, Japan {iwama,yonezawa}@kuis.kyoto-u.ac.jp Abstract. El-Yaniv

More information

Name of pattern types 1 Process control patterns 2 Logic architectural patterns 3 Organizational patterns 4 Analytic patterns 5 Design patterns 6

Name of pattern types 1 Process control patterns 2 Logic architectural patterns 3 Organizational patterns 4 Analytic patterns 5 Design patterns 6 The Researches on Unified Pattern of Information System Deng Zhonghua,Guo Liang,Xia Yanping School of Information Management, Wuhan University Wuhan, Hubei, China 430072 Abstract: This paper discusses

More information

What mathematical optimization can, and cannot, do for biologists. Steven Kelk Department of Knowledge Engineering (DKE) Maastricht University, NL

What mathematical optimization can, and cannot, do for biologists. Steven Kelk Department of Knowledge Engineering (DKE) Maastricht University, NL What mathematical optimization can, and cannot, do for biologists Steven Kelk Department of Knowledge Engineering (DKE) Maastricht University, NL Introduction There is no shortage of literature about the

More information

Specification and Analysis of Contracts Lecture 1 Introduction

Specification and Analysis of Contracts Lecture 1 Introduction Specification and Analysis of Contracts Lecture 1 Introduction Gerardo Schneider gerardo@ifi.uio.no http://folk.uio.no/gerardo/ Department of Informatics, University of Oslo SEFM School, Oct. 27 - Nov.

More information

Course Curriculum for Master Degree in Medical Laboratory Sciences/Clinical Biochemistry

Course Curriculum for Master Degree in Medical Laboratory Sciences/Clinical Biochemistry Course Curriculum for Master Degree in Medical Laboratory Sciences/Clinical Biochemistry The Master Degree in Medical Laboratory Sciences /Clinical Biochemistry, is awarded by the Faculty of Graduate Studies

More information

Evolutionary Detection of Rules for Text Categorization. Application to Spam Filtering

Evolutionary Detection of Rules for Text Categorization. Application to Spam Filtering Advances in Intelligent Systems and Technologies Proceedings ECIT2004 - Third European Conference on Intelligent Systems and Technologies Iasi, Romania, July 21-23, 2004 Evolutionary Detection of Rules

More information

InSyBio BioNets: Utmost efficiency in gene expression data and biological networks analysis

InSyBio BioNets: Utmost efficiency in gene expression data and biological networks analysis InSyBio BioNets: Utmost efficiency in gene expression data and biological networks analysis WHITE PAPER By InSyBio Ltd Konstantinos Theofilatos Bioinformatician, PhD InSyBio Technical Sales Manager August

More information

Healthcare Measurement Analysis Using Data mining Techniques

Healthcare Measurement Analysis Using Data mining Techniques www.ijecs.in International Journal Of Engineering And Computer Science ISSN:2319-7242 Volume 03 Issue 07 July, 2014 Page No. 7058-7064 Healthcare Measurement Analysis Using Data mining Techniques 1 Dr.A.Shaik

More information

Committee on WIPO Standards (CWS)

Committee on WIPO Standards (CWS) E CWS/1/5 ORIGINAL: ENGLISH DATE: OCTOBER 13, 2010 Committee on WIPO Standards (CWS) First Session Geneva, October 25 to 29, 2010 PROPOSAL FOR THE PREPARATION OF A NEW WIPO STANDARD ON THE PRESENTATION

More information

Week 1: Introduction. Transcript of Week 1 Podcast

Week 1: Introduction. Transcript of Week 1 Podcast Transcript of Week 1 Podcast Week 1: Introduction This is the Week 1 Podcast for the course, Information System and Service Analysis and Design. This week is about Project Management. Systems development

More information

DATA MINING TECHNOLOGY. Keywords: data mining, data warehouse, knowledge discovery, OLAP, OLAM.

DATA MINING TECHNOLOGY. Keywords: data mining, data warehouse, knowledge discovery, OLAP, OLAM. DATA MINING TECHNOLOGY Georgiana Marin 1 Abstract In terms of data processing, classical statistical models are restrictive; it requires hypotheses, the knowledge and experience of specialists, equations,

More information

Fragment Assembly Problem: Sequencing the Human Genome

Fragment Assembly Problem: Sequencing the Human Genome Fragment Assembly Problem: Sequencing the Human Genome Time 1-2 class periods (judge based on group of students) Level High school biology or computer science Purpose This activity helps develop students

More information

Biological Sequence Data Formats

Biological Sequence Data Formats Biological Sequence Data Formats Here we present three standard formats in which biological sequence data (DNA, RNA and protein) can be stored and presented. Raw Sequence: Data without description. FASTA

More information

Application of Graph-based Data Mining to Metabolic Pathways

Application of Graph-based Data Mining to Metabolic Pathways Application of Graph-based Data Mining to Metabolic Pathways Chang Hun You, Lawrence B. Holder, Diane J. Cook School of Electrical Engineering and Computer Science Washington State University Pullman,

More information

Overview of the TACITUS Project

Overview of the TACITUS Project Overview of the TACITUS Project Jerry R. Hobbs Artificial Intelligence Center SRI International 1 Aims of the Project The specific aim of the TACITUS project is to develop interpretation processes for

More information

A Survey of Evolutionary Algorithms for Data Mining and Knowledge Discovery

A Survey of Evolutionary Algorithms for Data Mining and Knowledge Discovery A Survey of Evolutionary Algorithms for Data Mining and Knowledge Discovery Alex A. Freitas Postgraduate Program in Computer Science, Pontificia Universidade Catolica do Parana Rua Imaculada Conceicao,

More information

How To Find Influence Between Two Concepts In A Network

How To Find Influence Between Two Concepts In A Network 2014 UKSim-AMSS 16th International Conference on Computer Modelling and Simulation Influence Discovery in Semantic Networks: An Initial Approach Marcello Trovati and Ovidiu Bagdasar School of Computing

More information

Mathematics Cognitive Domains Framework: TIMSS 2003 Developmental Project Fourth and Eighth Grades

Mathematics Cognitive Domains Framework: TIMSS 2003 Developmental Project Fourth and Eighth Grades Appendix A Mathematics Cognitive Domains Framework: TIMSS 2003 Developmental Project Fourth and Eighth Grades To respond correctly to TIMSS test items, students need to be familiar with the mathematics

More information

Big Data Mining Services and Knowledge Discovery Applications on Clouds

Big Data Mining Services and Knowledge Discovery Applications on Clouds Big Data Mining Services and Knowledge Discovery Applications on Clouds Domenico Talia DIMES, Università della Calabria & DtoK Lab Italy talia@dimes.unical.it Data Availability or Data Deluge? Some decades

More information

Computational Systems Biology. Lecture 2: Enzymes

Computational Systems Biology. Lecture 2: Enzymes Computational Systems Biology Lecture 2: Enzymes 1 Images from: David L. Nelson, Lehninger Principles of Biochemistry, IV Edition, Freeman ed. or under creative commons license (search for images at http://search.creativecommons.org/)

More information

Information Technology and Knowledge Management

Information Technology and Knowledge Management Information Technology and Knowledge Management E. Shimemura and Y. Nakamori Japan Advanced Institute of Science and Technology 1-1 Asahidai, Tatsunokuchi, Ishikawa 923-1292, Japan Abstract This paper

More information

THE IMPACT OF INHERITANCE ON SECURITY IN OBJECT-ORIENTED DATABASE SYSTEMS

THE IMPACT OF INHERITANCE ON SECURITY IN OBJECT-ORIENTED DATABASE SYSTEMS THE IMPACT OF INHERITANCE ON SECURITY IN OBJECT-ORIENTED DATABASE SYSTEMS David L. Spooner Computer Science Department Rensselaer Polytechnic Institute Troy, New York 12180 The object-oriented programming

More information

High-Performance IP Service Node with Layer 4 to 7 Packet Processing Features

High-Performance IP Service Node with Layer 4 to 7 Packet Processing Features UDC 621.395.31:681.3 High-Performance IP Service Node with Layer 4 to 7 Packet Processing Features VTsuneo Katsuyama VAkira Hakata VMasafumi Katoh VAkira Takeyama (Manuscript received February 27, 2001)

More information

A Framework of Model-Driven Web Application Testing

A Framework of Model-Driven Web Application Testing A Framework of Model-Driven Web Application Testing Nuo Li, Qin-qin Ma, Ji Wu, Mao-zhong Jin, Chao Liu Software Engineering Institute, School of Computer Science and Engineering, Beihang University, China

More information

Knowledge-based Approach in Information Systems Life Cycle and Information Systems Architecture

Knowledge-based Approach in Information Systems Life Cycle and Information Systems Architecture 5 th Slovakian-Hungarian Joint Symposium on Applied Machine Intelligence and Informatics January 25-26, 2007 Poprad, Slovakia Knowledge-based Approach in Information Systems Life Cycle and Information

More information

3 SOFTWARE AND PROGRAMMING LANGUAGES

3 SOFTWARE AND PROGRAMMING LANGUAGES 3 SOFTWARE AND PROGRAMMING LANGUAGES 3.1 INTRODUCTION In the previous lesson we discussed about the different parts and configurations of computer. It has been mentioned that programs or instructions have

More information

11 Tips to make the requirements definition process more effective and results more usable

11 Tips to make the requirements definition process more effective and results more usable 1 11 Tips to make the s definition process more effective and results more usable This article discusses what I believe are the key techniques for making s definition process repeatable from project to

More information

Some Research Challenges for Big Data Analytics of Intelligent Security

Some Research Challenges for Big Data Analytics of Intelligent Security Some Research Challenges for Big Data Analytics of Intelligent Security Yuh-Jong Hu hu at cs.nccu.edu.tw Emerging Network Technology (ENT) Lab. Department of Computer Science National Chengchi University,

More information

ARCHITECTURE DESIGN OF SECURITY SYSTEM

ARCHITECTURE DESIGN OF SECURITY SYSTEM Trakia Journal of Sciences, Vol. 8, No. 3, pp 77-82, 2010 Copyright 2009 Trakia University Available online at: http://www.uni-sz.bg ISSN 1313-7050 (print) ISSN 1313-3551 (online) Review ARCHITECTURE DESIGN

More information

O&M Service for Megasolar Power Plants and Precise Monitoring Techniques for PV Modules

O&M Service for Megasolar Power Plants and Precise Monitoring Techniques for PV Modules 403 Hitachi Review Vol. 63 (2014), No. 7 Featured Articles O&M Service for Megasolar Power Plants and Precise Monitoring Techniques for PV Modules Tohru Kohno Tetsuharu Ohya Tomoharu Nakamura OVERVIEW:

More information

Introduction to Bioinformatics AS 250.265 Laboratory Assignment 6

Introduction to Bioinformatics AS 250.265 Laboratory Assignment 6 Introduction to Bioinformatics AS 250.265 Laboratory Assignment 6 In the last lab, you learned how to perform basic multiple sequence alignments. While useful in themselves for determining conserved residues

More information

A Multiple DNA Sequence Translation Tool Incorporating Web Robot and Intelligent Recommendation Techniques

A Multiple DNA Sequence Translation Tool Incorporating Web Robot and Intelligent Recommendation Techniques Proceedings of the 2007 WSEAS International Conference on Computer Engineering and Applications, Gold Coast, Australia, January 17-19, 2007 402 A Multiple DNA Sequence Translation Tool Incorporating Web

More information

FINDING RELATION BETWEEN AGING AND

FINDING RELATION BETWEEN AGING AND FINDING RELATION BETWEEN AGING AND TELOMERE BY APRIORI AND DECISION TREE Jieun Sung 1, Youngshin Joo, and Taeseon Yoon 1 Department of National Science, Hankuk Academy of Foreign Studies, Yong-In, Republic

More information

On Efficient Collaboration between Lawyers and Software Engineers when Transforming Legal Regulations to Law-related Requirements

On Efficient Collaboration between Lawyers and Software Engineers when Transforming Legal Regulations to Law-related Requirements Proceedings of the 2n d International Conference on Information Technology, ICIT 2010 28-30 June 2010, Gdansk, Poland. On Efficient Collaboration between Lawyers and Software Engineers when Transforming

More information

Feed Forward Loops in Biological Systems

Feed Forward Loops in Biological Systems Feed Forward Loops in Biological Systems Dr. M. Vijayalakshmi School of Chemical and Biotechnology SASTRA University Joint Initiative of IITs and IISc Funded by MHRD Page 1 of 7 Table of Contents 1 INTRODUCTION...

More information

Lab 2/Phylogenetics/September 16, 2002 1 PHYLOGENETICS

Lab 2/Phylogenetics/September 16, 2002 1 PHYLOGENETICS Lab 2/Phylogenetics/September 16, 2002 1 Read: Tudge Chapter 2 PHYLOGENETICS Objective of the Lab: To understand how DNA and protein sequence information can be used to make comparisons and assess evolutionary

More information

Biological Databases and Protein Sequence Analysis

Biological Databases and Protein Sequence Analysis Biological Databases and Protein Sequence Analysis Introduction M. Madan Babu, Center for Biotechnology, Anna University, Chennai 25, India Bioinformatics is the application of Information technology to

More information

REQUIREMENTS FOR THE WORKFLOW-BASED SUPPORT OF RELEASE MANAGEMENT PROCESSES IN THE AUTOMOTIVE SECTOR

REQUIREMENTS FOR THE WORKFLOW-BASED SUPPORT OF RELEASE MANAGEMENT PROCESSES IN THE AUTOMOTIVE SECTOR REQUIREMENTS FOR THE WORKFLOW-BASED SUPPORT OF RELEASE MANAGEMENT PROCESSES IN THE AUTOMOTIVE SECTOR Ulrich Bestfleisch, Joachim Herbst DaimlerChrysler AG Research and Technology Data and Process Management

More information

Questions? Assignment. Techniques for Gathering Requirements. Gathering and Analysing Requirements

Questions? Assignment. Techniques for Gathering Requirements. Gathering and Analysing Requirements Questions? Assignment Why is proper project management important? What is goal of domain analysis? What is the difference between functional and non- functional requirements? Why is it important for requirements

More information

BIO 3350: ELEMENTS OF BIOINFORMATICS PARTIALLY ONLINE SYLLABUS

BIO 3350: ELEMENTS OF BIOINFORMATICS PARTIALLY ONLINE SYLLABUS BIO 3350: ELEMENTS OF BIOINFORMATICS PARTIALLY ONLINE SYLLABUS NEW YORK CITY COLLEGE OF TECHNOLOGY The City University Of New York School of Arts and Sciences Biological Sciences Department Course title:

More information

Training Management System for Aircraft Engineering: indexing and retrieval of Corporate Learning Object

Training Management System for Aircraft Engineering: indexing and retrieval of Corporate Learning Object Training Management System for Aircraft Engineering: indexing and retrieval of Corporate Learning Object Anne Monceaux 1, Joanna Guss 1 1 EADS-CCR, Centreda 1, 4 Avenue Didier Daurat 31700 Blagnac France

More information

DATA MINING - SELECTED TOPICS

DATA MINING - SELECTED TOPICS DATA MINING - SELECTED TOPICS Peter Brezany Institute for Software Science University of Vienna E-mail : brezany@par.univie.ac.at 1 MINING SPATIAL DATABASES 2 Spatial Database Systems SDBSs offer spatial

More information

Improving Knowledge-Based System Performance by Reordering Rule Sequences

Improving Knowledge-Based System Performance by Reordering Rule Sequences Improving Knowledge-Based System Performance by Reordering Rule Sequences Neli P. Zlatareva Department of Computer Science Central Connecticut State University 1615 Stanley Street New Britain, CT 06050

More information

1-04-10 Configuration Management: An Object-Based Method Barbara Dumas

1-04-10 Configuration Management: An Object-Based Method Barbara Dumas 1-04-10 Configuration Management: An Object-Based Method Barbara Dumas Payoff Configuration management (CM) helps an organization maintain an inventory of its software assets. In traditional CM systems,

More information

A Knowledge-based Product Derivation Process and some Ideas how to Integrate Product Development

A Knowledge-based Product Derivation Process and some Ideas how to Integrate Product Development A Knowledge-based Product Derivation Process and some Ideas how to Integrate Product Development (Position paper) Lothar Hotz and Andreas Günter HITeC c/o Fachbereich Informatik Universität Hamburg Hamburg,

More information

FOR IMMEDIATE RELEASE

FOR IMMEDIATE RELEASE FOR IMMEDIATE RELEASE Hitachi Developed Basic Artificial Intelligence Technology that Enables Logical Dialogue Analyzes huge volumes of text data on issues under debate, and presents reasons and grounds

More information

A leader in the development and application of information technology to prevent and treat disease.

A leader in the development and application of information technology to prevent and treat disease. A leader in the development and application of information technology to prevent and treat disease. About MOLECULAR HEALTH Molecular Health was founded in 2004 with the vision of changing healthcare. Today

More information

Extraction and Visualization of Protein-Protein Interactions from PubMed

Extraction and Visualization of Protein-Protein Interactions from PubMed Extraction and Visualization of Protein-Protein Interactions from PubMed Ulf Leser Knowledge Management in Bioinformatics Humboldt-Universität Berlin Finding Relevant Knowledge Find information about Much

More information

Classnotes 5: 1. Design and Information Flow A data flow diagram (DFD) is a graphical technique that is used to depict information flow, i.e.

Classnotes 5: 1. Design and Information Flow A data flow diagram (DFD) is a graphical technique that is used to depict information flow, i.e. Classnotes 5: 1. Design and Information Flow A data flow diagram (DFD) is a graphical technique that is used to depict information flow, i.e., a representation of information as a continuous flow that

More information

M.Sc. in Nano Technology with specialisation in Nano Biotechnology

M.Sc. in Nano Technology with specialisation in Nano Biotechnology M.Sc. in Nano Technology with specialisation in Nano Biotechnology Nanotechnology is all about designing, fabricating and controlling materials, components and machinery with dimensions on the nanoscale,

More information

clooca : Web based tool for Domain Specific Modeling

clooca : Web based tool for Domain Specific Modeling clooca : Web based tool for Domain Specific Modeling Shuhei Hiya, Kenji Hisazumi, Akira Fukuda, and Tsuneo Nakanishi Kyushu University 744 Motooka Nishi-ku, Fukuoka 819-0395, Japan {hiya,nel,fukuda,tun@f.ait.kyushu-u.ac.jp

More information

WiSoft - Effective analysis software packages

WiSoft - Effective analysis software packages Intuitive, Powerful, Precise Smart, Modular, Versatile WiSoft - Effective analysis software packages A sophisticated image processing algorithmic library, operated by a unique and simple software interface

More information

ISSN: 2321-7782 (Online) Volume 3, Issue 7, July 2015 International Journal of Advance Research in Computer Science and Management Studies

ISSN: 2321-7782 (Online) Volume 3, Issue 7, July 2015 International Journal of Advance Research in Computer Science and Management Studies ISSN: 2321-7782 (Online) Volume 3, Issue 7, July 2015 International Journal of Advance Research in Computer Science and Management Studies Research Article / Survey Paper / Case Study Available online

More information

Selvita Integrated drug discovery collaborations

Selvita Integrated drug discovery collaborations Selvita Integrated drug discovery collaborations Magdalena Żabka Selvita overview Origins Polish biotechnology research company headquartered in Krakow Mission Resources We deliver comprehensive solutions

More information

Database Optimizing Services

Database Optimizing Services Database Systems Journal vol. I, no. 2/2010 55 Database Optimizing Services Adrian GHENCEA 1, Immo GIEGER 2 1 University Titu Maiorescu Bucharest, Romania 2 Bodenstedt-Wilhelmschule Peine, Deutschland

More information

200630 - FBIO - Fundations of Bioinformatics

200630 - FBIO - Fundations of Bioinformatics Coordinating unit: Teaching unit: Academic year: Degree: ECTS credits: 2015 200 - FME - School of Mathematics and Statistics 1004 - UB - (ENG)Universitat de Barcelona MASTER'S DEGREE IN STATISTICS AND

More information

Doctor of Philosophy in Computer Science

Doctor of Philosophy in Computer Science Doctor of Philosophy in Computer Science Background/Rationale The program aims to develop computer scientists who are armed with methods, tools and techniques from both theoretical and systems aspects

More information

A Conceptual Approach to Data Visualization for User Interface Design of Smart Grid Operation Tools

A Conceptual Approach to Data Visualization for User Interface Design of Smart Grid Operation Tools A Conceptual Approach to Data Visualization for User Interface Design of Smart Grid Operation Tools Dong-Joo Kang and Sunju Park Yonsei University unlimit0909@hotmail.com, boxenju@yonsei.ac.kr Abstract

More information

For additional information on the program, see the current university catalog.

For additional information on the program, see the current university catalog. For information call: Tel: (818) 77-81 Fax: (818) 77-08 E-mail: chemistry.office@csun.edu Website: http://www.csun.edu/chemistry Or write: Department of Chemistry and Biochemistry California State University,

More information

2 AIMS: an Agent-based Intelligent Tool for Informational Support

2 AIMS: an Agent-based Intelligent Tool for Informational Support Aroyo, L. & Dicheva, D. (2000). Domain and user knowledge in a web-based courseware engineering course, knowledge-based software engineering. In T. Hruska, M. Hashimoto (Eds.) Joint Conference knowledge-based

More information

Gouvernement du Québec Ministère de l Éducation, 2004 04-00808 ISBN 2-550-43538-9

Gouvernement du Québec Ministère de l Éducation, 2004 04-00808 ISBN 2-550-43538-9 Gouvernement du Québec Ministère de l Éducation, 2004 04-00808 ISBN 2-550-43538-9 Legal deposit Bibliothèque nationale du Québec, 2004 1. INTRODUCTION This Definition of the Domain for Summative Evaluation

More information

Depth-of-Knowledge Levels for Four Content Areas Norman L. Webb March 28, 2002. Reading (based on Wixson, 1999)

Depth-of-Knowledge Levels for Four Content Areas Norman L. Webb March 28, 2002. Reading (based on Wixson, 1999) Depth-of-Knowledge Levels for Four Content Areas Norman L. Webb March 28, 2002 Language Arts Levels of Depth of Knowledge Interpreting and assigning depth-of-knowledge levels to both objectives within

More information

University Uses Business Intelligence Software to Boost Gene Research

University Uses Business Intelligence Software to Boost Gene Research Microsoft SQL Server 2008 R2 Customer Solution Case Study University Uses Business Intelligence Software to Boost Gene Research Overview Country or Region: Scotland Industry: Education Customer Profile

More information

Data Integration via Constrained Clustering: An Application to Enzyme Clustering

Data Integration via Constrained Clustering: An Application to Enzyme Clustering Data Integration via Constrained Clustering: An Application to Enzyme Clustering Elisa Boari de Lima Raquel Cardoso de Melo Minardi Wagner Meira Jr. Mohammed Javeed Zaki Abstract When multiple data sources

More information

Data Mining and Soft Computing. Francisco Herrera

Data Mining and Soft Computing. Francisco Herrera Francisco Herrera Research Group on Soft Computing and Information Intelligent Systems (SCI 2 S) Dept. of Computer Science and A.I. University of Granada, Spain Email: herrera@decsai.ugr.es http://sci2s.ugr.es

More information

(Refer Slide Time: 01:52)

(Refer Slide Time: 01:52) Software Engineering Prof. N. L. Sarda Computer Science & Engineering Indian Institute of Technology, Bombay Lecture - 2 Introduction to Software Engineering Challenges, Process Models etc (Part 2) This

More information

Pipe Cleaner Proteins. Essential question: How does the structure of proteins relate to their function in the cell?

Pipe Cleaner Proteins. Essential question: How does the structure of proteins relate to their function in the cell? Pipe Cleaner Proteins GPS: SB1 Students will analyze the nature of the relationships between structures and functions in living cells. Essential question: How does the structure of proteins relate to their

More information

Abstract. The DNA promoter sequences domain theory and database have become popular for

Abstract. The DNA promoter sequences domain theory and database have become popular for Journal of Artiæcial Intelligence Research 2 è1995è 361í367 Submitted 8è94; published 3è95 Research Note On the Informativeness of the DNA Promoter Sequences Domain Theory Julio Ortega Computer Science

More information

MYRIAD, HITACHI, ORACLE & FRIEDLI JOIN FORCES TO MAP THE ENTIRE HUMAN PROTEOME

MYRIAD, HITACHI, ORACLE & FRIEDLI JOIN FORCES TO MAP THE ENTIRE HUMAN PROTEOME FOR IMMEDIATE RELEASE MYRIAD, HITACHI, ORACLE & FRIEDLI JOIN FORCES TO MAP THE ENTIRE HUMAN PROTEOME - $185 Million Collaboration to Determine All Human Protein Interactions And Decipher Biochemical Pathways

More information

Using Semantic Data Mining for Classification Improvement and Knowledge Extraction

Using Semantic Data Mining for Classification Improvement and Knowledge Extraction Using Semantic Data Mining for Classification Improvement and Knowledge Extraction Fernando Benites and Elena Sapozhnikova University of Konstanz, 78464 Konstanz, Germany. Abstract. The objective of this

More information