Intégration de connaissances ontologiques pour l apprentissage des réseaux bayésiens

Size: px
Start display at page:

Download "Intégration de connaissances ontologiques pour l apprentissage des réseaux bayésiens"

Transcription

1 Bayesian networks Ontologies BNs and Ontologies Intégration de connaissances ontologiques pour l apprentissage des réseaux bayésiens Montassar Ben Messaoud 12, Mouna Ben Ishak 12, Philippe Leray 2, Nahla Ben Amor 1 1 LARODEC, ISG, Université de Tunis, Tunisie 2 Connaissances et Décision, LINA UMR 6241, Nantes, France Colloque Apprentissage Artificiel & Fouille de Données 28 juin 2012, Univ. Paris 13 Montassar Ben Messaoud, Mouna Ben Ishak, Philippe Leray, Nahla Ben Apprentissage Amor RB et Ontologies 1/22

2 Bayesian networks Ontologies BNs and Ontologies Motivations Knowledge-based systems aim to make expertise available for decision making and information sharing To resolve some complex problems, the combination of different knowledge-based systems can be very powerful Our idea Combine Two KBSs : Bayesian Networks and Ontologies, to help both Montassar Ben Messaoud, Mouna Ben Ishak, Philippe Leray, Nahla Ben Apprentissage Amor RB et Ontologies 2/22

3 Bayesian networks Ontologies BNs and Ontologies Motivations Knowledge-based systems aim to make expertise available for decision making and information sharing To resolve some complex problems, the combination of different knowledge-based systems can be very powerful Our idea Combine Two KBSs : Bayesian Networks and Ontologies, to help both Montassar Ben Messaoud, Mouna Ben Ishak, Philippe Leray, Nahla Ben Apprentissage Amor RB et Ontologies 2/22

4 Bayesian networks Ontologies BNs and Ontologies Motivations Knowledge-based systems aim to make expertise available for decision making and information sharing To resolve some complex problems, the combination of different knowledge-based systems can be very powerful Our idea Combine Two KBSs : Bayesian Networks and Ontologies, to help both Montassar Ben Messaoud, Mouna Ben Ishak, Philippe Leray, Nahla Ben Apprentissage Amor RB et Ontologies 2/22

5 Bayesian networks Ontologies BNs and Ontologies Outline... 1 Bayesian networks BN definition BN learning 2 Ontologies Ontology definition Ontology learning 3 BNs and Ontologies Existing work Our proposal Montassar Ben Messaoud, Mouna Ben Ishak, Philippe Leray, Nahla Ben Apprentissage Amor RB et Ontologies 3/22

6 Bayesian networks Ontologies BNs and Ontologies Bayesian network definition Definition [Pearl, 1985] A Bayesian network (BN) is defined by one qualitative description of (conditional) dependences / independences between variables directed acyclic graph (DAG) one quantitative description of these dependences conditional probability distributions (CPDs) Montassar Ben Messaoud, Mouna Ben Ishak, Philippe Leray, Nahla Ben Apprentissage Amor RB et Ontologies 4/22

7 Bayesian networks Ontologies BNs and Ontologies Example one topological order : B, E, A, R, T (not unique) P(Burglary)=[ ] P(Earthquake)=[ ] B Burglary E Earthquake P(Radio Earthquake) Earthquake = Y N Radio=Y Radio=N P(TV Radio) A Alarm P(Alarm Burglary,Earthquake) R Radio Radio = Y N TV=Y TV=N Burglary,Earthquake = Y,Y Y,N N,Y N,N Alarm=Y Alarm=N Montassar Ben Messaoud, Mouna Ben Ishak, Philippe Leray, Nahla Ben Apprentissage Amor RB et Ontologies 5/22 T TV

8 Bayesian networks Ontologies BNs and Ontologies Consequence Chain rule P(S) = P(S 1 ) P(S 2 S 1 ) P(S 3 S 1, S 2 ) P(S n S 1... S n 1 ) Consequence with a BN P(S i S 1... S i 1 ) = P(S i parents(s i )) so P(S) = Π n i=1 P(S i parents(s i )) The (global) joint probability distribution is decomposed in a product of (local) conditional distributions BN = compact representation of the joint distribution P(S) given some information about dependence relationships between variables Montassar Ben Messaoud, Mouna Ben Ishak, Philippe Leray, Nahla Ben Apprentissage Amor RB et Ontologies 6/22

9 Bayesian networks Ontologies BNs and Ontologies Consequence Chain rule P(S) = P(S 1 ) P(S 2 S 1 ) P(S 3 S 1, S 2 ) P(S n S 1... S n 1 ) Consequence with a BN P(S i S 1... S i 1 ) = P(S i parents(s i )) so P(S) = Π n i=1 P(S i parents(s i )) The (global) joint probability distribution is decomposed in a product of (local) conditional distributions BN = compact representation of the joint distribution P(S) given some information about dependence relationships between variables Montassar Ben Messaoud, Mouna Ben Ishak, Philippe Leray, Nahla Ben Apprentissage Amor RB et Ontologies 6/22

10 Bayesian networks Ontologies BNs and Ontologies Markov equivalence Definition B 1 and B 2 are Markov equivalent iff both describe exactly the same conditional (in)dependence statements Graphical properties B 1 and B 2 have the same skeleton, V-structures and inferred edges All the equivalent graphs (= equivalence class) can be summarized by one partially directed DAG named CPDAG or Essential Graph Montassar Ben Messaoud, Mouna Ben Ishak, Philippe Leray, Nahla Ben Apprentissage Amor RB et Ontologies 7/22

11 Bayesian networks Ontologies BNs and Ontologies Markov equivalence Definition B 1 and B 2 are Markov equivalent iff both describe exactly the same conditional (in)dependence statements Graphical properties B 1 and B 2 have the same skeleton, V-structures and inferred edges All the equivalent graphs (= equivalence class) can be summarized by one partially directed DAG named CPDAG or Essential Graph Montassar Ben Messaoud, Mouna Ben Ishak, Philippe Leray, Nahla Ben Apprentissage Amor RB et Ontologies 7/22

12 Bayesian networks Ontologies BNs and Ontologies Markov equivalence A S A S T L B T L B O O X D X D Montassar Ben Messaoud, Mouna Ben Ishak, Philippe Leray, Nahla Ben Apprentissage Amor RB et Ontologies 8/22

13 Bayesian networks Ontologies BNs and Ontologies BN structure learning How to build / learn a Bayesian Network? 1 DAG is known, how to determine the CPDs? from experts : knowledge elicitation from complete data / incomplete data 2 DAG is unknown, how to determine it (or the CPDAG)? from complete data / incomplete data Montassar Ben Messaoud, Mouna Ben Ishak, Philippe Leray, Nahla Ben Apprentissage Amor RB et Ontologies 9/22

14 Bayesian networks Ontologies BNs and Ontologies Structure learning is a complex task Size of the solution space The number of possible DAGs with n variables is super-exponential w.r.t n (Robinson 77) { NS(n) = 1, n = 0 or 1 n i=1 ( 1)i+1( n i ) 2 i(n 1) NS(n i), n > 1 NS(5) = NS(10) = An exhaustive search is impossible! One thousand millenniums = seconds Montassar Ben Messaoud, Mouna Ben Ishak, Philippe Leray, Nahla Ben Apprentissage Amor RB et Ontologies 10/22

15 Bayesian networks Ontologies BNs and Ontologies Structure learning algorithms How to search a good BN? Constraint-based methods BN = independence model find CI in data in order to build the DAG Score-based methods BN = probabilistic model that must fit data as well as possible search the DAG space in order to maximize a scoring function Hybrid methods Some issues Identifiability : learning algorithms can t distinguish between any structure in the same equivalence class more Montassar Ben Messaoud, Mouna Ben Ishak, Philippe Leray, Nahla Ben Apprentissage Amor RB et Ontologies 11/22

16 Bayesian networks Ontologies BNs and Ontologies Structure learning algorithms How to search a good BN? Constraint-based methods BN = independence model find CI in data in order to build the DAG Score-based methods BN = probabilistic model that must fit data as well as possible search the DAG space in order to maximize a scoring function Hybrid methods Some issues Learning algorithms can deal with a priori knowledge in order to reduce search space : white list, black list,node ordering, repetition of local structures... more Montassar Ben Messaoud, Mouna Ben Ishak, Philippe Leray, Nahla Ben Apprentissage Amor RB et Ontologies 11/22

17 Bayesian networks Ontologies BNs and Ontologies Outline... 1 Bayesian networks BN definition BN learning 2 Ontologies Ontology definition Ontology learning 3 BNs and Ontologies Existing work Our proposal Montassar Ben Messaoud, Mouna Ben Ishak, Philippe Leray, Nahla Ben Apprentissage Amor RB et Ontologies 12/22

18 Bayesian networks Ontologies BNs and Ontologies Ontology Basics Definition [Gruber, 1993] shared understanding within a community of people declarative specification of entities and their relationships with each other separate the domain knowledge from the operational knowledge Construction/Evolution: expertise and/or machine learning Reasoning: description logic reasoners Montassar Ben Messaoud, Mouna Ben Ishak, Philippe Leray, Nahla Ben Apprentissage Amor RB et Ontologies 13/22

19 Bayesian networks Ontologies BNs and Ontologies Elements of an ontology C: Classes (concepts) P: Attributes (properties) H: Hierarchical structure (is-a, part-of relations) R: Other semantic relationships I: Instances (individuals) Fire Flood Landslide Catastrophes Tsunami Volcano A: Axioms (logic statements) Earthquake Montassar Ben Messaoud, Mouna Ben Ishak, Philippe Leray, Nahla Ben Apprentissage Amor RB et Ontologies 14/22

20 Bayesian networks Ontologies BNs and Ontologies Elements of an ontology C: Classes (concepts) Landslide P: Attributes (properties) H: Hierarchical structure (is-a, part-of relations) Fire Catastrophes Tsunami R: Other semantic relationships I: Instances (individuals) Flood Name Localization Date Nb_killed Volcano A: Axioms (logic statements) Earthquake Name Mgnitude Localization Date Nb_killed Montassar Ben Messaoud, Mouna Ben Ishak, Philippe Leray, Nahla Ben Apprentissage Amor RB et Ontologies 14/22

21 Bayesian networks Ontologies BNs and Ontologies Elements of an ontology C: Classes (concepts) P: Attributes (properties) Catastrophes H: Hierarchical structure (is-a, part-of relations) R: Other semantic relationships is-a Man-made is-a Natural I: Instances (individuals) is-a is-a is-a is-a is-a is-a A: Axioms (logic statements) Flood Fire Earthquake Landslide Volcano Tsunami Montassar Ben Messaoud, Mouna Ben Ishak, Philippe Leray, Nahla Ben Apprentissage Amor RB et Ontologies 14/22

22 Bayesian networks Ontologies BNs and Ontologies Elements of an ontology C: Classes (concepts) P: Attributes (properties) H: Hierarchical structure (is-a, part-of relations) R: Other semantic relationships I: Instances (individuals) A: Axioms (logic statements) Catastrophes is-a is-a Man-made Natural is-a is-a is-a is-a is-a is-a Flood Fire Earthquake Landslide Volcano Tsunami Causes Causes Causes Causes Causes Causes Montassar Ben Messaoud, Mouna Ben Ishak, Philippe Leray, Nahla Ben Apprentissage Amor RB et Ontologies 14/22

23 Bayesian networks Ontologies BNs and Ontologies Elements of an ontology C: Classes (concepts) Catastrophes P: Attributes (properties) H: Hierarchical structure (is-a, part-of relations) is-a Man-made is-a Natural R: Other semantic relationships is-a is-a is-a is-a is-a is-a I: Instances (individuals) Flood Fire Earthquake Landslide Volcano Tsunami A: Axioms (logic statements) is-instance is-instance is-instance Aleppo Shaanxi Haiti Montassar Ben Messaoud, Mouna Ben Ishak, Philippe Leray, Nahla Ben Apprentissage Amor RB et Ontologies 14/22

24 Bayesian networks Ontologies BNs and Ontologies Elements of an ontology C: Classes (concepts) Catastrophes P: Attributes (properties) H: Hierarchical structure (is-a, part-of relations) is-a Man-made is-a Natural R: Other semantic relationships is-a is-a is-a is-a is-a is-a I: Instances (individuals) Flood Fire Earthquake Landslide Volcano Tsunami A: Axioms (logic statements) is-instance is-instance is-instance Aleppo Shaanxi Haiti Montassar Ben Messaoud, Mouna Ben Ishak, Philippe Leray, Nahla Ben Apprentissage Amor RB et Ontologies 14/22

25 Bayesian networks Ontologies BNs and Ontologies Ontology learning : subtasks Ontology population Get new instances of concept(s) already present in the ontology Ontology enrichment Update (add or modify) concepts, properties and relations in a given ontology Evolution vs. revolution Evolution (ontology continuity): Add new knowledge Revolution (ontology discontinuity): Modify existing knowledge Montassar Ben Messaoud, Mouna Ben Ishak, Philippe Leray, Nahla Ben Apprentissage Amor RB et Ontologies 15/22

26 Bayesian networks Ontologies BNs and Ontologies Ontology learning : subtasks Ontology population Get new instances of concept(s) already present in the ontology Ontology enrichment Update (add or modify) concepts, properties and relations in a given ontology Evolution vs. revolution Evolution (ontology continuity): Add new knowledge Revolution (ontology discontinuity): Modify existing knowledge Montassar Ben Messaoud, Mouna Ben Ishak, Philippe Leray, Nahla Ben Apprentissage Amor RB et Ontologies 15/22

27 Bayesian networks Ontologies BNs and Ontologies Ontology learning : subtasks Ontology population Get new instances of concept(s) already present in the ontology Ontology enrichment Update (add or modify) concepts, properties and relations in a given ontology Evolution vs. revolution Evolution (ontology continuity): Add new knowledge Revolution (ontology discontinuity): Modify existing knowledge Montassar Ben Messaoud, Mouna Ben Ishak, Philippe Leray, Nahla Ben Apprentissage Amor RB et Ontologies 15/22

28 Bayesian networks Ontologies BNs and Ontologies Ontology learning : methods [Buitelaar & Cimiano, 2008] Bridging the gap between text and knowledge Natural Language Processing [Buitelaar et al., 2003, Velardi et al., 2005] Concept extraction Taxonomy learning (is-a, part-of) Population (information extraction) Machine Learning Clustering for taxonomy learning [Bisson et al., 2000] Association rules for relation discovery [Madche & Staab, 2000] ILP for relation discovery [Rudolph et al., 2007] Montassar Ben Messaoud, Mouna Ben Ishak, Philippe Leray, Nahla Ben Apprentissage Amor RB et Ontologies 16/22

29 Bayesian networks Ontologies BNs and Ontologies Ontology learning : methods [Buitelaar & Cimiano, 2008] Bridging the gap between text and knowledge Natural Language Processing [Buitelaar et al., 2003, Velardi et al., 2005] Concept extraction Taxonomy learning (is-a, part-of) Population (information extraction) Machine Learning Clustering for taxonomy learning [Bisson et al., 2000] Association rules for relation discovery [Madche & Staab, 2000] ILP for relation discovery [Rudolph et al., 2007] Montassar Ben Messaoud, Mouna Ben Ishak, Philippe Leray, Nahla Ben Apprentissage Amor RB et Ontologies 16/22

30 Bayesian networks Ontologies BNs and Ontologies Ontology learning : methods [Buitelaar & Cimiano, 2008] Bridging the gap between text and knowledge Natural Language Processing [Buitelaar et al., 2003, Velardi et al., 2005] Concept extraction Taxonomy learning (is-a, part-of) Population (information extraction) Machine Learning Clustering for taxonomy learning [Bisson et al., 2000] Association rules for relation discovery [Madche & Staab, 2000] ILP for relation discovery [Rudolph et al., 2007] Montassar Ben Messaoud, Mouna Ben Ishak, Philippe Leray, Nahla Ben Apprentissage Amor RB et Ontologies 16/22

31 Bayesian networks Ontologies BNs and Ontologies Outline... 1 Bayesian networks BN definition BN learning 2 Ontologies Ontology definition Ontology learning 3 BNs and Ontologies Existing work Our proposal Montassar Ben Messaoud, Mouna Ben Ishak, Philippe Leray, Nahla Ben Apprentissage Amor RB et Ontologies 17/22

32 Bayesian networks Ontologies BNs and Ontologies Existing work Bayesian Network = Ontology BayesOWL [Ding & Peng, 2004] OntoBayes [Yang & Calmet, 2005] PR-OWL [Costa & Laskey, 2006] Use of BNs for probabilistic modeling and reasoning (no learning) Ontology = Bayesian Network BN basic construction using ontologies [Devitt et al., 2006] Ontology-based semi-automatic construction of BN in E-health applications [Jeon & Ko, 2007] Use of ontologies to manually build a BN (without data) Montassar Ben Messaoud, Mouna Ben Ishak, Philippe Leray, Nahla Ben Apprentissage Amor RB et Ontologies 18/22

33 Bayesian networks Ontologies BNs and Ontologies Existing work Bayesian Network = Ontology BayesOWL [Ding & Peng, 2004] OntoBayes [Yang & Calmet, 2005] PR-OWL [Costa & Laskey, 2006] Use of BNs for probabilistic modeling and reasoning (no learning) Ontology = Bayesian Network BN basic construction using ontologies [Devitt et al., 2006] Ontology-based semi-automatic construction of BN in E-health applications [Jeon & Ko, 2007] Use of ontologies to manually build a BN (without data) Montassar Ben Messaoud, Mouna Ben Ishak, Philippe Leray, Nahla Ben Apprentissage Amor RB et Ontologies 18/22

34 Bayesian networks Ontologies BNs and Ontologies Existing work Bayesian Network = Ontology BayesOWL [Ding & Peng, 2004] OntoBayes [Yang & Calmet, 2005] PR-OWL [Costa & Laskey, 2006] Use of BNs for probabilistic modeling and reasoning (no learning) Ontology = Bayesian Network BN basic construction using ontologies [Devitt et al., 2006] Ontology-based semi-automatic construction of BN in E-health applications [Jeon & Ko, 2007] Use of ontologies to manually build a BN (without data) Montassar Ben Messaoud, Mouna Ben Ishak, Philippe Leray, Nahla Ben Apprentissage Amor RB et Ontologies 18/22

35 Bayesian networks Ontologies BNs and Ontologies Existing work Bayesian Network = Ontology BayesOWL [Ding & Peng, 2004] OntoBayes [Yang & Calmet, 2005] PR-OWL [Costa & Laskey, 2006] Use of BNs for probabilistic modeling and reasoning (no learning) Ontology = Bayesian Network BN basic construction using ontologies [Devitt et al., 2006] Ontology-based semi-automatic construction of BN in E-health applications [Jeon & Ko, 2007] Use of ontologies to manually build a BN (without data) Montassar Ben Messaoud, Mouna Ben Ishak, Philippe Leray, Nahla Ben Apprentissage Amor RB et Ontologies 18/22

36 Bayesian networks Ontologies BNs and Ontologies Our proposal Bridging the gap between text and knowledge Ontology + data = Bayesian Network = Ontology BN Structure Learning 2 variations Ontology Evolution 1 Causal BNs more semantical causal discovery SemCaDo 1.0 [Ben Messaoud & al. 2009] causal discovery for ontology evolution SemCaDo 2.0 [Ben Messaoud & al., 2009] more 2 Object-Oriented BNs OOBN structure learning and ontology evolution O2C [Ben Ishak et al., 2011] more Montassar Ben Messaoud, Mouna Ben Ishak, Philippe Leray, Nahla Ben Apprentissage Amor RB et Ontologies 19/22

37 Bayesian networks Ontologies BNs and Ontologies Our proposal Bridging the gap between data and knowledge Ontology + data = Bayesian Network = Ontology BN Structure Learning 2 variations Ontology Evolution 1 Causal BNs more semantical causal discovery SemCaDo 1.0 [Ben Messaoud & al. 2009] causal discovery for ontology evolution SemCaDo 2.0 [Ben Messaoud & al., 2009] more 2 Object-Oriented BNs OOBN structure learning and ontology evolution O2C [Ben Ishak et al., 2011] more Montassar Ben Messaoud, Mouna Ben Ishak, Philippe Leray, Nahla Ben Apprentissage Amor RB et Ontologies 19/22

38 Bayesian networks Ontologies BNs and Ontologies Our proposal Bridging the gap between data and knowledge Ontology + data = Bayesian Network = Ontology BN Structure Learning 2 variations Ontology Evolution 1 Causal BNs more semantical causal discovery SemCaDo 1.0 [Ben Messaoud & al. 2009] causal discovery for ontology evolution SemCaDo 2.0 [Ben Messaoud & al., 2009] more 2 Object-Oriented BNs OOBN structure learning and ontology evolution O2C [Ben Ishak et al., 2011] more Montassar Ben Messaoud, Mouna Ben Ishak, Philippe Leray, Nahla Ben Apprentissage Amor RB et Ontologies 19/22

39 Bayesian networks Ontologies BNs and Ontologies Our proposal Bridging the gap between data and knowledge Ontology + data = Bayesian Network = Ontology BN Structure Learning 2 variations Ontology Evolution 1 Causal BNs more semantical causal discovery SemCaDo 1.0 [Ben Messaoud & al. 2009] causal discovery for ontology evolution SemCaDo 2.0 [Ben Messaoud & al., 2009] more 2 Object-Oriented BNs OOBN structure learning and ontology evolution O2C [Ben Ishak et al., 2011] more Montassar Ben Messaoud, Mouna Ben Ishak, Philippe Leray, Nahla Ben Apprentissage Amor RB et Ontologies 19/22

40 Bayesian networks Ontologies BNs and Ontologies Conclusion BN structure learning is NP hard, need of prior knowledge Ontology evolution is difficult, based on text-mining Cooperation can help for both tasks Originality of our proposal BN structure learning : use ontology instead of expert knowledge : separation between expert acquisition and structure learning Ontology evolution : use BN structure learning to directly discover relationships from data Difficulties No similar work or benchmark for a comparative study SemCaDo : one concept = attribute = node O2C : OOBN learning is too complex Montassar Ben Messaoud, Mouna Ben Ishak, Philippe Leray, Nahla Ben Apprentissage Amor RB et Ontologies 20/22

41 Bayesian networks Ontologies BNs and Ontologies Conclusion BN structure learning is NP hard, need of prior knowledge Ontology evolution is difficult, based on text-mining Cooperation can help for both tasks Originality of our proposal BN structure learning : use ontology instead of expert knowledge : separation between expert acquisition and structure learning Ontology evolution : use BN structure learning to directly discover relationships from data Difficulties No similar work or benchmark for a comparative study SemCaDo : one concept = attribute = node O2C : OOBN learning is too complex Montassar Ben Messaoud, Mouna Ben Ishak, Philippe Leray, Nahla Ben Apprentissage Amor RB et Ontologies 20/22

42 Bayesian networks Ontologies BNs and Ontologies Conclusion BN structure learning is NP hard, need of prior knowledge Ontology evolution is difficult, based on text-mining Cooperation can help for both tasks Originality of our proposal BN structure learning : use ontology instead of expert knowledge : separation between expert acquisition and structure learning Ontology evolution : use BN structure learning to directly discover relationships from data Difficulties No similar work or benchmark for a comparative study SemCaDo : one concept = attribute = node O2C : OOBN learning is too complex Montassar Ben Messaoud, Mouna Ben Ishak, Philippe Leray, Nahla Ben Apprentissage Amor RB et Ontologies 20/22

43 Bayesian networks Ontologies BNs and Ontologies Future works What next? Creation of benchmarks Generalize SemCaDo or O2C with more general models Ideas : Multi Entity BNs [Laskey, 2006] Relational BNs [Getoor et al., 2007] Montassar Ben Messaoud, Mouna Ben Ishak, Philippe Leray, Nahla Ben Apprentissage Amor RB et Ontologies 21/22

44 Bayesian networks Ontologies BNs and Ontologies Thank you for your attention Montassar Ben Messaoud, Mouna Ben Ishak, Philippe Leray, Nahla Ben Apprentissage Amor RB et Ontologies 22/22

45 Outline... 4 Bayesian network Structure Learning constraint-based methods score-based methods local search methods 5 Causal Bayesian Networks definition causal BN structure learning 6 SemCaDo algorithm definition experimental study 7 O2C algorithm definition algorithm Montassar Ben Messaoud, Mouna Ben Ishak, Philippe Leray, Nahla Ben Apprentissage Amor RB et Ontologies 1/49

46 Constraint-based methods How to search a good BN? Constraint-based methods BN = independence model find CI in data in order to build the DAG Score-based methods BN = probabilistic model that must fit data as well as possible search the DAG space in order to maximize a scoring function Hybrid methods Montassar Ben Messaoud, Mouna Ben Ishak, Philippe Leray, Nahla Ben Apprentissage Amor RB et Ontologies 2/49

47 Constraint-based methods Two reference algorithms Pearl et Verma : IC, IC* Spirtes, Glymour et Scheines : SGS, PC, CI, FCI Common principle Build an undirected graph describing direct dependences between variables (χ 2 tests) by adding edges (Pearl et Verma) by deleting edges (SGS) Detect V-structures (from previous statistical tests) Propagate some edge orientation (inferred edges) in order to obtain a CPDAG Montassar Ben Messaoud, Mouna Ben Ishak, Philippe Leray, Nahla Ben Apprentissage Amor RB et Ontologies 3/49

48 constraint-based methods Some inconvenients Reliability of CI test conditionally to several variables with a limited amount of data) SGS heuristic : if df < N 10, then declare dependence Combinatorial explosion of the number of tests PC heuristic : begin with order 0 (X A X B ) then order 1 (X A X B X C ), etc... Montassar Ben Messaoud, Mouna Ben Ishak, Philippe Leray, Nahla Ben Apprentissage Amor RB et Ontologies 4/49

49 PC algorithm Step 0 : undirected complete graph Left : target BN used to generate 5000 samples A S A S T L B T L B O O X D X D Montassar Ben Messaoud, Mouna Ben Ishak, Philippe Leray, Nahla Ben Apprentissage Amor RB et Ontologies 5/49

50 PC algorithm Step 1a : delete all order 0 independences discovered χ 2 : S A L A B A O A X A D A T S L T O B X B A S A S T L B T L B O O X D X D Montassar Ben Messaoud, Mouna Ben Ishak, Philippe Leray, Nahla Ben Apprentissage Amor RB et Ontologies 5/49

51 PC algorithm Step 1a : delete all order 1 independences discovered χ 2 : T A O O S L X S L B T S X T O D T O... A S A S T L B T L B O O X D X D Montassar Ben Messaoud, Mouna Ben Ishak, Philippe Leray, Nahla Ben Apprentissage Amor RB et Ontologies 5/49

52 PC algorithm Step 1a : delete all order 2 independences discovered χ 2 : D S {L, B} X O {T, L} D O {T, L} A S A S T L B T L B O O X D X D Montassar Ben Messaoud, Mouna Ben Ishak, Philippe Leray, Nahla Ben Apprentissage Amor RB et Ontologies 5/49

53 PC algorithm Step 2 : research V-structures χ 2 : one V-structure T O L is discovered A S A S T L B T L B O O X D X D Step 3 : inferred edges no one in this example Montassar Ben Messaoud, Mouna Ben Ishak, Philippe Leray, Nahla Ben Apprentissage Amor RB et Ontologies 5/49

54 PC algorithm From CPDAG to DAG Orientation of the remaining undirected edges (only constraint : do not create any new V-structure) A S A S T L B T L B O O X D X D Montassar Ben Messaoud, Mouna Ben Ishak, Philippe Leray, Nahla Ben Apprentissage Amor RB et Ontologies 5/49

55 PC algorithm Obtained DAG versus target one χ 2 test with 5000 samples fails to discover A T, O X and O D A S A S T L B T L B O O X D X D Montassar Ben Messaoud, Mouna Ben Ishak, Philippe Leray, Nahla Ben Apprentissage Amor RB et Ontologies 5/49

56 Score-based methods How to search a good BN? Constraint-based methods BN = independence model find CI in data in order to build the DAG Score-based methods BN = probabilistic model that must fit data as well as possible search the DAG space in order to maximize a scoring function Hybrid methods Montassar Ben Messaoud, Mouna Ben Ishak, Philippe Leray, Nahla Ben Apprentissage Amor RB et Ontologies 6/49

57 Notion of score General principle : Occam razor Pluralitas non est ponenda sine neccesitate (La pluralité (des notions) ne devrait pas être posée sans nécessité) plurality should not be posited without necessity Frustra fit per plura quod potest fieri per pauciora (C est en vain que l on fait avec plusieurs ce que l on peut faire avec un petit nombre) It is pointless to do with more what can be done with fewer = Parcimony principle : find a model Fitting the data D : likelihood : L(D θ, B) The simplest possible : dimension of B : Dim(B) Montassar Ben Messaoud, Mouna Ben Ishak, Philippe Leray, Nahla Ben Apprentissage Amor RB et Ontologies 7/49

58 Score examples AIC and BIC Compromise between likelihood and complexity Application of AIC (Akaïke 70) and BIC (Schwartz 78) criteria S AIC (B, D) = log L(D θ MV, B) Dim(B) S BIC (B, D) = log L(D θ MV, B) 1 Dim(B) log N 2 Bayesian scores : BD, BDe, BDeu S BD (B, D) = P(B, D) (Cooper et Herskovits 92) BDe = BD + score equivalence (Heckerman 94) S BD (B, D) = P(B) n q i i=1 j=1 Γ(α ij ) Γ(N ij + α ij ) r i k=1 Γ(N ijk + α ijk ) Γ(α ijk ) Montassar Ben Messaoud, Mouna Ben Ishak, Philippe Leray, Nahla Ben Apprentissage Amor RB et Ontologies 8/49

59 Score properties Two important properties Decomposability (Global)Score(B, D) = n (local)score(x i, pa i ) i=1 Score equivalence If two BN B 1 and B 2 are Markov equivalent then S(B 1, D) = S(B 2, D) Montassar Ben Messaoud, Mouna Ben Ishak, Philippe Leray, Nahla Ben Apprentissage Amor RB et Ontologies 9/49

60 Score properties Two important properties Decomposability (Global)Score(B, D) = n (local)score(x i, pa i ) i=1 Score equivalence If two BN B 1 and B 2 are Markov equivalent then S(B 1, D) = S(B 2, D) Montassar Ben Messaoud, Mouna Ben Ishak, Philippe Leray, Nahla Ben Apprentissage Amor RB et Ontologies 9/49

61 Heuristic exploration of search space Search space and heuristics B space restriction to tree space : Chow&Liu, MWST DAG with node ordering : K2 algorithm greedy search genetic algorithms,... E space greedy equivalence search Montassar Ben Messaoud, Mouna Ben Ishak, Philippe Leray, Nahla Ben Apprentissage Amor RB et Ontologies 10/49

62 Restriction to tree space Principle What is the best tree connecting all the nodes, i.e. maximizing a weight defined for each possible edge? Answer : maximal weighted spanning tree (MWST) weight = mutual information [Chow & Liu, 1968] W (X A, X B ) = a,b N ab N log N abn N a. N.b weight = any local score variation [Heckerman, 1994] W (X A, X B ) = score(x A, Pa(X A ) = X B ) score(x A, ) Montassar Ben Messaoud, Mouna Ben Ishak, Philippe Leray, Nahla Ben Apprentissage Amor RB et Ontologies 11/49

63 Restriction to tree space Principle What is the best tree connecting all the nodes, i.e. maximizing a weight defined for each possible edge? Answer : maximal weighted spanning tree (MWST) weight = mutual information [Chow & Liu, 1968] W (X A, X B ) = a,b N ab N log N abn N a. N.b weight = any local score variation [Heckerman, 1994] W (X A, X B ) = score(x A, Pa(X A ) = X B ) score(x A, ) Montassar Ben Messaoud, Mouna Ben Ishak, Philippe Leray, Nahla Ben Apprentissage Amor RB et Ontologies 11/49

64 Restriction to tree space Remarks MWST returns an undirected tree This undirected tree = CPDAG of all the directed tree with this skeleton Obtain a directed tree by (randomly) choosing one root and orienting the edges with a depth first search over this tree Montassar Ben Messaoud, Mouna Ben Ishak, Philippe Leray, Nahla Ben Apprentissage Amor RB et Ontologies 12/49

65 Example : obtained DAG vs. target one A S A S T L B T L B O O X D X D MWST can not discover cycles neither V-structures (tree space!) Montassar Ben Messaoud, Mouna Ben Ishak, Philippe Leray, Nahla Ben Apprentissage Amor RB et Ontologies 13/49

66 Heuristic exploration of search space Search space and heuristics B space restriction to tree space : Chow&Liu, MWST DAG with node ordering : K2 algorithm greedy search genetic algorithms,... E space greedy equivalence search Montassar Ben Messaoud, Mouna Ben Ishak, Philippe Leray, Nahla Ben Apprentissage Amor RB et Ontologies 14/49

67 Greedy search Principle Exploration of the search space with traversal operators add edge invert edge delete edge and respect the DAG definition (no cycle) Exploration can begin from any given DAG Montassar Ben Messaoud, Mouna Ben Ishak, Philippe Leray, Nahla Ben Apprentissage Amor RB et Ontologies 15/49

68 Example : obtained DAG vs. target one A S A S T L B T L B O O X D X D start = empty graph. GS result = local optimum :-( Montassar Ben Messaoud, Mouna Ben Ishak, Philippe Leray, Nahla Ben Apprentissage Amor RB et Ontologies 16/49

69 Example : obtained DAG vs. target one A S A S T L B T L B O O X D X D start = MWST result. GS result is better Montassar Ben Messaoud, Mouna Ben Ishak, Philippe Leray, Nahla Ben Apprentissage Amor RB et Ontologies 16/49

70 Heuristic exploration of search space Search space and heuristics B space restriction to tree space : Chow&Liu, MWST DAG with node ordering : K2 algorithm greedy search genetic algorithms,... E space greedy equivalence search Montassar Ben Messaoud, Mouna Ben Ishak, Philippe Leray, Nahla Ben Apprentissage Amor RB et Ontologies 17/49

71 What about changing our search space Preliminaries IC/PC result = CPDAG MWST result = CPDAG Score-based methods do not distinguish equivalent DAGs Search in E E = CPDAG space Better properties : YES 2 equivalent structures = 1 unique structure in E Better size : NO E size is quasi similar to DAG space asymptotic ratio is 3,7 : [Gillispie & Perlman, 2001] Montassar Ben Messaoud, Mouna Ben Ishak, Philippe Leray, Nahla Ben Apprentissage Amor RB et Ontologies 18/49

72 Greedy Equivalent Search Principe [Chickering, 2002] Greedy search in E Phase 1 : add edges until convergence Phase 2 : delete edges until convergence Montassar Ben Messaoud, Mouna Ben Ishak, Philippe Leray, Nahla Ben Apprentissage Amor RB et Ontologies 19/49

73 Add edge examples in E neighborhood of E 0 neighborhood of E 1 X 1 X 2 X 1 X 2 X 1 X 2 X 1 X 2 X 1 X 2 X 3 X 4 X 3 X 4 X 3 X 4 X 3 X 4 X 3 X 4 X 1 X 2 X 1 X 2 X 1 X 2 X 1 X 2 X 1 X 2 X 1 X 2 E 0 X 3 X 4 X 3 X 4 X 3 X 4 X 3 X 4 X 3 X 4 X 3 X 4 X 1 X 2 X 1 X 2 X 1 X 2 X 1 X 2 X 1 X 2 E 1 X 3 X 4 X 3 X 4 X 3 X 4 X 3 X 4 X 3 X 4 Montassar Ben Messaoud, Mouna Ben Ishak, Philippe Leray, Nahla Ben Apprentissage Amor RB et Ontologies 20/49

74 Score-based methods How to search a good BN? Constraint-based methods BN = independence model find CI in data in order to build the DAG Score-based methods BN = probabilistic model that must fit data as well as possible search the DAG space in order to maximize a scoring/fitness function Hybrid methods Montassar Ben Messaoud, Mouna Ben Ishak, Philippe Leray, Nahla Ben Apprentissage Amor RB et Ontologies 21/49

75 Hybrid methods = local search methods Local search and global learning Search one local neighborhood for a given node T Reiterate for each T Learn the global structure with these local informations which neighborhood? PC(T ) : Parents and Children T (without distinction) MB(T ) : Markov Blanket of T - Parents, children and spouses Montassar Ben Messaoud, Mouna Ben Ishak, Philippe Leray, Nahla Ben Apprentissage Amor RB et Ontologies 22/49

76 Hybrid methods = local search methods Local search and global learning Search one local neighborhood for a given node T Reiterate for each T Learn the global structure with these local informations which neighborhood? PC(T ) : Parents and Children T (without distinction) MB(T ) : Markov Blanket of T - Parents, children and spouses Montassar Ben Messaoud, Mouna Ben Ishak, Philippe Leray, Nahla Ben Apprentissage Amor RB et Ontologies 22/49

77 Local search identification Identification of MB(T) or PC(T) IAMB [Aliferis et al., 2002] MMPC [Tsamardinos et al., 2003],... Hybrid structure learning algorithms MMHC [Tsamardinos et al., 2006] = MMPC + Greedy search back Montassar Ben Messaoud, Mouna Ben Ishak, Philippe Leray, Nahla Ben Apprentissage Amor RB et Ontologies 23/49

78 Outline... 4 Bayesian network Structure Learning constraint-based methods score-based methods local search methods 5 Causal Bayesian Networks definition causal BN structure learning 6 SemCaDo algorithm definition experimental study 7 O2C algorithm definition algorithm Montassar Ben Messaoud, Mouna Ben Ishak, Philippe Leray, Nahla Ben Apprentissage Amor RB et Ontologies 24/49

79 A BN is not a causal model Usual BN A B does not imply direct causal relationship between A and B, Only edges from the CPDAG represent causal relationships Confusion When the DAG is given by an expert, this graph is very often causal When the DAG is learnt from data, no reason to be causal! Causal BN Each A B represents one direct causal relationship, i.e. A is the direct cause which generates B Montassar Ben Messaoud, Mouna Ben Ishak, Philippe Leray, Nahla Ben Apprentissage Amor RB et Ontologies 25/49

80 A BN is not a causal model Usual BN A B does not imply direct causal relationship between A and B, Only edges from the CPDAG represent causal relationships Confusion When the DAG is given by an expert, this graph is very often causal When the DAG is learnt from data, no reason to be causal! Causal BN Each A B represents one direct causal relationship, i.e. A is the direct cause which generates B Montassar Ben Messaoud, Mouna Ben Ishak, Philippe Leray, Nahla Ben Apprentissage Amor RB et Ontologies 25/49

81 A BN is not a causal model Usual BN A B does not imply direct causal relationship between A and B, Only edges from the CPDAG represent causal relationships Confusion When the DAG is given by an expert, this graph is very often causal When the DAG is learnt from data, no reason to be causal! Causal BN Each A B represents one direct causal relationship, i.e. A is the direct cause which generates B Montassar Ben Messaoud, Mouna Ben Ishak, Philippe Leray, Nahla Ben Apprentissage Amor RB et Ontologies 25/49

82 Intervention vs. Observation Probabilistic inference : we observe B = b, we compute P(A B = b) Causal inference [Pearl 00]: we manipulate/intervene upon B : do(b = b) example with A B P(A do(b = b)) = P(A), P(B do(a = a)) = P(B A = a) example with A B P(A do(b = b)) = P(A B = b), P(B do(a = a)) = P(B) Montassar Ben Messaoud, Mouna Ben Ishak, Philippe Leray, Nahla Ben Apprentissage Amor RB et Ontologies 26/49

83 Causal structure learning Usual situation : observational data whatever the method, the right result is the CPDAG partial determination of the causal structure How to find a full causal graph? Use only experimental data, and decide at every step the more interesting experiment to realize (active learning [Murphy, 2001],...) Use only observational data, for a very specific distribution (LiNGAM models [Hoyer et al., 2008]) Another idea: MyCaDo algorithm [Meganck et al., 2006] Use (already existing) observational data to find the CPDAG Complete the orientation with experimental data Montassar Ben Messaoud, Mouna Ben Ishak, Philippe Leray, Nahla Ben Apprentissage Amor RB et Ontologies 27/49

84 Causal structure learning Usual situation : observational data whatever the method, the right result is the CPDAG partial determination of the causal structure How to find a full causal graph? Use only experimental data, and decide at every step the more interesting experiment to realize (active learning [Murphy, 2001],...) Use only observational data, for a very specific distribution (LiNGAM models [Hoyer et al., 2008]) Another idea: MyCaDo algorithm [Meganck et al., 2006] Use (already existing) observational data to find the CPDAG Complete the orientation with experimental data Montassar Ben Messaoud, Mouna Ben Ishak, Philippe Leray, Nahla Ben Apprentissage Amor RB et Ontologies 27/49

85 Causal structure learning Usual situation : observational data whatever the method, the right result is the CPDAG partial determination of the causal structure How to find a full causal graph? Use only experimental data, and decide at every step the more interesting experiment to realize (active learning [Murphy, 2001],...) Use only observational data, for a very specific distribution (LiNGAM models [Hoyer et al., 2008]) Another idea: MyCaDo algorithm [Meganck et al., 2006] Use (already existing) observational data to find the CPDAG Complete the orientation with experimental data Montassar Ben Messaoud, Mouna Ben Ishak, Philippe Leray, Nahla Ben Apprentissage Amor RB et Ontologies 27/49

86 MyCaDo algorithm Données d'observation Algorithme d'apprentissage de structure d'un RB Système Données expérimentales CPDAG Choix de l'exp. Réalisation de l'exp. Analyse des résultats Réseau Bayésien causal Algorithme MyCaDo (1) Choice of the experiment = what variable M manipulate? the one potentially orienting more edges by taking into account experiment/observation cost back Montassar Ben Messaoud, Mouna Ben Ishak, Philippe Leray, Nahla Ben Apprentissage Amor RB et Ontologies 28/49

87 MyCaDo algorithm Données d'observation Algorithme d'apprentissage de structure d'un RB Système Données expérimentales CPDAG Choix de l'exp. Réalisation de l'exp. Analyse des résultats Réseau Bayésien causal (2) Experimentation Algorithme MyCaDo do(m = m) for all possible values m observe all candidate variables C (C M) back Montassar Ben Messaoud, Mouna Ben Ishak, Philippe Leray, Nahla Ben Apprentissage Amor RB et Ontologies 28/49

88 MyCaDo algorithm Données d'observation Algorithme d'apprentissage de structure d'un RB Système Données expérimentales CPDAG Choix de l'exp. Réalisation de l'exp. Analyse des résultats Réseau Bayésien causal Algorithme MyCaDo (3) Result analysis : P(C M) (obs.) P(C do(m)) (exp.)? if equal C M else M C orient some other edges by applying specific rules back Montassar Ben Messaoud, Mouna Ben Ishak, Philippe Leray, Nahla Ben Apprentissage Amor RB et Ontologies 28/49

89 Outline... 4 Bayesian network Structure Learning constraint-based methods score-based methods local search methods 5 Causal Bayesian Networks definition causal BN structure learning 6 SemCaDo algorithm definition experimental study 7 O2C algorithm definition algorithm Montassar Ben Messaoud, Mouna Ben Ishak, Philippe Leray, Nahla Ben Apprentissage Amor RB et Ontologies 29/49

90 SemCaDo assumptions BN Ontology general assumptions Nodes Concepts Random variables Concept attributes Causal dependencies Semantic causal relations Data Concept-attribute instances SemCaDo specific assumptions Causal relations concern concepts sharing the same semantic type Ontology continuity (evolution) Montassar Ben Messaoud, Mouna Ben Ishak, Philippe Leray, Nahla Ben Apprentissage Amor RB et Ontologies 30/49

91 SemCaDo assumptions BN Ontology general assumptions Nodes Concepts Random variables Concept attributes Causal dependencies Semantic causal relations Data Concept-attribute instances SemCaDo specific assumptions Causal relations concern concepts sharing the same semantic type Ontology continuity (evolution) Montassar Ben Messaoud, Mouna Ben Ishak, Philippe Leray, Nahla Ben Apprentissage Amor RB et Ontologies 30/49

92 Example Catastrophes is-a is-a Man-made Natural is-a is-a is-a is-a is-a is-a Flood Fire Earthquake Landslide Volcano Tsunami Causes Causes Causes Causes Causes Causes Our causal BN will represent the grey part of the ontology Montassar Ben Messaoud, Mouna Ben Ishak, Philippe Leray, Nahla Ben Apprentissage Amor RB et Ontologies 31/49

93 SemCaDo three main steps Domain ontology Inter. Data 1 st step Structure learning Choice of experience 3 rd step Ontology evolution Perform the experience Obs. Data PDAG Analyse the results CBN 2 nd step Causal discovery Enriched ontology Montassar Ben Messaoud, Mouna Ben Ishak, Philippe Leray, Nahla Ben Apprentissage Amor RB et Ontologies 32/49

94 1 st step: initial BN structure learning Extraction of the causal relationships (R ) from the ontology Integration of these edge as constraints in the structure learning algorithm [De Campos & al., 2007] Continuity : these edges will not be questioned during learning Interest Ontology helps in reducing the search task complexity Montassar Ben Messaoud, Mouna Ben Ishak, Philippe Leray, Nahla Ben Apprentissage Amor RB et Ontologies 33/49

95 1 st step: initial BN structure learning Extraction of the causal relationships (R ) from the ontology Integration of these edge as constraints in the structure learning algorithm [De Campos & al., 2007] Continuity : these edges will not be questioned during learning Interest Ontology helps in reducing the search task complexity Montassar Ben Messaoud, Mouna Ben Ishak, Philippe Leray, Nahla Ben Apprentissage Amor RB et Ontologies 33/49

96 2 nd step: serendipitous causal discovery Experimentations are needed in order to find the causal orientation of some edges Our solution : MyCaDo [Meganck & al., 2006], iterative causal discovery process Adaptation to take into account ontological knowledge : Rada distance on H between one set of concepts and their most specific common subsumer Interest Ontology helps in potentially orienting the more unexpected (serendipitous) links Montassar Ben Messaoud, Mouna Ben Ishak, Philippe Leray, Nahla Ben Apprentissage Amor RB et Ontologies 34/49

97 2 nd step: serendipitous causal discovery Experimentations are needed in order to find the causal orientation of some edges Our solution : MyCaDo [Meganck & al., 2006], iterative causal discovery process Adaptation to take into account ontological knowledge : Rada distance on H between one set of concepts and their most specific common subsumer Interest Ontology helps in potentially orienting the more unexpected (serendipitous) links Montassar Ben Messaoud, Mouna Ben Ishak, Philippe Leray, Nahla Ben Apprentissage Amor RB et Ontologies 34/49

98 3 rd Step: Ontology evolution process [Stojanovic et al., 2002] Interest BN structure learning from data helps in discovering new relations in the ontology Montassar Ben Messaoud, Mouna Ben Ishak, Philippe Leray, Nahla Ben Apprentissage Amor RB et Ontologies 35/49

99 3 rd Step: Ontology evolution process [Stojanovic et al., 2002] Interest BN structure learning from data helps in discovering new relations in the ontology Montassar Ben Messaoud, Mouna Ben Ishak, Philippe Leray, Nahla Ben Apprentissage Amor RB et Ontologies 35/49

100 Experimental study (1) Benchmark : no existing real benchmark & system :-( BN graph : random generation (50 to 200 nodes) Ontology : Causal relationships : BN edges Hierarchy of concept : generation by clustering BN nodes Data is generated by using BN as a generative model Experimental protocol Hierarchy of concepts and 10% to 40% of existing causal relationships are given as inputs Semantic gain : cumulative Rada distance of the discovered relationships Montassar Ben Messaoud, Mouna Ben Ishak, Philippe Leray, Nahla Ben Apprentissage Amor RB et Ontologies 36/49

101 Experimental study (1) Benchmark : no existing real benchmark & system :-( BN graph : random generation (50 to 200 nodes) Ontology : Causal relationships : BN edges Hierarchy of concept : generation by clustering BN nodes Data is generated by using BN as a generative model Experimental protocol Hierarchy of concepts and 10% to 40% of existing causal relationships are given as inputs Semantic gain : cumulative Rada distance of the discovered relationships Montassar Ben Messaoud, Mouna Ben Ishak, Philippe Leray, Nahla Ben Apprentissage Amor RB et Ontologies 36/49

102 Experimental study (2) Ontology helps structure learning SemCaDo performs better in less steps than MyCaDo back Montassar Ben Messaoud, Mouna Ben Ishak, Philippe Leray, Nahla Ben Apprentissage Amor RB et Ontologies 37/49

103 Experimental study (2) Structure learning helps ontology evolution Original causal discoveries are discovered first and can be added to the ontology back Montassar Ben Messaoud, Mouna Ben Ishak, Philippe Leray, Nahla Ben Apprentissage Amor RB et Ontologies 37/49

104 Outline... 4 Bayesian network Structure Learning constraint-based methods score-based methods local search methods 5 Causal Bayesian Networks definition causal BN structure learning 6 SemCaDo algorithm definition experimental study 7 O2C algorithm definition algorithm Montassar Ben Messaoud, Mouna Ben Ishak, Philippe Leray, Nahla Ben Apprentissage Amor RB et Ontologies 38/49

105 Object Oriented Bayesian Networks An extension of BNs using the object paradigm [Bangsø and Wuillemin, 2000a; Bangsø and Wuillemin, 2000b; Koller and Pfeffer, 1997] Support several aspects of the object oriented modeling. (e.g., inheritance, instantiation) Designed to model large and complex domains OOBN structure learning : OO-SEM This algorithm [Langseth and Nielsen, 2003] is based on 2 steps Generation of a prior OOBN based on a prior expert knowledge Grouping nodes into instantiations and instantiations into classes Giving a prior information about the candidate interfaces Adaptation of Structural EM algorithm to learn the final structure Montassar Ben Messaoud, Mouna Ben Ishak, Philippe Leray, Nahla Ben Apprentissage Amor RB et Ontologies 39/49

106 Object Oriented Bayesian Networks An extension of BNs using the object paradigm [Bangsø and Wuillemin, 2000a; Bangsø and Wuillemin, 2000b; Koller and Pfeffer, 1997] Support several aspects of the object oriented modeling. (e.g., inheritance, instantiation) Designed to model large and complex domains OOBN structure learning : OO-SEM This algorithm [Langseth and Nielsen, 2003] is based on 2 steps Generation of a prior OOBN based on a prior expert knowledge Grouping nodes into instantiations and instantiations into classes Giving a prior information about the candidate interfaces Adaptation of Structural EM algorithm to learn the final structure Montassar Ben Messaoud, Mouna Ben Ishak, Philippe Leray, Nahla Ben Apprentissage Amor RB et Ontologies 39/49

107 OOBN structure learning Drawbacks This prior knowledge is not always obvious to obtain The expert should be familiar with the object oriented modeling Our idea Harness ontologies representation capabilities in order to generate the prior OOBN structure Ontologies Concepts Cp Properties P cpi Inheritance relations H R Semantic relations S R OOBNs Classes Real nodes Class hierarchies Links/ Interfaces Montassar Ben Messaoud, Mouna Ben Ishak, Philippe Leray, Nahla Ben Apprentissage Amor RB et Ontologies 40/49

108 OOBN structure learning Drawbacks This prior knowledge is not always obvious to obtain The expert should be familiar with the object oriented modeling Our idea Harness ontologies representation capabilities in order to generate the prior OOBN structure Ontologies Concepts Cp Properties P cpi Inheritance relations H R Semantic relations S R OOBNs Classes Real nodes Class hierarchies Links/ Interfaces Montassar Ben Messaoud, Mouna Ben Ishak, Philippe Leray, Nahla Ben Apprentissage Amor RB et Ontologies 40/49

109 The 2OC approach Ontology Morphing process Prior OOBN Learning process Data (1) ontology to prior OOBN [Ben Ishak et al., 2011a] Final OOBN Change detection process Proposals of evolution it seems interesting to refine the ontology used initially! (2) final OOBN to ontology [Ben Ishak et al., 2011b] Montassar Ben Messaoud, Mouna Ben Ishak, Philippe Leray, Nahla Ben Apprentissage Amor RB et Ontologies 41/49

Bayesian Networks and Statistical Learning Applications to complex system modelling and diagnosis

Bayesian Networks and Statistical Learning Applications to complex system modelling and diagnosis Bayesian Networks and Statistical Learning Applications to complex system modelling and diagnosis Philippe LERAY, Olivier François, Ahmad Faour contact: Philippe.Leray@insa-rouen.fr PSI (Perception, Systems

More information

A Comparison of Novel and State-of-the-Art Polynomial Bayesian Network Learning Algorithms

A Comparison of Novel and State-of-the-Art Polynomial Bayesian Network Learning Algorithms A Comparison of Novel and State-of-the-Art Polynomial Bayesian Network Learning Algorithms Laura E. Brown Ioannis Tsamardinos Constantin F. Aliferis Vanderbilt University, Department of Biomedical Informatics

More information

A Hybrid Anytime Algorithm for the Construction of Causal Models From Sparse Data.

A Hybrid Anytime Algorithm for the Construction of Causal Models From Sparse Data. 142 A Hybrid Anytime Algorithm for the Construction of Causal Models From Sparse Data. Denver Dash t Department of Physics and Astronomy and Decision Systems Laboratory University of Pittsburgh Pittsburgh,

More information

The Basics of Graphical Models

The Basics of Graphical Models The Basics of Graphical Models David M. Blei Columbia University October 3, 2015 Introduction These notes follow Chapter 2 of An Introduction to Probabilistic Graphical Models by Michael Jordan. Many figures

More information

Bayesian Networks. Mausam (Slides by UW-AI faculty)

Bayesian Networks. Mausam (Slides by UW-AI faculty) Bayesian Networks Mausam (Slides by UW-AI faculty) Bayes Nets In general, joint distribution P over set of variables (X 1 x... x X n ) requires exponential space for representation & inference BNs provide

More information

Big Data, Machine Learning, Causal Models

Big Data, Machine Learning, Causal Models Big Data, Machine Learning, Causal Models Sargur N. Srihari University at Buffalo, The State University of New York USA Int. Conf. on Signal and Image Processing, Bangalore January 2014 1 Plan of Discussion

More information

Scaling Bayesian Network Parameter Learning with Expectation Maximization using MapReduce

Scaling Bayesian Network Parameter Learning with Expectation Maximization using MapReduce Scaling Bayesian Network Parameter Learning with Expectation Maximization using MapReduce Erik B. Reed Carnegie Mellon University Silicon Valley Campus NASA Research Park Moffett Field, CA 94035 erikreed@cmu.edu

More information

Bayesian Networks and Classifiers in Project Management

Bayesian Networks and Classifiers in Project Management Bayesian Networks and Classifiers in Project Management Daniel Rodríguez 1, Javier Dolado 2 and Manoranjan Satpathy 1 1 Dept. of Computer Science The University of Reading Reading, RG6 6AY, UK drg@ieee.org,

More information

Protein Protein Interaction Networks

Protein Protein Interaction Networks Functional Pattern Mining from Genome Scale Protein Protein Interaction Networks Young-Rae Cho, Ph.D. Assistant Professor Department of Computer Science Baylor University it My Definition of Bioinformatics

More information

Bayesian Networks. Read R&N Ch. 14.1-14.2. Next lecture: Read R&N 18.1-18.4

Bayesian Networks. Read R&N Ch. 14.1-14.2. Next lecture: Read R&N 18.1-18.4 Bayesian Networks Read R&N Ch. 14.1-14.2 Next lecture: Read R&N 18.1-18.4 You will be expected to know Basic concepts and vocabulary of Bayesian networks. Nodes represent random variables. Directed arcs

More information

Causal Discovery Using A Bayesian Local Causal Discovery Algorithm

Causal Discovery Using A Bayesian Local Causal Discovery Algorithm MEDINFO 2004 M. Fieschi et al. (Eds) Amsterdam: IOS Press 2004 IMIA. All rights reserved Causal Discovery Using A Bayesian Local Causal Discovery Algorithm Subramani Mani a,b, Gregory F. Cooper a mani@cs.uwm.edu

More information

Building Large-Scale Bayesian Networks

Building Large-Scale Bayesian Networks Building Large-Scale Bayesian Networks Martin Neil 1, Norman Fenton 1 and Lars Nielsen 2 1 Risk Assessment and Decision Analysis Research (RADAR) group, Computer Science Department, Queen Mary and Westfield

More information

How To Find Influence Between Two Concepts In A Network

How To Find Influence Between Two Concepts In A Network 2014 UKSim-AMSS 16th International Conference on Computer Modelling and Simulation Influence Discovery in Semantic Networks: An Initial Approach Marcello Trovati and Ovidiu Bagdasar School of Computing

More information

3. The Junction Tree Algorithms

3. The Junction Tree Algorithms A Short Course on Graphical Models 3. The Junction Tree Algorithms Mark Paskin mark@paskin.org 1 Review: conditional independence Two random variables X and Y are independent (written X Y ) iff p X ( )

More information

Some Research Challenges for Big Data Analytics of Intelligent Security

Some Research Challenges for Big Data Analytics of Intelligent Security Some Research Challenges for Big Data Analytics of Intelligent Security Yuh-Jong Hu hu at cs.nccu.edu.tw Emerging Network Technology (ENT) Lab. Department of Computer Science National Chengchi University,

More information

Chapter 28. Bayesian Networks

Chapter 28. Bayesian Networks Chapter 28. Bayesian Networks The Quest for Artificial Intelligence, Nilsson, N. J., 2009. Lecture Notes on Artificial Intelligence, Spring 2012 Summarized by Kim, Byoung-Hee and Lim, Byoung-Kwon Biointelligence

More information

Center for Causal Discovery (CCD) of Biomedical Knowledge from Big Data University of Pittsburgh Carnegie Mellon University Pittsburgh Supercomputing

Center for Causal Discovery (CCD) of Biomedical Knowledge from Big Data University of Pittsburgh Carnegie Mellon University Pittsburgh Supercomputing Center for Causal Discovery (CCD) of Biomedical Knowledge from Big Data University of Pittsburgh Carnegie Mellon University Pittsburgh Supercomputing Center Yale University PIs: Ivet Bahar, Jeremy Berg,

More information

Using Bayesian Networks to Analyze Expression Data ABSTRACT

Using Bayesian Networks to Analyze Expression Data ABSTRACT JOURNAL OF COMPUTATIONAL BIOLOGY Volume 7, Numbers 3/4, 2 Mary Ann Liebert, Inc. Pp. 6 62 Using Bayesian Networks to Analyze Expression Data NIR FRIEDMAN, MICHAL LINIAL, 2 IFTACH NACHMAN, 3 and DANA PE

More information

Learning Instance-Specific Predictive Models

Learning Instance-Specific Predictive Models Journal of Machine Learning Research 11 (2010) 3333-3369 Submitted 3/09; Revised 7/10; Published 12/10 Learning Instance-Specific Predictive Models Shyam Visweswaran Gregory F. Cooper Department of Biomedical

More information

FUZZY CLUSTERING ANALYSIS OF DATA MINING: APPLICATION TO AN ACCIDENT MINING SYSTEM

FUZZY CLUSTERING ANALYSIS OF DATA MINING: APPLICATION TO AN ACCIDENT MINING SYSTEM International Journal of Innovative Computing, Information and Control ICIC International c 0 ISSN 34-48 Volume 8, Number 8, August 0 pp. 4 FUZZY CLUSTERING ANALYSIS OF DATA MINING: APPLICATION TO AN ACCIDENT

More information

Course: Model, Learning, and Inference: Lecture 5

Course: Model, Learning, and Inference: Lecture 5 Course: Model, Learning, and Inference: Lecture 5 Alan Yuille Department of Statistics, UCLA Los Angeles, CA 90095 yuille@stat.ucla.edu Abstract Probability distributions on structured representation.

More information

LaPIe: Collective Communications adapted to Grid Environments

LaPIe: Collective Communications adapted to Grid Environments LaPIe: Collective Communications adapted to Grid Environments Luiz Angelo Barchet-Estefanel Thesis Supervisor: M Denis TRYSTRAM Co-Supervisor: M Grégory MOUNIE ID-IMAG Laboratory Grenoble - France LaPIe:

More information

Learning a Bayesian network model for predicting wildfire behavior

Learning a Bayesian network model for predicting wildfire behavior Learning a Bayesian network model for predicting wildfire behavior K. Zwirglmaier, P. Papakosta, D. Straub Engineering Risk Analysis Group, Technische Universität München, Germany ABSTRACT: A Bayesian

More information

Integration of ACO in a Constraint Programming Language

Integration of ACO in a Constraint Programming Language Integration of ACO in a Constraint Programming Language Madjid Khichane 12, Patrick Albert 1, and Christine Solnon 2 1 ILOG 2 LIRIS, UMR 5205 CNRS / University of Lyon ANTS 08 Motivations Ant Colony Optimization

More information

What mathematical optimization can, and cannot, do for biologists. Steven Kelk Department of Knowledge Engineering (DKE) Maastricht University, NL

What mathematical optimization can, and cannot, do for biologists. Steven Kelk Department of Knowledge Engineering (DKE) Maastricht University, NL What mathematical optimization can, and cannot, do for biologists Steven Kelk Department of Knowledge Engineering (DKE) Maastricht University, NL Introduction There is no shortage of literature about the

More information

Model-Based Cluster Analysis for Web Users Sessions

Model-Based Cluster Analysis for Web Users Sessions Model-Based Cluster Analysis for Web Users Sessions George Pallis, Lefteris Angelis, and Athena Vakali Department of Informatics, Aristotle University of Thessaloniki, 54124, Thessaloniki, Greece gpallis@ccf.auth.gr

More information

Sanjeev Kumar. contribute

Sanjeev Kumar. contribute RESEARCH ISSUES IN DATAA MINING Sanjeev Kumar I.A.S.R.I., Library Avenue, Pusa, New Delhi-110012 sanjeevk@iasri.res.in 1. Introduction The field of data mining and knowledgee discovery is emerging as a

More information

A Systemic Artificial Intelligence (AI) Approach to Difficult Text Analytics Tasks

A Systemic Artificial Intelligence (AI) Approach to Difficult Text Analytics Tasks A Systemic Artificial Intelligence (AI) Approach to Difficult Text Analytics Tasks Text Analytics World, Boston, 2013 Lars Hard, CTO Agenda Difficult text analytics tasks Feature extraction Bio-inspired

More information

131-1. Adding New Level in KDD to Make the Web Usage Mining More Efficient. Abstract. 1. Introduction [1]. 1/10

131-1. Adding New Level in KDD to Make the Web Usage Mining More Efficient. Abstract. 1. Introduction [1]. 1/10 1/10 131-1 Adding New Level in KDD to Make the Web Usage Mining More Efficient Mohammad Ala a AL_Hamami PHD Student, Lecturer m_ah_1@yahoocom Soukaena Hassan Hashem PHD Student, Lecturer soukaena_hassan@yahoocom

More information

Large-Sample Learning of Bayesian Networks is NP-Hard

Large-Sample Learning of Bayesian Networks is NP-Hard Journal of Machine Learning Research 5 (2004) 1287 1330 Submitted 3/04; Published 10/04 Large-Sample Learning of Bayesian Networks is NP-Hard David Maxwell Chickering David Heckerman Christopher Meek Microsoft

More information

A Framework for Ontology-Based Knowledge Management System

A Framework for Ontology-Based Knowledge Management System A Framework for Ontology-Based Knowledge Management System Jiangning WU Institute of Systems Engineering, Dalian University of Technology, Dalian, 116024, China E-mail: jnwu@dlut.edu.cn Abstract Knowledge

More information

01219211 Software Development Training Camp 1 (0-3) Prerequisite : 01204214 Program development skill enhancement camp, at least 48 person-hours.

01219211 Software Development Training Camp 1 (0-3) Prerequisite : 01204214 Program development skill enhancement camp, at least 48 person-hours. (International Program) 01219141 Object-Oriented Modeling and Programming 3 (3-0) Object concepts, object-oriented design and analysis, object-oriented analysis relating to developing conceptual models

More information

Reusable Knowledge-based Components for Building Software. Applications: A Knowledge Modelling Approach

Reusable Knowledge-based Components for Building Software. Applications: A Knowledge Modelling Approach Reusable Knowledge-based Components for Building Software Applications: A Knowledge Modelling Approach Martin Molina, Jose L. Sierra, Jose Cuena Department of Artificial Intelligence, Technical University

More information

6.852: Distributed Algorithms Fall, 2009. Class 2

6.852: Distributed Algorithms Fall, 2009. Class 2 .8: Distributed Algorithms Fall, 009 Class Today s plan Leader election in a synchronous ring: Lower bound for comparison-based algorithms. Basic computation in general synchronous networks: Leader election

More information

THE EVOLUTION OF DATA CENTER MANAGEMENT WITH BUSINESS SERVICE MONITORING & ITIL

THE EVOLUTION OF DATA CENTER MANAGEMENT WITH BUSINESS SERVICE MONITORING & ITIL THE EVOLUTION OF DATA CENTER MANAGEMENT WITH BUSINESS SERVICE MONITORING & ITIL SUMMARY The infrastructure in a modern corporate data center manages critical business information for the enterprise. In

More information

Principles of Data Mining by Hand&Mannila&Smyth

Principles of Data Mining by Hand&Mannila&Smyth Principles of Data Mining by Hand&Mannila&Smyth Slides for Textbook Ari Visa,, Institute of Signal Processing Tampere University of Technology October 4, 2010 Data Mining: Concepts and Techniques 1 Differences

More information

Classification algorithm in Data mining: An Overview

Classification algorithm in Data mining: An Overview Classification algorithm in Data mining: An Overview S.Neelamegam #1, Dr.E.Ramaraj *2 #1 M.phil Scholar, Department of Computer Science and Engineering, Alagappa University, Karaikudi. *2 Professor, Department

More information

UNIVERSITY OF LYON DOCTORAL SCHOOL OF COMPUTER SCIENCES AND MATHEMATICS P H D T H E S I S. Specialty : Computer Science. Author

UNIVERSITY OF LYON DOCTORAL SCHOOL OF COMPUTER SCIENCES AND MATHEMATICS P H D T H E S I S. Specialty : Computer Science. Author UNIVERSITY OF LYON DOCTORAL SCHOOL OF COMPUTER SCIENCES AND MATHEMATICS P H D T H E S I S Specialty : Computer Science Author Sérgio Rodrigues de Morais on November 16, 29 Bayesian Network Structure Learning

More information

Doctor of Philosophy in Computer Science

Doctor of Philosophy in Computer Science Doctor of Philosophy in Computer Science Background/Rationale The program aims to develop computer scientists who are armed with methods, tools and techniques from both theoretical and systems aspects

More information

Recommender Systems: Content-based, Knowledge-based, Hybrid. Radek Pelánek

Recommender Systems: Content-based, Knowledge-based, Hybrid. Radek Pelánek Recommender Systems: Content-based, Knowledge-based, Hybrid Radek Pelánek 2015 Today lecture, basic principles: content-based knowledge-based hybrid, choice of approach,... critiquing, explanations,...

More information

Application of Adaptive Probing for Fault Diagnosis in Computer Networks 1

Application of Adaptive Probing for Fault Diagnosis in Computer Networks 1 Application of Adaptive Probing for Fault Diagnosis in Computer Networks 1 Maitreya Natu Dept. of Computer and Information Sciences University of Delaware, Newark, DE, USA, 19716 Email: natu@cis.udel.edu

More information

A Learning Based Method for Super-Resolution of Low Resolution Images

A Learning Based Method for Super-Resolution of Low Resolution Images A Learning Based Method for Super-Resolution of Low Resolution Images Emre Ugur June 1, 2004 emre.ugur@ceng.metu.edu.tr Abstract The main objective of this project is the study of a learning based method

More information

INFORMATION SECURITY RISK ASSESSMENT UNDER UNCERTAINTY USING DYNAMIC BAYESIAN NETWORKS

INFORMATION SECURITY RISK ASSESSMENT UNDER UNCERTAINTY USING DYNAMIC BAYESIAN NETWORKS INFORMATION SECURITY RISK ASSESSMENT UNDER UNCERTAINTY USING DYNAMIC BAYESIAN NETWORKS R. Sarala 1, M.Kayalvizhi 2, G.Zayaraz 3 1 Associate Professor, Computer Science and Engineering, Pondicherry Engineering

More information

An Introduction to the Use of Bayesian Network to Analyze Gene Expression Data

An Introduction to the Use of Bayesian Network to Analyze Gene Expression Data n Introduction to the Use of ayesian Network to nalyze Gene Expression Data Cristina Manfredotti Dipartimento di Informatica, Sistemistica e Comunicazione (D.I.S.Co. Università degli Studi Milano-icocca

More information

Agenda. Interface Agents. Interface Agents

Agenda. Interface Agents. Interface Agents Agenda Marcelo G. Armentano Problem Overview Interface Agents Probabilistic approach Monitoring user actions Model of the application Model of user intentions Example Summary ISISTAN Research Institute

More information

! " # The Logic of Descriptions. Logics for Data and Knowledge Representation. Terminology. Overview. Three Basic Features. Some History on DLs

!  # The Logic of Descriptions. Logics for Data and Knowledge Representation. Terminology. Overview. Three Basic Features. Some History on DLs ,!0((,.+#$),%$(-&.& *,2(-$)%&2.'3&%!&, Logics for Data and Knowledge Representation Alessandro Agostini agostini@dit.unitn.it University of Trento Fausto Giunchiglia fausto@dit.unitn.it The Logic of Descriptions!$%&'()*$#)

More information

PUBLIC HEALTH OPTOMETRY ECONOMICS. Kevin D. Frick, PhD

PUBLIC HEALTH OPTOMETRY ECONOMICS. Kevin D. Frick, PhD Chapter Overview PUBLIC HEALTH OPTOMETRY ECONOMICS Kevin D. Frick, PhD This chapter on public health optometry economics describes the positive and normative uses of economic science. The terms positive

More information

Decision Trees and Networks

Decision Trees and Networks Lecture 21: Uncertainty 6 Today s Lecture Victor R. Lesser CMPSCI 683 Fall 2010 Decision Trees and Networks Decision Trees A decision tree is an explicit representation of all the possible scenarios from

More information

Web-Based Genomic Information Integration with Gene Ontology

Web-Based Genomic Information Integration with Gene Ontology Web-Based Genomic Information Integration with Gene Ontology Kai Xu 1 IMAGEN group, National ICT Australia, Sydney, Australia, kai.xu@nicta.com.au Abstract. Despite the dramatic growth of online genomic

More information

How To Develop Software

How To Develop Software Software Engineering Prof. N.L. Sarda Computer Science & Engineering Indian Institute of Technology, Bombay Lecture-4 Overview of Phases (Part - II) We studied the problem definition phase, with which

More information

Study Manual. Probabilistic Reasoning. Silja Renooij. August 2015

Study Manual. Probabilistic Reasoning. Silja Renooij. August 2015 Study Manual Probabilistic Reasoning 2015 2016 Silja Renooij August 2015 General information This study manual was designed to help guide your self studies. As such, it does not include material that is

More information

Reverse Engineering of Relational Databases to Ontologies: An Approach Based on an Analysis of HTML Forms

Reverse Engineering of Relational Databases to Ontologies: An Approach Based on an Analysis of HTML Forms Reverse Engineering of Relational Databases to Ontologies: An Approach Based on an Analysis of HTML Forms Irina Astrova 1, Bela Stantic 2 1 Tallinn University of Technology, Ehitajate tee 5, 19086 Tallinn,

More information

Static Program Transformations for Efficient Software Model Checking

Static Program Transformations for Efficient Software Model Checking Static Program Transformations for Efficient Software Model Checking Shobha Vasudevan Jacob Abraham The University of Texas at Austin Dependable Systems Large and complex systems Software faults are major

More information

DETERMINING THE CONDITIONAL PROBABILITIES IN BAYESIAN NETWORKS

DETERMINING THE CONDITIONAL PROBABILITIES IN BAYESIAN NETWORKS Hacettepe Journal of Mathematics and Statistics Volume 33 (2004), 69 76 DETERMINING THE CONDITIONAL PROBABILITIES IN BAYESIAN NETWORKS Hülya Olmuş and S. Oral Erbaş Received 22 : 07 : 2003 : Accepted 04

More information

Solving the Vehicle Routing Problem with Multiple Trips by Adaptive Memory Programming

Solving the Vehicle Routing Problem with Multiple Trips by Adaptive Memory Programming Solving the Vehicle Routing Problem with Multiple Trips by Adaptive Memory Programming Alfredo Olivera and Omar Viera Universidad de la República Montevideo, Uruguay ICIL 05, Montevideo, Uruguay, February

More information

Comparison of K-means and Backpropagation Data Mining Algorithms

Comparison of K-means and Backpropagation Data Mining Algorithms Comparison of K-means and Backpropagation Data Mining Algorithms Nitu Mathuriya, Dr. Ashish Bansal Abstract Data mining has got more and more mature as a field of basic research in computer science and

More information

Life of A Knowledge Base (KB)

Life of A Knowledge Base (KB) Life of A Knowledge Base (KB) A knowledge base system is a special kind of database management system to for knowledge base management. KB extraction: knowledge extraction using statistical models in NLP/ML

More information

Process Mining. ^J Springer. Discovery, Conformance and Enhancement of Business Processes. Wil M.R van der Aalst Q UNIVERS1TAT.

Process Mining. ^J Springer. Discovery, Conformance and Enhancement of Business Processes. Wil M.R van der Aalst Q UNIVERS1TAT. Wil M.R van der Aalst Process Mining Discovery, Conformance and Enhancement of Business Processes Q UNIVERS1TAT m LIECHTENSTEIN Bibliothek ^J Springer Contents 1 Introduction I 1.1 Data Explosion I 1.2

More information

Ant Colony Optimization and Constraint Programming

Ant Colony Optimization and Constraint Programming Ant Colony Optimization and Constraint Programming Christine Solnon Series Editor Narendra Jussien WILEY Table of Contents Foreword Acknowledgements xi xiii Chapter 1. Introduction 1 1.1. Overview of the

More information

Compression algorithm for Bayesian network modeling of binary systems

Compression algorithm for Bayesian network modeling of binary systems Compression algorithm for Bayesian network modeling of binary systems I. Tien & A. Der Kiureghian University of California, Berkeley ABSTRACT: A Bayesian network (BN) is a useful tool for analyzing the

More information

Detection. Perspective. Network Anomaly. Bhattacharyya. Jugal. A Machine Learning »C) Dhruba Kumar. Kumar KaKta. CRC Press J Taylor & Francis Croup

Detection. Perspective. Network Anomaly. Bhattacharyya. Jugal. A Machine Learning »C) Dhruba Kumar. Kumar KaKta. CRC Press J Taylor & Francis Croup Network Anomaly Detection A Machine Learning Perspective Dhruba Kumar Bhattacharyya Jugal Kumar KaKta»C) CRC Press J Taylor & Francis Croup Boca Raton London New York CRC Press is an imprint of the Taylor

More information

How To Understand The Network Of A Network

How To Understand The Network Of A Network Roles in Networks Roles in Networks Motivation for work: Let topology define network roles. Work by Kleinberg on directed graphs, used topology to define two types of roles: authorities and hubs. (Each

More information

Model Averaging for Prediction with Discrete Bayesian Networks

Model Averaging for Prediction with Discrete Bayesian Networks Journal of Machine Learning Research 5 (2004) 1177 1203 Submitted 11/03; Revised 9/04; Published 9/04 Model Averaging for Prediction with Discrete Bayesian Networks Denver Dash Intel Research SC12-303

More information

A SOCIAL NETWORK ANALYSIS APPROACH TO ANALYZE ROAD NETWORKS INTRODUCTION

A SOCIAL NETWORK ANALYSIS APPROACH TO ANALYZE ROAD NETWORKS INTRODUCTION A SOCIAL NETWORK ANALYSIS APPROACH TO ANALYZE ROAD NETWORKS Kyoungjin Park Alper Yilmaz Photogrammetric and Computer Vision Lab Ohio State University park.764@osu.edu yilmaz.15@osu.edu ABSTRACT Depending

More information

Chapter 6: Episode discovery process

Chapter 6: Episode discovery process Chapter 6: Episode discovery process Algorithmic Methods of Data Mining, Fall 2005, Chapter 6: Episode discovery process 1 6. Episode discovery process The knowledge discovery process KDD process of analyzing

More information

Data Mining - Evaluation of Classifiers

Data Mining - Evaluation of Classifiers Data Mining - Evaluation of Classifiers Lecturer: JERZY STEFANOWSKI Institute of Computing Sciences Poznan University of Technology Poznan, Poland Lecture 4 SE Master Course 2008/2009 revised for 2010

More information

Estimating Software Reliability In the Absence of Data

Estimating Software Reliability In the Absence of Data Estimating Software Reliability In the Absence of Data Joanne Bechta Dugan (jbd@virginia.edu) Ganesh J. Pai (gpai@virginia.edu) Department of ECE University of Virginia, Charlottesville, VA NASA OSMA SAS

More information

STATISTICA. Clustering Techniques. Case Study: Defining Clusters of Shopping Center Patrons. and

STATISTICA. Clustering Techniques. Case Study: Defining Clusters of Shopping Center Patrons. and Clustering Techniques and STATISTICA Case Study: Defining Clusters of Shopping Center Patrons STATISTICA Solutions for Business Intelligence, Data Mining, Quality Control, and Web-based Analytics Table

More information

{ Mining, Sets, of, Patterns }

{ Mining, Sets, of, Patterns } { Mining, Sets, of, Patterns } A tutorial at ECMLPKDD2010 September 20, 2010, Barcelona, Spain by B. Bringmann, S. Nijssen, N. Tatti, J. Vreeken, A. Zimmermann 1 Overview Tutorial 00:00 00:45 Introduction

More information

TAMRI: A Tool for Supporting Task Distribution in Global Software Development Projects

TAMRI: A Tool for Supporting Task Distribution in Global Software Development Projects TAMRI: A Tool for Supporting Task Distribution in Global Software Development Projects REMIDI 2009, Limerick, Ireland Ansgar Lamersdorf a_lamers@informatik.uni-kl.de Jürgen Münch Juergen.Muench@iese.fraunhofer.de

More information

New Work Item for ISO 3534-5 Predictive Analytics (Initial Notes and Thoughts) Introduction

New Work Item for ISO 3534-5 Predictive Analytics (Initial Notes and Thoughts) Introduction Introduction New Work Item for ISO 3534-5 Predictive Analytics (Initial Notes and Thoughts) Predictive analytics encompasses the body of statistical knowledge supporting the analysis of massive data sets.

More information

Classification-oriented structure learning in Bayesian networks for multimodal event detection in videos

Classification-oriented structure learning in Bayesian networks for multimodal event detection in videos Classification-oriented structure learning in Bayesian networks for multimodal event detection in videos Guillaume Gravier, Claire-Hélène Demarty, Siwar Baghdadi, Patrick Gros To cite this version: Guillaume

More information

MODELING CLIENT INFORMATION SYSTEMS USING BAYESIAN NETWORKS. Main results of the PhD Thesis. Gábor Vámos

MODELING CLIENT INFORMATION SYSTEMS USING BAYESIAN NETWORKS. Main results of the PhD Thesis. Gábor Vámos Budapest University of Technology and Economics Faculty of Electrical Engineering and Informatics Department of Control Engineering and Information Technology MODELING CLIENT INFORMATION SYSTEMS USING

More information

STIC-AMSUD project meeting, Recife, Brazil, July 2008. Quality Management. Current work and perspectives at InCo

STIC-AMSUD project meeting, Recife, Brazil, July 2008. Quality Management. Current work and perspectives at InCo InCo Universidad de la República STIC-AMSUD project meeting, Recife, Brazil, July 2008 Quality Management Current work and perspectives at InCo Lorena Etcheverry, Laura González, Adriana Marotta, Verónika

More information

Towards Rule-based System for the Assembly of 3D Bricks

Towards Rule-based System for the Assembly of 3D Bricks Universal Journal of Communications and Network 3(4): 77-81, 2015 DOI: 10.13189/ujcn.2015.030401 http://www.hrpub.org Towards Rule-based System for the Assembly of 3D Bricks Sanguk Noh School of Computer

More information

72. Ontology Driven Knowledge Discovery Process: a proposal to integrate Ontology Engineering and KDD

72. Ontology Driven Knowledge Discovery Process: a proposal to integrate Ontology Engineering and KDD 72. Ontology Driven Knowledge Discovery Process: a proposal to integrate Ontology Engineering and KDD Paulo Gottgtroy Auckland University of Technology Paulo.gottgtroy@aut.ac.nz Abstract This paper is

More information

A Peer-to-peer Extension of Network-Enabled Server Systems

A Peer-to-peer Extension of Network-Enabled Server Systems A Peer-to-peer Extension of Network-Enabled Server Systems Eddy Caron 1, Frédéric Desprez 1, Cédric Tedeschi 1 Franck Petit 2 1 - GRAAL Project / LIP laboratory 2 - LaRIA laboratory E-Science 2005 - December

More information

Dynamic Load Balancing in Charm++ Abhinav S Bhatele Parallel Programming Lab, UIUC

Dynamic Load Balancing in Charm++ Abhinav S Bhatele Parallel Programming Lab, UIUC Dynamic Load Balancing in Charm++ Abhinav S Bhatele Parallel Programming Lab, UIUC Outline Dynamic Load Balancing framework in Charm++ Measurement Based Load Balancing Examples: Hybrid Load Balancers Topology-aware

More information

BUSINESS RULES AND GAP ANALYSIS

BUSINESS RULES AND GAP ANALYSIS Leading the Evolution WHITE PAPER BUSINESS RULES AND GAP ANALYSIS Discovery and management of business rules avoids business disruptions WHITE PAPER BUSINESS RULES AND GAP ANALYSIS Business Situation More

More information

Heuristics for Dynamically Adapting Constraint Propagation in Constraint Programming

Heuristics for Dynamically Adapting Constraint Propagation in Constraint Programming Heuristics for Dynamically Adapting Constraint Propagation in Constraint Programming Kostas Stergiou AI Lab University of the Aegean Greece CPAIOR 09 Workshop on Bound reduction techniques for CP and MINLP

More information

Big Data Science. Prof. Lise Getoor University of Maryland, College Park. http://www.cs.umd.edu/~getoor. October 17, 2013

Big Data Science. Prof. Lise Getoor University of Maryland, College Park. http://www.cs.umd.edu/~getoor. October 17, 2013 Big Data Science Prof Lise Getoor University of Maryland, College Park October 17, 2013 http://wwwcsumdedu/~getoor BIG Data is not flat 2004-2013 lonnitaylor Data is multi-modal, multi-relational, spatio-temporal,

More information

IE 680 Special Topics in Production Systems: Networks, Routing and Logistics*

IE 680 Special Topics in Production Systems: Networks, Routing and Logistics* IE 680 Special Topics in Production Systems: Networks, Routing and Logistics* Rakesh Nagi Department of Industrial Engineering University at Buffalo (SUNY) *Lecture notes from Network Flows by Ahuja, Magnanti

More information

The Classes P and NP. mohamed@elwakil.net

The Classes P and NP. mohamed@elwakil.net Intractable Problems The Classes P and NP Mohamed M. El Wakil mohamed@elwakil.net 1 Agenda 1. What is a problem? 2. Decidable or not? 3. The P class 4. The NP Class 5. TheNP Complete class 2 What is a

More information

Overview of major concepts in the service oriented extended OeBTO

Overview of major concepts in the service oriented extended OeBTO Modelling business policies and behaviour based on extended Open edi Business Transaction Ontology (OeBTO) Introduction Model Driven Development (MDD) provides a basis for the alignment between business

More information

Traffic Engineering for Multiple Spanning Tree Protocol in Large Data Centers

Traffic Engineering for Multiple Spanning Tree Protocol in Large Data Centers Traffic Engineering for Multiple Spanning Tree Protocol in Large Data Centers Ho Trong Viet, Yves Deville, Olivier Bonaventure, Pierre François ICTEAM, Université catholique de Louvain (UCL), Belgium.

More information

AUTOMATION OF ENERGY DEMAND FORECASTING. Sanzad Siddique, B.S.

AUTOMATION OF ENERGY DEMAND FORECASTING. Sanzad Siddique, B.S. AUTOMATION OF ENERGY DEMAND FORECASTING by Sanzad Siddique, B.S. A Thesis submitted to the Faculty of the Graduate School, Marquette University, in Partial Fulfillment of the Requirements for the Degree

More information

Process Mining by Measuring Process Block Similarity

Process Mining by Measuring Process Block Similarity Process Mining by Measuring Process Block Similarity Joonsoo Bae, James Caverlee 2, Ling Liu 2, Bill Rouse 2, Hua Yan 2 Dept of Industrial & Sys Eng, Chonbuk National Univ, South Korea jsbae@chonbukackr

More information

NEW VERSION OF DECISION SUPPORT SYSTEM FOR EVALUATING TAKEOVER BIDS IN PRIVATIZATION OF THE PUBLIC ENTERPRISES AND SERVICES

NEW VERSION OF DECISION SUPPORT SYSTEM FOR EVALUATING TAKEOVER BIDS IN PRIVATIZATION OF THE PUBLIC ENTERPRISES AND SERVICES NEW VERSION OF DECISION SUPPORT SYSTEM FOR EVALUATING TAKEOVER BIDS IN PRIVATIZATION OF THE PUBLIC ENTERPRISES AND SERVICES Silvija Vlah Kristina Soric Visnja Vojvodic Rosenzweig Department of Mathematics

More information

Effect of Using Neural Networks in GA-Based School Timetabling

Effect of Using Neural Networks in GA-Based School Timetabling Effect of Using Neural Networks in GA-Based School Timetabling JANIS ZUTERS Department of Computer Science University of Latvia Raina bulv. 19, Riga, LV-1050 LATVIA janis.zuters@lu.lv Abstract: - The school

More information

XPC Algorithms For Structured Learning in Bayesian Networks

XPC Algorithms For Structured Learning in Bayesian Networks An Algorithm to Handle Structural Uncertainties in Learning Bayesian Network Cleuber Moreira Fernandes Politec Informática Ltda SIG quadra 4 lote 173, Zip Code 70.610-440, Brasília, DF, Brazil cleuber@compeduc.net

More information

Expert Systems. A knowledge-based approach to intelligent systems. Intelligent System. Motivation. Approaches & Ingredients. Terminology.

Expert Systems. A knowledge-based approach to intelligent systems. Intelligent System. Motivation. Approaches & Ingredients. Terminology. Motivation Expert Systems A knowledge-based approach to intelligent systems Intelligent System Peter Lucas Department of Information and Knowledge Systems Institute for Computing and Information Sciences

More information

LABEL PROPAGATION ON GRAPHS. SEMI-SUPERVISED LEARNING. ----Changsheng Liu 10-30-2014

LABEL PROPAGATION ON GRAPHS. SEMI-SUPERVISED LEARNING. ----Changsheng Liu 10-30-2014 LABEL PROPAGATION ON GRAPHS. SEMI-SUPERVISED LEARNING ----Changsheng Liu 10-30-2014 Agenda Semi Supervised Learning Topics in Semi Supervised Learning Label Propagation Local and global consistency Graph

More information

Unsupervised learning: Clustering

Unsupervised learning: Clustering Unsupervised learning: Clustering Salissou Moutari Centre for Statistical Science and Operational Research CenSSOR 17 th September 2013 Unsupervised learning: Clustering 1/52 Outline 1 Introduction What

More information

Data mining and official statistics

Data mining and official statistics Quinta Conferenza Nazionale di Statistica Data mining and official statistics Gilbert Saporta président de la Société française de statistique 5@ S Roma 15, 16, 17 novembre 2000 Palazzo dei Congressi Piazzale

More information

Chapter 6: distributed systems

Chapter 6: distributed systems Chapter 6: distributed systems Strongly related to communication between processes is the issue of how processes in distributed systems synchronize. Synchronization is all about doing the right thing at

More information

The Relationship between Heuristic, Causal and Statistical Models of Knowledge, and Big Data

The Relationship between Heuristic, Causal and Statistical Models of Knowledge, and Big Data 2014 KIE Conference: Knowledge, Innovation and Entrepreneurship Theme: Knowledge The Relationship between Heuristic, Causal and Statistical Models of Knowledge, and Big Data D. Graham Faculty of Business

More information

Data Mining and Knowledge Discovery in Databases (KDD) State of the Art. Prof. Dr. T. Nouri Computer Science Department FHNW Switzerland

Data Mining and Knowledge Discovery in Databases (KDD) State of the Art. Prof. Dr. T. Nouri Computer Science Department FHNW Switzerland Data Mining and Knowledge Discovery in Databases (KDD) State of the Art Prof. Dr. T. Nouri Computer Science Department FHNW Switzerland 1 Conference overview 1. Overview of KDD and data mining 2. Data

More information

The University of Jordan

The University of Jordan The University of Jordan Master in Web Intelligence Non Thesis Department of Business Information Technology King Abdullah II School for Information Technology The University of Jordan 1 STUDY PLAN MASTER'S

More information

Ontology and automatic code generation on modeling and simulation

Ontology and automatic code generation on modeling and simulation Ontology and automatic code generation on modeling and simulation Youcef Gheraibia Computing Department University Md Messadia Souk Ahras, 41000, Algeria youcef.gheraibia@gmail.com Abdelhabib Bourouis

More information

Join Bayes Nets: A New Type of Bayes net for Relational Data

Join Bayes Nets: A New Type of Bayes net for Relational Data Join Bayes Nets: A New Type of Bayes net for Relational Data Bayes nets, statistical-relational learning, knowledge discovery in databases Abstract Many real-world data are maintained in relational format,

More information