Community structure in resting state complex networks



Similar documents
Tensor Decomposition Approaches for the Modeling of Multi-Graphs

Curriculum vitae, Mikkel N. Schmidt

Dirichlet Processes A gentle tutorial

A Systemic Artificial Intelligence (AI) Approach to Difficult Text Analytics Tasks

Section for Cognitive Systems DTU Informatics, Technical University of Denmark

Projects at the Danish Research Centre for Magnetic Resonance

Statistical Models in Data Mining

Learning Gaussian process models from big data. Alan Qi Purdue University Joint work with Z. Xu, F. Yan, B. Dai, and Y. Zhu

Bayesian Statistics: Indian Buffet Process

8. Machine Learning Applied Artificial Intelligence

Obtaining Knowledge. Lecture 7 Methods of Scientific Observation and Analysis in Behavioral Psychology and Neuropsychology.

Predicting the Risk of Heart Attacks using Neural Network and Decision Tree

The Wondrous World of fmri statistics

How To Use Neural Networks In Data Mining

Lecture/Recitation Topic SMA 5303 L1 Sampling and statistical distributions

Statistical Considerations in Magnetic Resonance Imaging of Brain Function

Bayesian networks - Time-series models - Apache Spark & Scala

How to report the percentage of explained common variance in exploratory factor analysis

Similarity Search and Mining in Uncertain Spatial and Spatio Temporal Databases. Andreas Züfle

Clarify Some Issues on the Sparse Bayesian Learning for Sparse Signal Recovery

Web Document Clustering

Network Analysis I & II

Sanjeev Kumar. contribute

Stabilization by Conceptual Duplication in Adaptive Resonance Theory

Graduate Co-op Students Information Manual. Department of Computer Science. Faculty of Science. University of Regina

Learning outcomes. Knowledge and understanding. Competence and skills

Part 2: Community Detection

Cognitive Neuroscience. Questions. Multiple Methods. Electrophysiology. Multiple Methods. Approaches to Thinking about the Mind

Machine Learning Introduction

Keywords: Mobility Prediction, Location Prediction, Data Mining etc

DATA MINING TECHNIQUES AND APPLICATIONS

Preventing Denial-of-request Inference Attacks in Location-sharing Services

IJCSES Vol.7 No.4 October 2013 pp Serials Publications BEHAVIOR PERDITION VIA MINING SOCIAL DIMENSIONS

Specific Usage of Visual Data Analysis Techniques

International Journal of Computer Science Trends and Technology (IJCST) Volume 2 Issue 3, May-Jun 2014

Data quality in Accounting Information Systems

Internet Investigations of Text Material to Compare Programs Across Institutions

University of California San Francisco, CA, USA 4 Department of Veterans Affairs Medical Center, San Francisco, CA, USA

Adam Anthony Baldwin-Wallace College Voice: (440) Department of Mathematics and Computer Science 275 Eastland Rd

Data Science Center Eindhoven. Big Data: Challenges and Opportunities for Mathematicians. Alessandro Di Bucchianico

Complex Network Analysis of Brain Connectivity: An Introduction LABREPORT 5

Machine Learning and Statistics: What s the Connection?

Tensor Factorization for Multi-Relational Learning

Quality Control of National Genetic Evaluation Results Using Data-Mining Techniques; A Progress Report

EM Clustering Approach for Multi-Dimensional Analysis of Big Data Set

HT2015: SC4 Statistical Data Mining and Machine Learning

Chapter 28. Bayesian Networks

Machine Learning and Data Analysis overview. Department of Cybernetics, Czech Technical University in Prague.

Update: MRI in Multiple sclerosis

MarsBaR Documentation

runl I IUI%I/\L Magnetic Resonance Imaging

Crowdclustering with Sparse Pairwise Labels: A Matrix Completion Approach

Data Mining for Manufacturing: Preventive Maintenance, Failure Prediction, Quality Control

NEW VERSION OF DECISION SUPPORT SYSTEM FOR EVALUATING TAKEOVER BIDS IN PRIVATIZATION OF THE PUBLIC ENTERPRISES AND SERVICES

Volume 4, Issue 1, January 2016 International Journal of Advance Research in Computer Science and Management Studies

A STUDY ON DATA MINING INVESTIGATING ITS METHODS, APPROACHES AND APPLICATIONS

SPECIAL PERTURBATIONS UNCORRELATED TRACK PROCESSING

The data set we have taken is about calculating body fat percentage for an individual.

Probabilistic Evidential Reasoning with Symbolic Argumentation for Space Situation Awareness

A Bayesian hierarchical surrogate outcome model for multiple sclerosis

Prediction of Heart Disease Using Naïve Bayes Algorithm

Scaling Bayesian Network Parameter Learning with Expectation Maximization using MapReduce

Latent Dirichlet Markov Allocation for Sentiment Analysis

FUZZY CLUSTERING ANALYSIS OF DATA MINING: APPLICATION TO AN ACCIDENT MINING SYSTEM

The multilayer sentiment analysis model based on Random forest Wei Liu1, Jie Zhang2

Self-Organising Data Mining

FastForward I/O and Storage: ACG 6.6 Demonstration

Learning diagnostic diagrams in transport-based data-collection systems

Comparative Analysis of EM Clustering Algorithm and Density Based Clustering Algorithm Using WEKA tool.

Software Packages The following data analysis software packages will be showcased:

Estimation of Sympathetic and Parasympathetic Level during Orthostatic Stress using Artificial Neural Networks

An Overview of Knowledge Discovery Database and Data mining Techniques

Statistics Graduate Courses

MapReduce Approach to Collective Classification for Networks

2. MATERIALS AND METHODS

Lecture 9: Introduction to Pattern Analysis

Classification algorithm in Data mining: An Overview

Projects at the Danish Research Centre for Magnetic Resonance

Static Data Mining Algorithm with Progressive Approach for Mining Knowledge

EFFICIENCY OF DECISION TREES IN PREDICTING STUDENT S ACADEMIC PERFORMANCE

The Chinese Restaurant Process

Topic models for Sentiment analysis: A Literature Survey

A Spectral Clustering Approach to Validating Sensors via Their Peers in Distributed Sensor Networks

7 The use of fmri. to detect neural responses to cognitive tasks: is there confounding by task related changes in heart rate?

MRI Brain Tumor Segmentation Using Improved ACO

Methods of Data Analysis Working with probability distributions

The Open University s repository of research publications and other research outputs

Tracking Groups of Pedestrians in Video Sequences

FACULTY OF ALLIED HEALTH SCIENCES

Healthcare Measurement Analysis Using Data mining Techniques

Software Development Cost and Time Forecasting Using a High Performance Artificial Neural Network Model

Service courses for graduate students in degree programs other than the MS or PhD programs in Biostatistics.

RESTing-state fmri data analysis toolkit (REST) Manual

Price Prediction of Share Market using Artificial Neural Network (ANN)

Data Mining. Cluster Analysis: Advanced Concepts and Algorithms

Free software solutions for MEG/EEG source imaging

Exponential Random Graph Models for Social Network Analysis. Danny Wyatt 590AI March 6, 2009

Bayesian Penalized Methods for High Dimensional Data

The spam vs. non-spam s classifier

A Connectivity-Based Popularity Prediction Approach for Social Networks

Transcription:

Community structure in resting state complex networks Kasper Winther Andersen 1,2 ; Kristoffer H. Madsen 2 ; Hartwig Roman Siebner 2,3,4 ; Mikkel N. Schmidt 1 ; Morten Mørup 1 ; Lars Kai Hansen 1 1: Department of Applied Mathematics and Computer Science, Technical University of Denmark, Matematiktorvet, Bygning 303 B, 2800 Kgs. Lyngby, Denmark 2: Danish Research Centre for Magnetic Resonance, Centre for Functional and Diagnostic Imaging and Research, Copenhagen University Hospital Hvidovre, Kettegaard Alle 30, 2650 Hvidovre, Denmark 3: Department of Clinical Medicine, Faculty of Health and Medical Sciences, University of Copenhagen, Blegdamsvej 3B, 2200 København N, Denmark 4: Department of Neurology, Copenhagen University Hospital Bispebjerg, Bispebjerg Bakke 23, 2400 København NV, Denmark

Analysis of resting state functional magnetic resonance imaging (rs-fmri) using graph theory is of increasing interest. Here, brain regions are represented as nodes and links between nodes are present if the statistical dependence of the time courses exceeds a given threshold. So-called communities can be found as clusters of nodes with high internal linking and sparse linking to the rest of the network and the degree to which a network can be divided into such communities can be measured using the modularity index [1]. Modularity has shown to be related to e.g. normal aging [2], working memory [3], and motor learning [4]. However, it is still unknown whether modularity reflects the best grouping of nodes. Here we use the Infinite Relational Model (IRM) [5,6] and two constrained variants thereof and test their clustering reproducibility and ability to predict unseen data. Three rs-fmri datasets were used. The Copenhagen set was acquired locally and consists of 30 healthy controls (HC), TR=2.49s, 480 vol. In addition we used two datasets from the FCON1000 database. The Leipzig set consists of 37 HC, TR=2.3s, 195 vol. and Beijing Zhang part2 sets consists of 42 HC, TR=2s, and 225 vol. Data preprocessing included realignment and normalization to MNI305 space. The Copenhagen data was filtered using physiological (cardiac, respiration, motion) regressors [7]. The mean time series for each of the 116 brain regions included in the AAL atlas [8] was extracted for each subject and their correlation matrix was formed. The brain networks were created by thresholding these matrices individually to maintain the top d-percent links (Fig. 1). IRM is a nonparametric Bayesian Network model, which can be used to infer node clustering. Contrary to modularity, the IRM models both within-cluster and between-cluster link probabilities. Therefore nodes with a high internal linking could in fact be clustered separately if their relation to other clusters is different. By placing a Chinese Restaurant Process (CRP) prior on the node partitioning the number of clusters is determined during the model inference. A Beta distribution is used as prior for the cluster link probabilities, these probabilities can however be integrated out and thus model inference only involves sampling the node cluster assignments using Markov chain Monte Carlo. Here we seek common node clustering and cluster link probabilities over subjects. In addition we consider two

constrained versions of the IRM. The Bayesian Community Detection (BCD) [9], which for a given cluster restricts between-cluster link probabilities to be strictly lower than the within-cluster link probability. The Infinite Diagonal Model (IDM) allows individual within-cluster probabilities but restricts all between-cluster link probabilities to be equal. IDM can thus be seen as a probabilistic version of modularity. The NPAIRS [10] split-half framework was used to evaluate the models predictability and reproducibility for different graph link densities (2, 4, 8, 16, and 32%). Predictability was measured using test likelihood of data from one split when the model was trained on the other split. Predictability was calculated as the normalized mutual information between node partitions of the two splits. For all link densities and datasets investigated BCD and IRM showed clearly better predictability than IDM and BCD had better or on par predictability compared with IRM (Fig. 2). For link densities greater than 4% IRM and BCD had higher reproducibility than IDM, and BCD had better or on par reproducibility compared with IRM. Resting state organization in fmri networks is consistent with a model based on community structure. Modelling between-cluster linking significantly increases predictability and reproducibility. Node partitioning and link probabilities together represent a compressed description of the network, which allows for group inference, behavioral measures correlation, and is a potential new expressive bio-marker.

Figure 1

Figure 2

References [1] Newman, M. (2004), Finding and evaluating community structure in networks. Physical Review E, 69(2), 026113. [2] Meunier, D. (2009), Age-related changes in modular organization of human brain functional networks, NeuroImage, 44(3), 715 23. [3] Stevens, A. A. (2012), Functional brain network modularity captures inter- and intra-individual variation in working memory capacity, PloS one, 7(1), e30468. [4] Bassett, D. S. (2011), Dynamic reconfiguration of human brain networks during learning, Proceedings of the National Academy of Sciences of the United States of America, 108(18), 7641 6. [5] Kemp, C. (2006), Learning systems of concepts with an infinite relational model, In Proceedings of the 21th National Conference on Artificial Intelligence (AAAI) (pp. 381 388). [6] Xu, Z. (2006), Infinite hidden relational models. In In Proceedings of the 22nd International Conference on Uncertainity in Artificial Intelligence (UAI). [7] Glover, G. H. (2000), Image-based method for retrospective correction of physiological motion effects in fmri: RETROICOR, Magnetic Resonance in Medicine, 44(1), 162 167. [8] Tzourio-Mazoyer, N. (2002), Automated anatomical labeling of activations in SPM using a macroscopic anatomical parcellation of the MNI MRI single-subject brain. Neuroimage, 15(1), 273 89. [9] Mørup, M. (2012), Bayesian community detection, Neural computation, 24(9), 2434 56. [10] Strother, S. C. (2002), The quantitative evaluation of functional neuroimaging experiments: the NPAIRS data analysis framework, NeuroImage, 15(4), 747 71.