Meshfree Methods in Conformation Dynamics. Marcus Weber



Similar documents
(Big Data) Approaches to Designing Functional Molecules. Christof Schütte. Research Center Matheon Mathematics for key technologies

Unsupervised and supervised dimension reduction: Algorithms and connections

Gaussian Processes to Speed up Hamiltonian Monte Carlo

Appendix A. Rayleigh Ratios and the Courant-Fischer Theorem

The HipGISAXS Software Suite: Recovering Nanostructures at Scale

NEW VERSION OF DECISION SUPPORT SYSTEM FOR EVALUATING TAKEOVER BIDS IN PRIVATIZATION OF THE PUBLIC ENTERPRISES AND SERVICES

Dmitri Krioukov CAIDA/UCSD

Exploratory data analysis for microarray data

Language: English Lecturer: Gianni de Fabritiis. Teaching staff: Language: English Lecturer: Jordi Villà i Freixa

T : Classification as Spam or Ham using Naive Bayes Classifier. Santosh Tirunagari :

Pricing and calibration in local volatility models via fast quantization

A Toolbox for Bicluster Analysis in R

Dynamic Eigenvalues for Scalar Linear Time-Varying Systems

VISUALIZING SPACE-TIME UNCERTAINTY OF DENGUE FEVER OUTBREAKS. Dr. Eric Delmelle Geography & Earth Sciences University of North Carolina at Charlotte

Big Data Research in Berlin BBDC and Apache Flink

Dirichlet Processes A gentle tutorial

Learning outcomes. Knowledge and understanding. Competence and skills

Fortgeschrittene Computerintensive Methoden: Fuzzy Clustering Steffen Unkel

USING SPECTRAL RADIUS RATIO FOR NODE DEGREE TO ANALYZE THE EVOLUTION OF SCALE- FREE NETWORKS AND SMALL-WORLD NETWORKS

A Novel Fuzzy Clustering Method for Outlier Detection in Data Mining

Going Big in Data Dimensionality:

B490 Mining the Big Data. 2 Clustering

EM Clustering Approach for Multi-Dimensional Analysis of Big Data Set

How to report the percentage of explained common variance in exploratory factor analysis

Statistics in Applications III. Distribution Theory and Inference

Curriculum Vitae. Ulrich Franke. Personal information. Research Interests. Professional Memberships

Oracle8i Spatial: Experiences with Extensible Databases

Prof. Dr. Frank Riedel

Part 2: Community Detection

A Partially Supervised Metric Multidimensional Scaling Algorithm for Textual Data Visualization

A Fuzzy AHP based Multi-criteria Decision-making Model to Select a Cloud Service

CURRICULUM VITAE. De Montfort University - Leicester - UK 2008 PhD Degree in Architecture-Product and Spatial Design

Computational Statistics for Big Data


Mobile Phone APP Software Browsing Behavior using Clustering Analysis

Introduction to Markov Chain Monte Carlo

Complex Network Visualization based on Voronoi Diagram and Smoothed-particle Hydrodynamics

Cluster Analysis: Advanced Concepts

Visualization of large data sets using MDS combined with LVQ.

Review Jeopardy. Blue vs. Orange. Review Jeopardy

Klaus D. Goepel No 10 Changi Business Park Central 2 Hansapoint@CBP #06-01/08 Singapore drklaus@bpmsg.com ABSTRACT

Stefan Vigerske Curriculum Vitae

Review of Modern Techniques of Qualitative Data Clustering

Richard M. Heiberger Professor Emeritus of Statistics Temple University January 2016

QoS-Aware Storage Virtualization for Cloud File Systems. Christoph Kleineweber (Speaker) Alexander Reinefeld Thorsten Schütt. Zuse Institute Berlin

SPECIAL PERTURBATIONS UNCORRELATED TRACK PROCESSING

1 Finite difference example: 1D implicit heat equation

Distance Metric Learning in Data Mining (Part I) Fei Wang and Jimeng Sun IBM TJ Watson Research Center

Load Balancing in Cellular Networks with User-in-the-loop: A Spatial Traffic Shaping Approach

Introduction to Principal Components and FactorAnalysis

An Evaluation Model for Determining Insurance Policy Using AHP and Fuzzy Logic: Case Studies of Life and Annuity Insurances

LECTURE 11 : GLASSY DYNAMICS - Intermediate scattering function - Mean square displacement and beyond - Dynamic heterogeneities - Isoconfigurational

FUZZY CLUSTERING ANALYSIS OF DATA MINING: APPLICATION TO AN ACCIDENT MINING SYSTEM

Statistics Graduate Courses

A New Method for Dimensionality Reduction using K- Means Clustering Algorithm for High Dimensional Data Set

Math 550 Notes. Chapter 7. Jesse Crawford. Department of Mathematics Tarleton State University. Fall 2010

Real-Time Credit-Card Fraud Detection using Artificial Neural Network Tuned by Simulated Annealing Algorithm

A Study of Web Log Analysis Using Clustering Techniques

How To Cluster

A Content based Spam Filtering Using Optical Back Propagation Technique

Research Associate at Universität Hamburg. Research Associate at University of Bath

Natural Language Querying for Content Based Image Retrieval System

Lloyd Spencer Lincoln Re

MCMC Using Hamiltonian Dynamics

Fuzzy Clustering Technique for Numerical and Categorical dataset

IJCSES Vol.7 No.4 October 2013 pp Serials Publications BEHAVIOR PERDITION VIA MINING SOCIAL DIMENSIONS

Principal Component Analysis

Graph Embedding to Improve Supervised Classification and Novel Class Detection: Application to Prostate Cancer

Library (versus Language) Based Parallelism in Factoring: Experiments in MPI. Dr. Michael Alexander Dr. Sonja Sewera.

DATA MINING CLUSTER ANALYSIS: BASIC CONCEPTS

Similar matrices and Jordan form

Prof. Dr. D. W. Cunningham, Berliner Strasse 35A, Cottbus, Germany

Calculating Effect-Sizes

Subspace Analysis and Optimization for AAM Based Face Alignment

Hidden Markov Models

Load Balancing Techniques

Program Understanding in Software Engineering

Network Selection Using TOPSIS in Vertical Handover Decision Schemes for Heterogeneous Wireless Networks

A Software Tool for. Automatically Veried Operations on. Intervals and Probability Distributions. Daniel Berleant and Hang Cheng

Volume 2, Issue 12, December 2014 International Journal of Advance Research in Computer Science and Management Studies

Inner products on R n, and more

Market Risk Analysis. Quantitative Methods in Finance. Volume I. The Wiley Finance Series

Isotropic Entanglement

Sound. References: L.D. Landau & E.M. Lifshitz: Fluid Mechanics, Chapter VIII F. Shu: The Physics of Astrophysics, Vol. 2, Gas Dynamics, Chapter 8

Distributed computing of failure probabilities for structures in civil engineering

How To Solve The Line Connectivity Problem In Polynomatix

Methods of Data Analysis Working with probability distributions

THREE DIMENSIONAL REPRESENTATION OF AMINO ACID CHARAC- TERISTICS

IMPLEMENTING A SPC INTEGRATED SYSTEM TO IMPROVE MANUFACTURING PROCESSES IN AUTOMOTIVE INDUSTRY

Clustering & Visualization

A Computational Framework for Exploratory Data Analysis

Transcription:

Meshfree Methods in Conformation Dynamics Marcus Weber Zuse Institute Berlin Free University Berlin Berlin Center for Genom-based Bioinformatics DFG Research Center Matheon

ZIB Scientific Computing: Dept. Computational Drug Design Peter Deuflhard, Frank Cordes, Marcus Weber, Susanna Kube, Holger Meyer, Ulrich Nowak, Alexander Riemer Dept. Scientific Visualization Hans-Christian Hege, Daniel Baum, Johannes Schmidt-Ehrenberg, Timm Baumeister, Maro Bader Cooperation: FU Biocomputing: Christof Schütte, Wilhelm Huisinga, Alexander Fischer, Illja Horenko, Carsten Hartmann, Phillip Metzner, Eike Meerbach Berlin Center for Genome-based Bioinformatics (BCBio) DFG Research Center Matheon 2 Conformation Dynamics

Outline 1. Motivation and Aims of Conformation Dynamics 2. Conformations 3. Robust Perron Cluster Analysis 4. Sampling Scheme 5. Meshless Basis Functions 3 Conformation Dynamics

1. Motivation and Aims of Conformation Dynamics 4 Conformation Dynamics

Motivation: Example N-pentane ZIBgridfree, amiramol 5 Conformation Dynamics

Motivation conformation dynamics identification of molecular conformations inter conformational dynamics sampling according to Boltzmann distr. 6 Conformation Dynamics

2. Conformations 7 Conformation Dynamics

Decomposition of Botzmann Distribution Conformations = Decomposition of the Boltzmann distribution into overlapping densities V Ω ZIBmol, amiramol 8 Conformation Dynamics

Approximation of Conformations V Ω 9 Conformation Dynamics

Approximation of Conformations V Ω 10 Conformation Dynamics

Approximation of Conformations V transition regions multiple minima Ω 11 Conformation Dynamics

Approximation of Conformations χ 1 0 Ω 12 Conformation Dynamics

Conformations as membership functions Conformations are membership functions (fuzzy sets): χ i : Ω [0, 1], i = 1,..., n C, n C i=1 Total spatial Boltzmann distribution: χ i (q) = 1, q Ω. π : Ω RI +, π(q) exp( β V (q)) Decomposition of the Boltzmann distribution (partition function w i ): π i (q) = 1 Ω χ i(q) π(q) dq χ i(q) π(q), q Ω. 13 Conformation Dynamics

3. Robust Perron Cluster Analysis 14 Conformation Dynamics

Function approximation Approximation via s positive, partition-of-unity basis functions Φ j, j = 1,..., s: χ i = s j=1 C(j, i) Φ j, 0 C RI s nc, i C(j, i) = 1. Φ Φ Φ Φ 1 2 3 4 C(j,i) χ χ 1 2 j i 15 Conformation Dynamics

Function approximation Approximation via s positive, partition-of-unity basis functions Φ j, j = 1,..., s: χ i = s j=1 C(j, i) Φ j, 0 C RI s nc, i C(j, i) = 1. Further condition: More precisely (Schütte, 2000): χ i (q(τ)) χ i (q(0)). P τ χ i χ i, P τ : L 1,2 (π) L 1,2 (π). 16 Conformation Dynamics

Robust Perron Cluster Analysis (PCCA+) Perturbation theory, optimization approach: Deuflhard, W., 2005 Main idea: if where P C S C C = XA, X RI s n C, A RI n C n C P X = SXΛ, Λ I s. Problem: Find optimal A, such that C is row-stochastic. 17 Conformation Dynamics

Robust Perron Cluster Analysis (PCCA+) P X = SXΛ, Λ I s 18 Conformation Dynamics

Robust Perron Cluster Analysis (PCCA+) P X SX I S 19 Conformation Dynamics

Robust Perron Cluster Analysis (PCCA+) P X SX 20 Conformation Dynamics

Robust Perron Cluster Analysis (PCCA+) P XA SXA 21 Conformation Dynamics

Robust Perron Cluster Analysis (PCCA+) P C SC 22 Conformation Dynamics

Robust Perron Cluster Analysis (PCCA+) PCCA+ can be used for general cluster problems, too. Condition: P, S have a hidden block-diagonal structure. W., 2004. Rungsarityotin, W., Schliep, 2004. W., Kube, 2005. Survival fit, 2 clusters Proportion survived 0.0 0.2 0.4 0.6 0.8 1.0 p=0.000198 0 50 100 150 Time 23 Conformation Dynamics

4. Sampling Scheme 24 Conformation Dynamics

Rescaling Trick S kj = P kj = Ω Ω Φ k (q) Φ j (q) π(q) dq (P τ Φ k (q)) Φ j (q) π(q) dq 25 Conformation Dynamics

Rescaling Trick P, S are numerically not available. Instead of solve P X = SXΛ D 1 P X = D 1 SXΛ, where D is an (s, s)-diagonal matrix with d ii = Ω Φ i(q) π(q) dq. P = D 1 P, S = D 1 S are computable. Left eigenvector for λ 1 = 1 of P and S consists of the diagonal elements of D computation of thermodynamic weights is possible. 26 Conformation Dynamics

Umbrella sampling We have to compute (A= Φ k or A= P τ Φ k ): Ω S kj, P k,j = A Φ j(q) π(q) dq Ω Φ j(q) π(q) dq = Ω A Φ j (q) π(q) Ω Φ j(q) π(q) dq dq I.e. compute observable A via Monte Carlo methods for a density, which is known up to a normalization constant Metropolis-Hastings -type algorithm. 27 Conformation Dynamics

Umbrella sampling Unnormalized density: ( Φ j (q) exp( β V (q)) = exp β [V (q) 1 ) β ln(φ j(q))]. We use HMC with modified potentials (Torrie, Valleau, 1977): V j = V 1 β ln(φ j) 28 Conformation Dynamics

5. Meshless Basis Functions 29 Conformation Dynamics

Example: Cyclohexane W., Meyer, 2005 30 Conformation Dynamics

Ω vs. 3D-representation 31 Conformation Dynamics

Start Discretization 32 Conformation Dynamics

Start Discretization 33 Conformation Dynamics

Start Discretization 34 Conformation Dynamics

Partition of Unity Liu, 2002. Shepard, 1968. Some properties: Φ i (q) = For α Voronoi tesselation. exp( α dist 2 (q i, q)) s j=1 exp( α dist2 (q j, q)) With special choice of dist, Φ i strictly quasi-concave ( window property ). Block-structure of P and S is not sensitive wrt. α. In other words: Eigenvector data X, which is used for clustering, is not sensitive wrt. α. 35 Conformation Dynamics

Summary Conformations: Overlapping decomposition of the Boltzmann distribution. Complexity reduction: Membership functions. Robust Perron Cluster Analysis: Solution of an eigenvalue problem for the Schütte operator + optimization. Sampling: Rescaling trick. Meshless basis functions: Neccessary condition in order to avoid curse of dimensionality. Thank you for your attention!!! 36 Conformation Dynamics

References P. Deuflhard and M. Weber. (2005). Robust Perron Cluster Analysis in Conformation Dynamics. Lin. Alg. App., Vol 398C, pp. 161-184. H. Meyer. (2005). Die Implementierung und Analyse von HuMFree einer gitterfreien Methode zur Konformationsanalyse von Wirkstoffmolekülen. Master Thesis, Free University Berlin. M. Weber. (2005). Meshless Methods in Conformation Dynamics. PhD Thesis, Free University Berlin, in preparation. M. Weber and S. Kube. (2005). Robust Perron Cluster Analysis for Various Applications in Computational Life Science. Submitted to: CompLife05 in Konstanz. M. Weber and H. Meyer. (2005). ZIBgridfree Adaptive Conformation Analysis with Qualified Support of Transition States and Thermodynamic Weights. Submitted to: CompLife05 in Konstanz (25th -27th September). 37 Conformation Dynamics