ANALYSIS OF PROTEOME COMPEXITY BASED ON COUNTING DOMAIN-TO-PROTEIN LINKS
|
|
- Maximilian Mathews
- 7 years ago
- Views:
Transcription
1 COMUTATIONAL STRUCTURAL AND FUNCTIONAL ROTEOMICS ANALYSIS OF ROTEOME COMEXITY BASED ON COUNTING DOMAIN-TO-ROTEIN LINKS Kuznetsov V.A.*, ckalov V.V. 2, Knott G.D. 3, Kanapn A.A. 4 CIT/NIH & SRA Internatonal, Inc. Bethesda, MD, 20892, USA, e-mal: vk28u@nh.gov; 2 Insttute of Theoretcal and Appled Mechancs SB RAS, Novosbrsk, Russa, e-mal: pckalov@tam.nsc.ru; 3 Cvlzed Software, Inc., Slver Sprng, MD, 20906, USA; 4 EMBL-EBI Wellcome Trust Genome Campus, Hnxton, CB0 SD, UK, * Correspondng author: e-mal: alex@eb.ac.uk Keywords: proteome complexty, evoluton, doman-to-proten lnks, skew dstrbutons Summary We defne the lst of domans to protens, together wth the numbers of ther occurrences (lnks to protens) found n the proteome of an organsm to be the doman-to-proten lnkage profle (DL) of the proteome. We estmated the DL for the 56 fully-sequenced genome organsms represented n the Interro database. Ths work presents several quanttatve measures of the complexty of a proteome based on the DL. For each of the 56 studed organsms, we found two large subsets of domans: the domans whch occur two or more tmes n at least one proten, and the domans whch are not duplcated wthn any proten of the proteome. The latter set of domans well reflects the ncreasng trend of bologcal complexty due to evoluton. The statstcal dstrbutons of the number of doman-to-proten lnks n the proteome and the estmates of the dfferences between the DLs for pars of organsms are used as measures of relatve bologcal complexty of the organsms. In partcular, we show quanttatvely the greater complexty of the human proteome, relatve to that of a mouse or a rat. These dfferences are only partally reflected n the number of proten-codng genes estmated for these speces by the sequencng genome projects. Introducton There are no consensus quanttatve measures of the relatve or absolute complexty of an organsm. The bologcal complexty of an organsm should be characterzed by the lst of all protens n a gven proteome and ther dynamc nteractons among themselves (proten network), the other molecules (proten-dna network, etc.). However, the total number of putatve proten sequences based on complete genome sequence data of a gven organsm can be predcted only approxmately, and our knowledge on dynamc nteractons of protens n an organsm s also essentally ncomplete. Thus, we need a more tractable and practcal approaches to descrbng the bologcal complexty. rotens contan short structural and/or functonal 'blocks' (sequences of amno acds) called structural motfs (of length 5-35 nt) and structural domans (of length nt) that are seen repeatedly n many protens of all speces whose genome have been examned. The entre number of domans n nature s probably a very lmted number. The estmated number of such classes of homologous sequences of motfs and domans n nature ranges from 6,000 to 0,000 or so [-3]. In ths work, all such proten blocks wll be called, for brevty, domans. The domans are essental to the bologcal functon (s) of the proten n whch they occur and serve as evolutonarly conserved buldng blocks for the formng of protens. The domans are correspondng to specfc sequences of DNA wthn genes whch have been evolutonarly conserved. DNA correspondng to a doman may occur multple tmes n a gven proten-codng gene and/or n many dfferent proten-codng genes wthn a gven genome, and n the genomes of many speces. We attempt to determne the measures of the bologcal complexty based on a lmted set of domans. 293
2 If a doman D occurs n the proten, we say ths consttutes a doman-to-proten lnk. The lst j of domans together wth the numbers of occurrences of each doman (lnks) found n the proteome of an organsm s called the doman-to-proten lnkage profle (DL) of the proteome. The DL should allow us to characterze the doman-to-proten lnk network for each organsm. Currently, we do not know all domans and all protens n a gven organsm. We can study the DL of organsm by observng sample DLs n representatve proteomes (.e. proten sets whch were specfed for organsms). However, several proten doman/motf databases provde large enough samples of DLs for a varety of organsms. We can analyze the samples of doman-to-proten lnkage nformaton n fully-sequenced genome organsms to categorze ther proteome complextes. Ths work presents several quanttatve measures of the complexty of a proteome based on observed DLs that appear to be approprate and consstent. roten Doman Database Analyzer Our nformaton about DLs of the 56 fully-sequenced genome archaeal, bacteral and eukaryotc organsms was obtaned from the Interro database ( Aprl, 2004) [5], the proten sequences for the organsms were obtaned from the roteome Analyss Database ( Aprl 2004). We developed a roten Doman Database Analyzer (DDA) program whch we used to access the Interro database and download data nto a local MySQL relatonal database []. The MySQL database conssts of a N tot L table, where N tot = 9609 domans; and L = 56 organsms. Each row corresponds to an Interro doman and each column corresponds to an organsm. The (, S) -th entry of the table s the number of occurrences of the doman n the proteome of organsm S. Informaton of ths table was analyzed by DDA data mnng tools, whch nclude the statstcal and graphcal functons, logcal functons, descrptve statstcs, and correlaton analyss. Defntons Let us focus on a specfc organsm S. Let A = { a, a2,... an } be the set of observed domans contaned n the protens of the organsm S. Let B = { b, b2,..., b} be the set of protens n the representatve proteome (sample of observed protens) for the organsm S. We defne the N adjacency matrx C ': = [ c' j ] ( =,2,..., N ; j =,2,..., ) for the organsm S as follows: the value c' j = k, f proten b j contans doman a exactly k tmes. Note that k {0,,2,...}. The value c' s the number of doman-to-proten lnks for the (doman, proten) par a, b }. j { j Let : = C' * = c' j j= denote the number of occurrences of the doman a n all protens of the representatve proteome of S; =,2,..., N. We call the value m the number of redundant doman-to-proten lnks of the doman a of S. Let M denote the total number of doman-toproten lnks n the representatve proteome of the organsm S. M ' = 294 N = j= c' j. The value M s
3 called the redundant connectvty number of the (doman, proten) network. Note that all occurrences of a doman n the protens of S are counted. We also defne the adjacency matrx C, where c j =, f proten b j contans doman a at least once, and we defne c j = 0 otherwse. Ths matrx reflects the non-redundant structure of the doman-toproten lnks n the (doman, proten) network. Ths matrx counts each (doman, proten) lnk only once. Let m = c j j= lnks of the doman a of S. Let M =. We call the value m the number of non-redundant doman-to-proten N c j = j=. Note M M '. We call the matrx C the nonredundant adjacency matrx of S, and we call M the non-redundant connectvty number of the (doman, proten) network of S. The values M and M have been used as the non redundant and redundant measures of the proteome complexty of an organsm [, 4]. To quantfy the complexty assocated wth multple occurrences of the doman a n an organsm, we defne the redundancy value δ m = m, ( ) () To quantfy the relatve complexty of the doman a whch redundantly occurs m ' s tmes n the organsm S and whch redundantly occurs k m ' ( ) tmes n the organsm K, we can defne the dfference of redundant complexty value for the doman a ( s) ( δ =. By omttng the prm symbol n above notatons, we can also ntroduce the dfference of non redundant complexty value of the doman ( s) ( = m m δ m. a To characterze the relatve redundant complexty of the organsm S versus the organsm K, we use the emprcal bvarate dstrbutons of the random values ( µ ', δ ), where =,2,..., N ; µ ' = ( + )/ 2 s a mean value of the numbers of the lnks and N s the total number of domans occurred n doman-to-proten profles of the organsms S and K. By omttng the prm symbol n above notatons, we obtan the emprcal bvarate dstrbuton of the random values ( µ, δ m ), whch we can use as the relatve non redundant complexty measure of the organsms S and K. Results and Dscusson There are at least two dstngush processes of doman s spread n nature: ntegraton of two or more dfferent domans n a new proten (formng non-redundant doman-to-proten lnks) and multplcaton of a doman wthn the same proten (formng redundant doman-to-proten lnks). anels A-C n Fgures show the relatonshps between the numbers of non-redundant and the numbers of redundant doman-to-proten lnks counted for the T. volcanum, E. col and human 295
4 representatve proteomes. These panels demonstrate typcal patterns of the bvarate frequency dstrbutons of the random values ( m, ) (={ ( m, ),...,( m N ( s ), N ( s ) )}) correspondng to the numbers of redundant and non-redundant occurrences of domans n the organsm S. All panels n Fgure show a preferental dstrbuton of ponts along the dagonal; the spread of ponts n the orthogonal drecton to the dagonal s smaller. The extreme cases (T. volcanum and human proteomes) clearly dsplay the ncreased complexty of the human proteome due to the ncreased number of protens whch preferentally combnng dfferent domans together. anel D n Fg. shows the observed values ( m, ) for the 56 organsms s a factor of 0 larger then any specfc organsm. We found that these dstrbuton patterns are typcal for the studed archael, bacteral and eukaryotc organsms. Fgures A-D mply that ncreasng bologcal complexty n nature s mostly assocated wth formaton of new mult-doman protens combnng dfferent domans rater that wth the ncrease of doman-to-proten lnks occurrng due to ncreasng of multplcaton of domans wthn protens n whch domans have been already occurred. Fg.. The emprcal dstrbutons of the numbers of redundant lnks versus the numbers of non redundant lnks counted for the (A) T. volcanum, (B) E. col and (C) human representatve proteomes, and for (D) pooled data of 56 organsms. Dscontnue lne: lnear regresson; sold lne: dagonal. Fg. 2. The relatve complexty of proteome of a human versus a mouse, a rat (C) and A. thalna, respectvely. A: the dstrbuton of ( µ ', δ ); symbol s ndcates the human organsm, k = ndcates the mouse organsm. B, C, D: the dstrbutons of ( µ, δ m ); k=2, 3, 4 ndcate the mouse, the rat and A. thalna, (, respectvely. =,2,..., N s. 296
5 For all of the 56 studed organsms, we also found two dsjonted subsets of domans: the domans whch occurred two or more tmes n at least one proten of a proteome, and the domans whch was not multpled wthn any proten of the proteome (see examples on Fg. ). 454 (47 %) of the 9609 domans occurred two or more tmes n a same proten of the 56 organsms. We also found that f a doman s more common n the entre proteome world, then that doman has a bgger chance to appear multple tmes n a proten of any organsm (see Fg. ). Fgure 2 demonstrates how the proteome complexty of pars of organsms can be compared. Ths fgure shows the relatve proteome complexty measure for a human versus a mouse, a rat and A. thalana, respectvely. For example, the panel A shows the bvarate dstrbutons of dfferences of the number of redundant lnks for the human and mouse organsms, multpled by factor 0.5 wth respect to average number of the redundant lnks of the human and mouse organsms. Ths dstrbuton s hghly asymmetrc about axes x: the number of postve dfferences (abundant for a human) s bgger than the number of negatve dfferences (abundant for a mouse), and the postve hghlyabundant values occur more often n the human sample than n the mouse sample. Smlar asymmetrc trend we observed for our human-mouse comparson n the case of redundant lnks (Fg. 2B). Ths ndcates that the human organsm reuses domans more frequently n same proten and n dfferent protens, and nvents more dverse mult-doman protens than the mouse organsm. Ths analyss suggests that even though that the numbers of non redundant proten-codng genes n a human (~33,600 genes [, 4]) and n a mouse (~32,000 genes, by our current estmate based on the method n [, 4]) are approxmately smlar, and that even the total number of domans are also smlar numbers n these organsms (~5,600 for a human and ~5,00 for a mouse [4]), the proteome complexty and dversty of a sgnfcant fracton of mult-doman protens n a human s hgher than n a mouse. Large asymmetry n the bvarate dstrbuton of the values ( µ, δ ) around axes x we observed for a human versus a rat (Fg. 2C). However, the human and A. thalana proteomes show smlar proteome complexty by our crtera (Fg. 2D). Ths s not a surprse, because even though the number of proten-codng genes n human s larger than the number of proten-codng genes n A. thalana (~26,000-27,000 [, 4]), relatvely recent massve (and mperfect) genome duplcaton events n A. thalana mght dramatcally ncreased of proten repertore due to recombnaton events, whch perhaps ncreased of the doman shufflng n protens, but dd not ncrease of the number of domans (~5,300 domans [4]) presented n the A. thalana proteome. References. Kuznetsov V.A. Statstcs of the numbers of transcrpts and proten sequences encoded n the genome // Computatonal and Statstcal Methods to Genomcs / Eds. Zhang W., Shmulevch I. Kluwer, Boston, Kuznetsov V.A., ckalov V.V., Senko O.V., Knott G.D. Analyss of evolvng proteomes: redctons of the number of proten domans n nature and the number of genes n eukaryotc organsms // J. Bol. Systems V Kuznetsov V.A. Famly of skewed dstrbutons assocated wth the gene expresson and proteome evoluton // Sgnal rocessng. 2003a. V Kuznetsov V.A. A stochastc model of evoluton of conserved proten codng sequence n the archaeal, bacteral and eukaryotc proteomes // Fluctuaton and Nose Letters. 2003b. V. 3. L295 L Mulder N.J., Apweler R., Attwood T.K. et al. Interro Consortum. The Interro Database, 2003 brngs ncreased coverage and new features // Nucl. Acds Res V m 297
Can Auto Liability Insurance Purchases Signal Risk Attitude?
Internatonal Journal of Busness and Economcs, 2011, Vol. 10, No. 2, 159-164 Can Auto Lablty Insurance Purchases Sgnal Rsk Atttude? Chu-Shu L Department of Internatonal Busness, Asa Unversty, Tawan Sheng-Chang
More informationWhat is Candidate Sampling
What s Canddate Samplng Say we have a multclass or mult label problem where each tranng example ( x, T ) conssts of a context x a small (mult)set of target classes T out of a large unverse L of possble
More informationCausal, Explanatory Forecasting. Analysis. Regression Analysis. Simple Linear Regression. Which is Independent? Forecasting
Causal, Explanatory Forecastng Assumes cause-and-effect relatonshp between system nputs and ts output Forecastng wth Regresson Analyss Rchard S. Barr Inputs System Cause + Effect Relatonshp The job of
More informationbenefit is 2, paid if the policyholder dies within the year, and probability of death within the year is ).
REVIEW OF RISK MANAGEMENT CONCEPTS LOSS DISTRIBUTIONS AND INSURANCE Loss and nsurance: When someone s subject to the rsk of ncurrng a fnancal loss, the loss s generally modeled usng a random varable or
More informationCHAPTER 5 RELATIONSHIPS BETWEEN QUANTITATIVE VARIABLES
CHAPTER 5 RELATIONSHIPS BETWEEN QUANTITATIVE VARIABLES In ths chapter, we wll learn how to descrbe the relatonshp between two quanttatve varables. Remember (from Chapter 2) that the terms quanttatve varable
More information8.5 UNITARY AND HERMITIAN MATRICES. The conjugate transpose of a complex matrix A, denoted by A*, is given by
6 CHAPTER 8 COMPLEX VECTOR SPACES 5. Fnd the kernel of the lnear transformaton gven n Exercse 5. In Exercses 55 and 56, fnd the mage of v, for the ndcated composton, where and are gven by the followng
More informationLecture 2 Sequence Alignment. Burr Settles IBS Summer Research Program 2008 bsettles@cs.wisc.edu www.cs.wisc.edu/~bsettles/ibs08/
Lecture 2 Sequence lgnment Burr Settles IBS Summer Research Program 2008 bsettles@cs.wsc.edu www.cs.wsc.edu/~bsettles/bs08/ Sequence lgnment: Task Defnton gven: a par of sequences DN or proten) a method
More informationForecasting the Direction and Strength of Stock Market Movement
Forecastng the Drecton and Strength of Stock Market Movement Jngwe Chen Mng Chen Nan Ye cjngwe@stanford.edu mchen5@stanford.edu nanye@stanford.edu Abstract - Stock market s one of the most complcated systems
More informationCHAPTER 14 MORE ABOUT REGRESSION
CHAPTER 14 MORE ABOUT REGRESSION We learned n Chapter 5 that often a straght lne descrbes the pattern of a relatonshp between two quanttatve varables. For nstance, n Example 5.1 we explored the relatonshp
More informationRing structure of splines on triangulations
www.oeaw.ac.at Rng structure of splnes on trangulatons N. Vllamzar RICAM-Report 2014-48 www.rcam.oeaw.ac.at RING STRUCTURE OF SPLINES ON TRIANGULATIONS NELLY VILLAMIZAR Introducton For a trangulated regon
More informationCHOLESTEROL REFERENCE METHOD LABORATORY NETWORK. Sample Stability Protocol
CHOLESTEROL REFERENCE METHOD LABORATORY NETWORK Sample Stablty Protocol Background The Cholesterol Reference Method Laboratory Network (CRMLN) developed certfcaton protocols for total cholesterol, HDL
More informationPortfolio Loss Distribution
Portfolo Loss Dstrbuton Rsky assets n loan ortfolo hghly llqud assets hold-to-maturty n the bank s balance sheet Outstandngs The orton of the bank asset that has already been extended to borrowers. Commtment
More informationSTATISTICAL DATA ANALYSIS IN EXCEL
Mcroarray Center STATISTICAL DATA ANALYSIS IN EXCEL Lecture 6 Some Advanced Topcs Dr. Petr Nazarov 14-01-013 petr.nazarov@crp-sante.lu Statstcal data analyss n Ecel. 6. Some advanced topcs Correcton for
More informationv a 1 b 1 i, a 2 b 2 i,..., a n b n i.
SECTION 8.4 COMPLEX VECTOR SPACES AND INNER PRODUCTS 455 8.4 COMPLEX VECTOR SPACES AND INNER PRODUCTS All the vector spaces we have studed thus far n the text are real vector spaces snce the scalars are
More informationAn Alternative Way to Measure Private Equity Performance
An Alternatve Way to Measure Prvate Equty Performance Peter Todd Parlux Investment Technology LLC Summary Internal Rate of Return (IRR) s probably the most common way to measure the performance of prvate
More informationExhaustive Regression. An Exploration of Regression-Based Data Mining Techniques Using Super Computation
Exhaustve Regresson An Exploraton of Regresson-Based Data Mnng Technques Usng Super Computaton Antony Daves, Ph.D. Assocate Professor of Economcs Duquesne Unversty Pttsburgh, PA 58 Research Fellow The
More informationHow To Calculate The Accountng Perod Of Nequalty
Inequalty and The Accountng Perod Quentn Wodon and Shlomo Ytzha World Ban and Hebrew Unversty September Abstract Income nequalty typcally declnes wth the length of tme taen nto account for measurement.
More informationDEFINING %COMPLETE IN MICROSOFT PROJECT
CelersSystems DEFINING %COMPLETE IN MICROSOFT PROJECT PREPARED BY James E Aksel, PMP, PMI-SP, MVP For Addtonal Informaton about Earned Value Management Systems and reportng, please contact: CelersSystems,
More informationAn Interest-Oriented Network Evolution Mechanism for Online Communities
An Interest-Orented Network Evoluton Mechansm for Onlne Communtes Cahong Sun and Xaopng Yang School of Informaton, Renmn Unversty of Chna, Bejng 100872, P.R. Chna {chsun,yang}@ruc.edu.cn Abstract. Onlne
More informationFREQUENCY OF OCCURRENCE OF CERTAIN CHEMICAL CLASSES OF GSR FROM VARIOUS AMMUNITION TYPES
FREQUENCY OF OCCURRENCE OF CERTAIN CHEMICAL CLASSES OF GSR FROM VARIOUS AMMUNITION TYPES Zuzanna BRO EK-MUCHA, Grzegorz ZADORA, 2 Insttute of Forensc Research, Cracow, Poland 2 Faculty of Chemstry, Jagellonan
More informationHow To Understand The Results Of The German Meris Cloud And Water Vapour Product
Ttel: Project: Doc. No.: MERIS level 3 cloud and water vapour products MAPP MAPP-ATBD-ClWVL3 Issue: 1 Revson: 0 Date: 9.12.1998 Functon Name Organsaton Sgnature Date Author: Bennartz FUB Preusker FUB Schüller
More information1. Measuring association using correlation and regression
How to measure assocaton I: Correlaton. 1. Measurng assocaton usng correlaton and regresson We often would lke to know how one varable, such as a mother's weght, s related to another varable, such as a
More informationBrigid Mullany, Ph.D University of North Carolina, Charlotte
Evaluaton And Comparson Of The Dfferent Standards Used To Defne The Postonal Accuracy And Repeatablty Of Numercally Controlled Machnng Center Axes Brgd Mullany, Ph.D Unversty of North Carolna, Charlotte
More informationSPEE Recommended Evaluation Practice #6 Definition of Decline Curve Parameters Background:
SPEE Recommended Evaluaton Practce #6 efnton of eclne Curve Parameters Background: The producton hstores of ol and gas wells can be analyzed to estmate reserves and future ol and gas producton rates and
More informationTHE METHOD OF LEAST SQUARES THE METHOD OF LEAST SQUARES
The goal: to measure (determne) an unknown quantty x (the value of a RV X) Realsaton: n results: y 1, y 2,..., y j,..., y n, (the measured values of Y 1, Y 2,..., Y j,..., Y n ) every result s encumbered
More informationConversion between the vector and raster data structures using Fuzzy Geographical Entities
Converson between the vector and raster data structures usng Fuzzy Geographcal Enttes Cdála Fonte Department of Mathematcs Faculty of Scences and Technology Unversty of Combra, Apartado 38, 3 454 Combra,
More informationRisk-based Fatigue Estimate of Deep Water Risers -- Course Project for EM388F: Fracture Mechanics, Spring 2008
Rsk-based Fatgue Estmate of Deep Water Rsers -- Course Project for EM388F: Fracture Mechancs, Sprng 2008 Chen Sh Department of Cvl, Archtectural, and Envronmental Engneerng The Unversty of Texas at Austn
More informationNPAR TESTS. One-Sample Chi-Square Test. Cell Specification. Observed Frequencies 1O i 6. Expected Frequencies 1EXP i 6
PAR TESTS If a WEIGHT varable s specfed, t s used to replcate a case as many tmes as ndcated by the weght value rounded to the nearest nteger. If the workspace requrements are exceeded and samplng has
More informationGRAVITY DATA VALIDATION AND OUTLIER DETECTION USING L 1 -NORM
GRAVITY DATA VALIDATION AND OUTLIER DETECTION USING L 1 -NORM BARRIOT Jean-Perre, SARRAILH Mchel BGI/CNES 18.av.E.Beln 31401 TOULOUSE Cedex 4 (France) Emal: jean-perre.barrot@cnes.fr 1/Introducton The
More informationTwo Faces of Intra-Industry Information Transfers: Evidence from Management Earnings and Revenue Forecasts
Two Faces of Intra-Industry Informaton Transfers: Evdence from Management Earnngs and Revenue Forecasts Yongtae Km Leavey School of Busness Santa Clara Unversty Santa Clara, CA 95053-0380 TEL: (408) 554-4667,
More informationLinear Circuits Analysis. Superposition, Thevenin /Norton Equivalent circuits
Lnear Crcuts Analyss. Superposton, Theenn /Norton Equalent crcuts So far we hae explored tmendependent (resste) elements that are also lnear. A tmendependent elements s one for whch we can plot an / cure.
More informationStatistical Methods to Develop Rating Models
Statstcal Methods to Develop Ratng Models [Evelyn Hayden and Danel Porath, Österrechsche Natonalbank and Unversty of Appled Scences at Manz] Source: The Basel II Rsk Parameters Estmaton, Valdaton, and
More informationThe Development of Web Log Mining Based on Improve-K-Means Clustering Analysis
The Development of Web Log Mnng Based on Improve-K-Means Clusterng Analyss TngZhong Wang * College of Informaton Technology, Luoyang Normal Unversty, Luoyang, 471022, Chna wangtngzhong2@sna.cn Abstract.
More informationDo Changes in Customer Satisfaction Lead to Changes in Sales Performance in Food Retailing?
Do Changes n Customer Satsfacton Lead to Changes n Sales Performance n Food Retalng? Mguel I. Gómez Research Assocate Food Industry Management Program Department of Appled Economcs and Management Cornell
More informationDamage detection in composite laminates using coin-tap method
Damage detecton n composte lamnates usng con-tap method S.J. Km Korea Aerospace Research Insttute, 45 Eoeun-Dong, Youseong-Gu, 35-333 Daejeon, Republc of Korea yaeln@kar.re.kr 45 The con-tap test has the
More informationAn Evaluation of the Extended Logistic, Simple Logistic, and Gompertz Models for Forecasting Short Lifecycle Products and Services
An Evaluaton of the Extended Logstc, Smple Logstc, and Gompertz Models for Forecastng Short Lfecycle Products and Servces Charles V. Trappey a,1, Hsn-yng Wu b a Professor (Management Scence), Natonal Chao
More information) of the Cell class is created containing information about events associated with the cell. Events are added to the Cell instance
Calbraton Method Instances of the Cell class (one nstance for each FMS cell) contan ADC raw data and methods assocated wth each partcular FMS cell. The calbraton method ncludes event selecton (Class Cell
More informationData Visualization by Pairwise Distortion Minimization
Communcatons n Statstcs, Theory and Methods 34 (6), 005 Data Vsualzaton by Parwse Dstorton Mnmzaton By Marc Sobel, and Longn Jan Lateck* Department of Statstcs and Department of Computer and Informaton
More informationCalibration and Linear Regression Analysis: A Self-Guided Tutorial
Calbraton and Lnear Regresson Analyss: A Self-Guded Tutoral Part The Calbraton Curve, Correlaton Coeffcent and Confdence Lmts CHM314 Instrumental Analyss Department of Chemstry, Unversty of Toronto Dr.
More informationExtending Probabilistic Dynamic Epistemic Logic
Extendng Probablstc Dynamc Epstemc Logc Joshua Sack May 29, 2008 Probablty Space Defnton A probablty space s a tuple (S, A, µ), where 1 S s a set called the sample space. 2 A P(S) s a σ-algebra: a set
More informationA DATA MINING APPLICATION IN A STUDENT DATABASE
JOURNAL OF AERONAUTICS AND SPACE TECHNOLOGIES JULY 005 VOLUME NUMBER (53-57) A DATA MINING APPLICATION IN A STUDENT DATABASE Şenol Zafer ERDOĞAN Maltepe Ünversty Faculty of Engneerng Büyükbakkalköy-Istanbul
More informationModule 2 LOSSLESS IMAGE COMPRESSION SYSTEMS. Version 2 ECE IIT, Kharagpur
Module LOSSLESS IMAGE COMPRESSION SYSTEMS Lesson 3 Lossless Compresson: Huffman Codng Instructonal Objectves At the end of ths lesson, the students should be able to:. Defne and measure source entropy..
More informationPOLYSA: A Polynomial Algorithm for Non-binary Constraint Satisfaction Problems with and
POLYSA: A Polynomal Algorthm for Non-bnary Constrant Satsfacton Problems wth and Mguel A. Saldo, Federco Barber Dpto. Sstemas Informátcos y Computacón Unversdad Poltécnca de Valenca, Camno de Vera s/n
More informationMacro Factors and Volatility of Treasury Bond Returns
Macro Factors and Volatlty of Treasury Bond Returns Jngzh Huang Department of Fnance Smeal Colleage of Busness Pennsylvana State Unversty Unversty Park, PA 16802, U.S.A. Le Lu School of Fnance Shangha
More informationFactors Affecting Outsourcing for Information Technology Services in Rural Hospitals: Theory and Evidence
Factors Affectng Outsourcng for Informaton Technology Servces n Rural Hosptals: Theory and Evdence Bran E. Whtacre Department of Agrcultural Economcs Oklahoma State Unversty bran.whtacre@okstate.edu J.
More informationTraffic State Estimation in the Traffic Management Center of Berlin
Traffc State Estmaton n the Traffc Management Center of Berln Authors: Peter Vortsch, PTV AG, Stumpfstrasse, D-763 Karlsruhe, Germany phone ++49/72/965/35, emal peter.vortsch@ptv.de Peter Möhl, PTV AG,
More informationProperties of Indoor Received Signal Strength for WLAN Location Fingerprinting
Propertes of Indoor Receved Sgnal Strength for WLAN Locaton Fngerprntng Kamol Kaemarungs and Prashant Krshnamurthy Telecommuncatons Program, School of Informaton Scences, Unversty of Pttsburgh E-mal: kakst2,prashk@ptt.edu
More informationProperties of real networks: degree distribution
Propertes of real networks: degree dstrbuton Nodes wth small degrees are most frequent. The fracton of hghly connected nodes decreases, but s not zero. Look closer: use a logarthmc plot. 10 0 10-1 10 0
More informationSketching Sampled Data Streams
Sketchng Sampled Data Streams Florn Rusu, Aln Dobra CISE Department Unversty of Florda Ganesvlle, FL, USA frusu@cse.ufl.edu adobra@cse.ufl.edu Abstract Samplng s used as a unversal method to reduce the
More informationFeature selection for intrusion detection. Slobodan Petrović NISlab, Gjøvik University College
Feature selecton for ntruson detecton Slobodan Petrovć NISlab, Gjøvk Unversty College Contents The feature selecton problem Intruson detecton Traffc features relevant for IDS The CFS measure The mrmr measure
More informationReturns to Experience in Mozambique: A Nonparametric Regression Approach
Returns to Experence n Mozambque: A Nonparametrc Regresson Approach Joel Muzma Conference Paper nº 27 Conferênca Inaugural do IESE Desafos para a nvestgação socal e económca em Moçambque 19 de Setembro
More informationAnalysis of Premium Liabilities for Australian Lines of Business
Summary of Analyss of Premum Labltes for Australan Lnes of Busness Emly Tao Honours Research Paper, The Unversty of Melbourne Emly Tao Acknowledgements I am grateful to the Australan Prudental Regulaton
More informationImperial College London
F. Fang 1, C.C. Pan 1, I.M. Navon 2, M.D. Pggott 1, G.J. Gorman 1, P.A. Allson 1 and A.J.H. Goddard 1 1 Appled Modellng and Computaton Group Department of Earth Scence and Engneerng Imperal College London,
More informationAutomated information technology for ionosphere monitoring of low-orbit navigation satellite signals
Automated nformaton technology for onosphere montorng of low-orbt navgaton satellte sgnals Alexander Romanov, Sergey Trusov and Alexey Romanov Federal State Untary Enterprse Russan Insttute of Space Devce
More informationOn the Optimal Control of a Cascade of Hydro-Electric Power Stations
On the Optmal Control of a Cascade of Hydro-Electrc Power Statons M.C.M. Guedes a, A.F. Rbero a, G.V. Smrnov b and S. Vlela c a Department of Mathematcs, School of Scences, Unversty of Porto, Portugal;
More informationThe Choice of Direct Dealing or Electronic Brokerage in Foreign Exchange Trading
The Choce of Drect Dealng or Electronc Brokerage n Foregn Exchange Tradng Mchael Melvn & Ln Wen Arzona State Unversty Introducton Electronc Brokerage n Foregn Exchange Start from a base of zero n 1992
More informationINVESTIGATION OF VEHICULAR USERS FAIRNESS IN CDMA-HDR NETWORKS
21 22 September 2007, BULGARIA 119 Proceedngs of the Internatonal Conference on Informaton Technologes (InfoTech-2007) 21 st 22 nd September 2007, Bulgara vol. 2 INVESTIGATION OF VEHICULAR USERS FAIRNESS
More informationA statistical approach to determine Microbiologically Influenced Corrosion (MIC) Rates of underground gas pipelines.
A statstcal approach to determne Mcrobologcally Influenced Corroson (MIC) Rates of underground gas ppelnes. by Lech A. Grzelak A thess submtted to the Delft Unversty of Technology n conformty wth the requrements
More informationA Fast Incremental Spectral Clustering for Large Data Sets
2011 12th Internatonal Conference on Parallel and Dstrbuted Computng, Applcatons and Technologes A Fast Incremental Spectral Clusterng for Large Data Sets Tengteng Kong 1,YeTan 1, Hong Shen 1,2 1 School
More informationwhere the coordinates are related to those in the old frame as follows.
Chapter 2 - Cartesan Vectors and Tensors: Ther Algebra Defnton of a vector Examples of vectors Scalar multplcaton Addton of vectors coplanar vectors Unt vectors A bass of non-coplanar vectors Scalar product
More informationRisk Model of Long-Term Production Scheduling in Open Pit Gold Mining
Rsk Model of Long-Term Producton Schedulng n Open Pt Gold Mnng R Halatchev 1 and P Lever 2 ABSTRACT Open pt gold mnng s an mportant sector of the Australan mnng ndustry. It uses large amounts of nvestments,
More informationSelf-Consistent Proteomic Field Theory of Stochastic Gene Switches
88 Bophyscal Journal Volume 88 February 005 88 850 Self-Consstent Proteomc Feld Theory of Stochastc Gene Swtches Aleksandra M. Walczak,* Masak Sasa, y and Peter G. Wolynes* z *Department of Physcs, Center
More informationImplementation of Deutsch's Algorithm Using Mathcad
Implementaton of Deutsch's Algorthm Usng Mathcad Frank Roux The followng s a Mathcad mplementaton of Davd Deutsch's quantum computer prototype as presented on pages - n "Machnes, Logc and Quantum Physcs"
More informationHÜCKEL MOLECULAR ORBITAL THEORY
1 HÜCKEL MOLECULAR ORBITAL THEORY In general, the vast maorty polyatomc molecules can be thought of as consstng of a collecton of two electron bonds between pars of atoms. So the qualtatve pcture of σ
More informationPerformance Analysis and Coding Strategy of ECOC SVMs
Internatonal Journal of Grd and Dstrbuted Computng Vol.7, No. (04), pp.67-76 http://dx.do.org/0.457/jgdc.04.7..07 Performance Analyss and Codng Strategy of ECOC SVMs Zhgang Yan, and Yuanxuan Yang, School
More information1 Example 1: Axis-aligned rectangles
COS 511: Theoretcal Machne Learnng Lecturer: Rob Schapre Lecture # 6 Scrbe: Aaron Schld February 21, 2013 Last class, we dscussed an analogue for Occam s Razor for nfnte hypothess spaces that, n conjuncton
More informationCopulas. Modeling dependencies in Financial Risk Management. BMI Master Thesis
Copulas Modelng dependences n Fnancal Rsk Management BMI Master Thess Modelng dependences n fnancal rsk management Modelng dependences n fnancal rsk management 3 Preface Ths paper has been wrtten as part
More informationNEURO-FUZZY INFERENCE SYSTEM FOR E-COMMERCE WEBSITE EVALUATION
NEURO-FUZZY INFERENE SYSTEM FOR E-OMMERE WEBSITE EVALUATION Huan Lu, School of Software, Harbn Unversty of Scence and Technology, Harbn, hna Faculty of Appled Mathematcs and omputer Scence, Belarusan State
More informationAbstract. 260 Business Intelligence Journal July IDENTIFICATION OF DEMAND THROUGH STATISTICAL DISTRIBUTION MODELING FOR IMPROVED DEMAND FORECASTING
260 Busness Intellgence Journal July IDENTIFICATION OF DEMAND THROUGH STATISTICAL DISTRIBUTION MODELING FOR IMPROVED DEMAND FORECASTING Murphy Choy Mchelle L.F. Cheong School of Informaton Systems, Sngapore
More informationSIMPLE LINEAR CORRELATION
SIMPLE LINEAR CORRELATION Smple lnear correlaton s a measure of the degree to whch two varables vary together, or a measure of the ntensty of the assocaton between two varables. Correlaton often s abused.
More informationChapter 8 Group-based Lending and Adverse Selection: A Study on Risk Behavior and Group Formation 1
Chapter 8 Group-based Lendng and Adverse Selecton: A Study on Rsk Behavor and Group Formaton 1 8.1 Introducton Ths chapter deals wth group formaton and the adverse selecton problem. In several theoretcal
More informationIs There A Tradeoff between Employer-Provided Health Insurance and Wages?
Is There A Tradeoff between Employer-Provded Health Insurance and Wages? Lye Zhu, Southern Methodst Unversty October 2005 Abstract Though most of the lterature n health nsurance and the labor market assumes
More informationEfficient Project Portfolio as a tool for Enterprise Risk Management
Effcent Proect Portfolo as a tool for Enterprse Rsk Management Valentn O. Nkonov Ural State Techncal Unversty Growth Traectory Consultng Company January 5, 27 Effcent Proect Portfolo as a tool for Enterprse
More informationCovariate-based pricing of automobile insurance
Insurance Markets and Companes: Analyses and Actuaral Computatons, Volume 1, Issue 2, 2010 José Antono Ordaz (Span), María del Carmen Melgar (Span) Covarate-based prcng of automoble nsurance Abstract Ths
More informationBERNSTEIN POLYNOMIALS
On-Lne Geometrc Modelng Notes BERNSTEIN POLYNOMIALS Kenneth I. Joy Vsualzaton and Graphcs Research Group Department of Computer Scence Unversty of Calforna, Davs Overvew Polynomals are ncredbly useful
More informationTHE APPLICATION OF DATA MINING TECHNIQUES AND MULTIPLE CLASSIFIERS TO MARKETING DECISION
Internatonal Journal of Electronc Busness Management, Vol. 3, No. 4, pp. 30-30 (2005) 30 THE APPLICATION OF DATA MINING TECHNIQUES AND MULTIPLE CLASSIFIERS TO MARKETING DECISION Yu-Mn Chang *, Yu-Cheh
More informationManagement Quality and Equity Issue Characteristics: A Comparison of SEOs and IPOs
Management Qualty and Equty Issue Characterstcs: A Comparson of SEOs and IPOs Thomas J. Chemmanur * Imants Paegls ** and Karen Smonyan *** Current verson: November 2009 (Accepted, Fnancal Management, February
More informationA Probabilistic Theory of Coherence
A Probablstc Theory of Coherence BRANDEN FITELSON. The Coherence Measure C Let E be a set of n propostons E,..., E n. We seek a probablstc measure C(E) of the degree of coherence of E. Intutvely, we want
More informationL10: Linear discriminants analysis
L0: Lnear dscrmnants analyss Lnear dscrmnant analyss, two classes Lnear dscrmnant analyss, C classes LDA vs. PCA Lmtatons of LDA Varants of LDA Other dmensonalty reducton methods CSCE 666 Pattern Analyss
More informationMicroarray data normalization and transformation
revew Mcroarray data normalzaton and transformaton John Quackenbush do:38/ng3 Nature Publshng Group http://wwwnaturecom/naturegenetcs Underlyng every mcroarray experment s an expermental queston that one
More informationPSYCHOLOGICAL RESEARCH (PYC 304-C) Lecture 12
14 The Ch-squared dstrbuton PSYCHOLOGICAL RESEARCH (PYC 304-C) Lecture 1 If a normal varable X, havng mean µ and varance σ, s standardsed, the new varable Z has a mean 0 and varance 1. When ths standardsed
More informationA Secure Password-Authenticated Key Agreement Using Smart Cards
A Secure Password-Authentcated Key Agreement Usng Smart Cards Ka Chan 1, Wen-Chung Kuo 2 and Jn-Chou Cheng 3 1 Department of Computer and Informaton Scence, R.O.C. Mltary Academy, Kaohsung 83059, Tawan,
More informationMultiple-Period Attribution: Residuals and Compounding
Multple-Perod Attrbuton: Resduals and Compoundng Our revewer gave these authors full marks for dealng wth an ssue that performance measurers and vendors often regard as propretary nformaton. In 1994, Dens
More informationProduct-Form Stationary Distributions for Deficiency Zero Chemical Reaction Networks
Bulletn of Mathematcal Bology (21 DOI 1.17/s11538-1-9517-4 ORIGINAL ARTICLE Product-Form Statonary Dstrbutons for Defcency Zero Chemcal Reacton Networks Davd F. Anderson, Gheorghe Cracun, Thomas G. Kurtz
More informationManagement Quality, Financial and Investment Policies, and. Asymmetric Information
Management Qualty, Fnancal and Investment Polces, and Asymmetrc Informaton Thomas J. Chemmanur * Imants Paegls ** and Karen Smonyan *** Current verson: December 2007 * Professor of Fnance, Carroll School
More informationMarginal Benefit Incidence Analysis Using a Single Cross-section of Data. Mohamed Ihsan Ajwad and Quentin Wodon 1. World Bank.
Margnal Beneft Incdence Analyss Usng a Sngle Cross-secton of Data Mohamed Ihsan Ajwad and uentn Wodon World Bank August 200 Abstract In a recent paper, Lanjouw and Ravallon proposed an attractve and smple
More informationIT09 - Identity Management Policy
IT09 - Identty Management Polcy Introducton 1 The Unersty needs to manage dentty accounts for all users of the Unersty s electronc systems and ensure that users hae an approprate leel of access to these
More information"Research Note" APPLICATION OF CHARGE SIMULATION METHOD TO ELECTRIC FIELD CALCULATION IN THE POWER CABLES *
Iranan Journal of Scence & Technology, Transacton B, Engneerng, ol. 30, No. B6, 789-794 rnted n The Islamc Republc of Iran, 006 Shraz Unversty "Research Note" ALICATION OF CHARGE SIMULATION METHOD TO ELECTRIC
More informationEnterprise Master Patient Index
Enterprse Master Patent Index Healthcare data are captured n many dfferent settngs such as hosptals, clncs, labs, and physcan offces. Accordng to a report by the CDC, patents n the Unted States made an
More informationHow To Analyze The Flow Patterns Of A Fracture Network
Flud Flow Complexty n Fracture Networks: Analyss wth Graph Theory and LBM Ghaffar, H.O. Department of Cvl Engneerng and Lassonde Insttute, Unversty of Toronto, Toronto, Canada Nasser, M.H.B. Department
More informationStaff Paper. Farm Savings Accounts: Examining Income Variability, Eligibility, and Benefits. Brent Gloy, Eddy LaDue, and Charles Cuykendall
SP 2005-02 August 2005 Staff Paper Department of Appled Economcs and Management Cornell Unversty, Ithaca, New York 14853-7801 USA Farm Savngs Accounts: Examnng Income Varablty, Elgblty, and Benefts Brent
More informationThe impact of hard discount control mechanism on the discount volatility of UK closed-end funds
Investment Management and Fnancal Innovatons, Volume 10, Issue 3, 2013 Ahmed F. Salhn (Egypt) The mpact of hard dscount control mechansm on the dscount volatlty of UK closed-end funds Abstract The mpact
More informationDescription of the Force Method Procedure. Indeterminate Analysis Force Method 1. Force Method con t. Force Method con t
Indeternate Analyss Force Method The force (flexblty) ethod expresses the relatonshps between dsplaceents and forces that exst n a structure. Prary objectve of the force ethod s to deterne the chosen set
More informationWhen do data mining results violate privacy? Individual Privacy: Protect the record
When do data mnng results volate prvacy? Chrs Clfton March 17, 2004 Ths s jont work wth Jashun Jn and Murat Kantarcıoğlu Indvdual Prvacy: Protect the record Indvdual tem n database must not be dsclosed
More informationThe OC Curve of Attribute Acceptance Plans
The OC Curve of Attrbute Acceptance Plans The Operatng Characterstc (OC) curve descrbes the probablty of acceptng a lot as a functon of the lot s qualty. Fgure 1 shows a typcal OC Curve. 10 8 6 4 1 3 4
More informationEvaluating the generalizability of an RCT using electronic health records data
Evaluatng the generalzablty of an RCT usng electronc health records data 3 nterestng questons Is our RCT representatve? How can we generalze RCT results? Can we use EHR* data as a control group? *) Electronc
More informationOn-Line Fault Detection in Wind Turbine Transmission System using Adaptive Filter and Robust Statistical Features
On-Lne Fault Detecton n Wnd Turbne Transmsson System usng Adaptve Flter and Robust Statstcal Features Ruoyu L Remote Dagnostcs Center SKF USA Inc. 3443 N. Sam Houston Pkwy., Houston TX 77086 Emal: ruoyu.l@skf.com
More informationTime Domain simulation of PD Propagation in XLPE Cables Considering Frequency Dependent Parameters
Internatonal Journal of Smart Grd and Clean Energy Tme Doman smulaton of PD Propagaton n XLPE Cables Consderng Frequency Dependent Parameters We Zhang a, Jan He b, Ln Tan b, Xuejun Lv b, Hong-Je L a *
More informationADVERSE SELECTION IN INSURANCE MARKETS: POLICYHOLDER EVIDENCE FROM THE U.K. ANNUITY MARKET *
ADVERSE SELECTION IN INSURANCE MARKETS: POLICYHOLDER EVIDENCE FROM THE U.K. ANNUITY MARKET * Amy Fnkelsten Harvard Unversty and NBER James Poterba MIT and NBER * We are grateful to Jeffrey Brown, Perre-Andre
More informationBayesian Network Based Causal Relationship Identification and Funding Success Prediction in P2P Lending
Proceedngs of 2012 4th Internatonal Conference on Machne Learnng and Computng IPCSIT vol. 25 (2012) (2012) IACSIT Press, Sngapore Bayesan Network Based Causal Relatonshp Identfcaton and Fundng Success
More information