assessment of algorithms for inferring positional weight matrix motifs of transcription factor binding sites using protein binding microarray data评估算法推断位置权重矩阵图案的转录因子结合位点使用蛋白质绑定微阵列数据.pdf
文本预览下载声明
Assessment of Algorithms for Inferring Positional Weight
Matrix Motifs of Transcription Factor Binding Sites Using
Protein Binding Microarray Data
Yaron Orenstein, Chaim Linhart, Ron Shamir*
Blavatnik School of Computer Science, Tel-Aviv University, Tel-Aviv, Israel
Abstract
The new technology of protein binding microarrays (PBMs) allows simultaneous measurement of the binding intensities of a
transcription factor to tens of thousands of synthetic double-stranded DNA probes, covering all possible 10-mers. A key
computational challenge is inferring the binding motif from these data. We present a systematic comparison of four
methods developed specifically for reconstructing a binding site motif represented as a positional weight matrix from PBM
data. The reconstructed motifs were evaluated in terms of three criteria: concordance with reference motifs from the
literature and ability to predict in vivo and in vitro bindings. The evaluation encompassed over 200 transcription factors and
some 300 assays. The results show a tradeoff between how the methods perform according to the different criteria, and a
dichotomy of method types. Algorithms that construct motifs with low information content predict PBM probe ranking
more faithfully, while methods that produce highly informative motifs match reference motifs better. Interestingly, in
predicting high-affinity binding, all methods give far poorer results for in vivo assays compared to in vitro assays.
Citation: Orenstein Y, Linhart C, Shamir R (2012) Assessment of Algorithms for Inferring Positional Weight Matrix Motifs of Transcription Factor Binding Sites
Using Protein Binding Microarray Data. PLoS ONE 7(9): e46145. doi:10.1371/journal.pone.0046145
Editor: Ying Xu, University of Georgia, United States of America
Received July 23, 2012; Accepted August 27, 2012; Published Sep
显示全部