TherMos: Estimating protein-DNA binding energies from in vivo binding profiles

TherMos: Estimating protein-DNA binding energies from in vivo binding profiles
Title:
TherMos: Estimating protein-DNA binding energies from in vivo binding profiles
Other Titles:
Nucleic Acids Research
Publication Date:
16 April 2013
Citation:
Wenjie Sun, Xiaoming Hu, Michael H. K. Lim, Calista K. L. Ng, Siew Hua Choo, Diogo S. Castro, Daniela Drechsel, François Guillemot, Prasanna R. Kolatkar, Ralf Jauch, and Shyam Prabhakar TherMos: Estimating protein–DNA binding energies from in vivo binding profiles Nucl. Acids Res. (2013) 41 (11): 5555-5568 first published online April 16, 2013 doi:10.1093/nar/gkt250
Abstract:
Accurately characterizing transcription factor (TF)-DNA affinity is a central goal of regulatory genomics. Although thermodynamics provides the most natural language for describing the continuous range of TF-DNA affinity, traditional motif discovery algorithms focus instead on classification paradigms that aim to discriminate ‘bound’ and ‘unbound’ sequences. Moreover, these algorithms do not directly model the distribution of tags in ChIP-seq data. Here, we present a new algorithm named Thermodynamic Modeling of ChIP-seq (TherMos), which directly estimates a position-specific binding energy matrix (PSEM) from ChIP-seq/exo tag profiles. In cross-validation tests on seven genome-wide TF-DNA binding profiles, one of which we generated via ChIP-seq on a complex developing tissue, TherMos predicted quantitative TF-DNA binding with greater accuracy than five well-known algorithms. We experimentally validated TherMos binding energy models for Klf4 and Esrrb, using a novel protocol to measure PSEMs in vitro. Strikingly, our measurements revealed strong non-additivity at multiple positions within the two PSEMs. Among the algorithms tested, only TherMos was able to model the entire binding energy landscape of Klf4 and Esrrb. Our study reveals new insights into the energetics of TF-DNA binding in vivo and provides an accurate first-principles approach to binding energy inference from ChIP-seq and ChIP-exo data.
License type:
http://creativecommons.org/licenses/by-nc/4.0/
Funding Info:
Joint Council Office of the Agency for Science, Technology and Research, Singapore [JCOAG03_FG02_2009]; Funding for open access charge: The Agency for Science, Technology and Research, Singapore.
Description:
ISSN:
0305-1048
Files uploaded: