Background Correct flower formation requires highly specific temporal and spatial regulation of gene expression. In Arabidopsis thaliana the majority of the master regulators that determine flower organ identity belong to the MADS-domain transcription factor family. The canonical DNA binding motif for this transcription factor family is the CArG-box, which has the consensus CC(A/T)6GG. However, so far, a comprehensive analysis of MADS-domain binding patterns has not yet been performed. Results Eight publicly available ChIP-seq datasets of MADS-domain proteins that regulate the floral transition and flower formation were analyzed. Surprisingly, the preferred DNA binding motif of each protein was a CArG-box with an NAA extension. Furthermore, motifs of other transcription factors were found in the vicinity of binding sites of MADS-domain transcription factors, suggesting that interaction of MADS-domain proteins with other transcription factors is important for target gene regulation. Finally, conservation of CArG-boxes between Arabidopsis ecotypes was assessed to obtain information about their evolutionary importance. CArG-boxes that fully matched the consensus were more conserved than other CArG-boxes, suggesting that the perfect CArG-box is evolutionary more important than other CArG-box variants. Conclusion Our analysis provides detailed insight into MADS-domain protein binding patterns. The results underline the importance of an extended version of the CArG-box and provide a first view on evolutionary conservation of MADS-domain protein binding sites in Arabidopsis ecotypes.
- MADS-domain proteins
- transcription factor binding specifity
- sequence conservation