Promoter Analysis
(examination of the binding sites of Ahr1 using ChIP-seq data from Candida albicans)

(Version: April 2021)

Background


Candida albicans is a human pathogenic fungus which can cause severe infections in immunocompromised people. It can grow in different variations: as unicellular yeast, pseudohyphae and hyphae [1]. This morphological flexibility and especially the hyphal form are crucial for the virulence of the fungus [2, 3]. In recent years, several virulence factors of C. albicans have been identified. They are often closely connected with hyphal growth. This includes the transcription factor Ahr1 which is supposedly involved in the regulation of several virulence associated genes via binding their promoter region.

"Chromatin Immunoprecipitation sequencing" (ChIP-Seq)


Two C. albicans strains with a hyperactive Ahr1 were analyzed using ChIP-seq [4]. This procedure can determine protein-DNA-interactions. The protein is mixed with the DNA, the DNA is fragmented and all unbound DNA pieces are removed. Afterwards, the protein is washed off the DNA and the sequence fragments are analyzed using high throughput sequencing. The result of this analysis were 325 sequence fragments representing potential binding sites of Ahr1 in the C. albicans genome. By using the chromosomal position of those fragments, 532 genes on both DNA strands were identified that harbored the sequences in their promoter regions meaning they could be regulated by Ahr1.

Promoter Analysis


A region of 500 base pairs around the maximum of each peak of a sequence fragment (which is the most abundant location of this fragment in the high throughput sequencing and therefore the most probable binding site) was used as input for the online tool Meme-ChIP (v. 5.1.0) [5]. This way, a highly significant motif was found that is concurrent to an already known binding motif of Ahr1. For further confirmation, the software MochiView (v. 1.46) [6] was used. The integrated motif finder also detected the known Ahr1 motif, which is displayed in figure 1.

bindingmotive
Fig. 1: Binding motif of Ahr1.

MochiView can also be used for visualization of peaks, genes and motifs. It shows clearly (figures 2a and 2b) that Ahr1 binds within the promoter region of important virulence associated genes of C. albicans like ECE1 and ALS3. Frequently, there is more than one potential binding site present in one promoter region.

ECE1-Promotorbereiche ALS3-Promotorbereiche
Fig. 2: Visualization of the Ahr1 binding sites in the promoter regions of a) ECE1 and b) ALS3. The whole sequence region that was detected via ChIP-seq is visible as well as the 500 base pairs central region. Green arrows show the positions of the binding motif.

Altogether, the binding motif was detected in the promoters of 37 virulence relevant genes. The table below shows the motif with the highest MochiView score for each gene. In case there are two motifs with the same score, both are shown.

Tab. 1: Ahr1 binding motif in the promoters of 37 virulence associated genes, the distance of the motif to the transcription start site, the position on leading or lagging strand and the MochiView score.
Gene NameMotif Distance to Gene in bpMotif StrandMotif ScoreMotif Sequence
AHR15586-5.4GGCAACAATTACCGG
ALS11390-4.6GGAAACTTCAAACGA
ALS3340+4.5TGCAAGTTAAACCGA
ALS41649-3.4CACAAGTGTAAGCGA
BCR12178-2.8AGAAAGGAAAAGCGA
BRG15765+5.1TGCAAGAATTACCGA
CDR11337+4.7ATCAACTATTGCCGA
CZF14304-2.4TGCAGTGGTAACCGA
DCK1506+4.5GGAAAGTATAGTCGA
DEF11719-4.0GTCAACTTCTGACGA
DEF1495-4.0GGAAATTAGAAACGA
EAP11204+4.5GGGAAGTTCAAGCGA
ECE12721+4.8GGGAAGAATTACCGA
EFG12311+5.0TGCAACTACAACCGA
FLO82066-3.5GGCAAGAAGTAGAGA
HGC111647-4.3GGAAAGTGGTAGCGA
HGT23442+4.9TGCAACTATTCGCGA
HWP11225-3.9GGCAAGTTTATCCGC
HYR11938+4.8AGCAATATTAGGCGA
IHD1861+5.1TGCAACAATTACCGA
LMO1179+4.4AGAAATTTTTAGCGA
MDR1537-5.4GGTAACTATTGGCGA
NDT801567-5.1GGCAAGTTTAATCGA
RBT198-4.4GGTAAGATTTACCGG
RIM101664-5.1AGCAAGTAGAGCCGA
SAP4807-3.8AGCAATTTTAAGAGA
SAP51144+4.3GGCAATTTTAAGAGA
SAP6833-3.5GGTAATTTTAAGAGA
SFL16551-5.1GGAAACTATTACCGG
SFL22733+4.0AACAAGTAGAGCCGA
SOD5888+4.9GGCATCTTTTCCCGA
SOD51512+4.9GGAAAGTTGAAGCGA
STP2915+2.9TGCAAGACTTGCAGA
TEC15005-4.8GGCAAGTATAAGCTA
UME615928+4.2AACAACTTTAAACGA
WOR15648-5.0AGCAAGTATAGCCGT
WOR21834+5.0GGCATCAATTACCGA

This indicates that Ahr1 is involved in the regulation of a multitude of virulence associated processes like hyphae formation, cell invasion, iron acquisition or host cell damage. Since the promoter of Ahr1 also shows the binding motif, it may regulate itself via a feedback loop.

Homology and Synteny


Homologues of AHR1 (genes which have the same precursor gene), but also of ECE1 and ALS3 can be found in the genomes of C. dubliniensis and C. tropicalis. Both species are close relatives of C. albicans and pathogens. The AHR1 homologues show a very good alignment score of 992 (C. albicans AHR1 vs. C. dubliniensis Cd36_85930) and 914 (C. albicans AHR1 vs. C. tropicalis CTRG_02263) in T-Coffee (v. 11.0) [7]. ECE1 und its homologues show scores of 994 (C. albicans ECE1 vs. C. dubliniensis Cd36_43260) and 852 (C. albicans ECE1 vs. C. tropicalis CTRG_00476).

Also, the spatial order of the genes in the genome (synteny) in all three organisms is similar, as can be visualized in the Candida Gene Order Browser [8], while it is quite different to that in S. cerevisiae, the non-pathogenic fungal model organism.

CGOBrowser.jpg
Fig. 3: Synteny of AHR1 in C. albicans, C. dubliniensis, C. tropicalis and S. cerevisiae.

Therefore, the regulatory mechanisms described here could be conserved in those two species, too.

References


  1. [1] P. E. Sudbery: Growth of Candida albicans hyphae. In: Nat. Rev. Microbiol., 9:737–748, 2011. doi: 10.1038/nrmicro2636
  2. [2] K. Zakikhany, J. R. Naglik, A. Schmidt-Westhausen, G. Holland, M. Schaller, B. Hube: In vivo transcript profiling of Candida albicans identifies a gene essential for interepithelial dissemination. In: Cell Microbiol., 9:2938–2954, 2007. doi: 10.1111/j.1462-5822.2007.01009.x
  3. [3] A. L. Mavor, S. Thewes, B. Hube: Systemic fungal infections caused by Candida species: epidemiology, infection process and virulence attributes. In: Curr. Drug Targets, 6:863–874, 2005. doi: 10.2174/138945005774912735
  4. [4] S. Ruben, E. Garbe, S. Mogavero, D. Albrecht-Eckardt, D. Hellwig, A. Häder, T. Krüger, K. Gerth, I. D. Jacobsen, O. Elshafee, S. Brunke, K. Hünniger, O. Kniemeyer, A. A. Brakhage, J. Morschhäuser, B. Hube, S. Vylkova, O. Kurzai, R. Martin: Ahr1 and Tup1 contribute to the transcriptional control of virulence-associated genes in Candida albicans. In: mBio, 11:2, 2020.
    doi: 10.1128/mBio.00206-20
  5. [5] P. Machanick, T. L. Bailey: MEME-ChIP: motif analysis of large DNAdatasets. In: Bioinformatics, 27:1696–1697, 2011. doi: 10.1093/bioinformatics/btr189
  6. [6] O. R. Homann, A. D. Johnson: MochiView: versatile software for genome browsing and DNA motif analysis. In: BMC Biol., 8:49, 2010. doi: 10.1186/1741-7007-8-49
  7. [7] C. Notredame, D. G. Higgins, J. Heringa: T-Coffee: A novel method for fast and accurate multiple sequence alignment. In: J. Mol. Biol., 302(1):205–217, 2000. doi: 10.1006/jmbi.2000.4042
  8. [8] S. L. Maguire, S. S. ÓhÉigeartaigh, K. P. Byrne, M. S. Schröder, P. O'Gaora, K. H. Wolfe, G. Butler: Comparative genome analysis and gene finding in Candida species using CGOB. In: Mol. Biol. Evol., 30(6):1281–1291, 2013. doi: 10.1093/molbev/mst042