Promoter directionality is controlled by U1 snRNP and polyadenylation signals
Abstract
Transcription of the mammalian genome is pervasive, but productive transcription outside of protein-coding genes is limited by unknown mechanisms1 . In particular, although RNA polymerase II (RNAPII) initiates divergently from most active gene promoters, productive elongation occurs primarily in the sense-coding direction2–4.Here we showin mouse embryonic stem cells that asymmetric sequence determinants flanking gene transcription start sites control promoter directionality by regulating promoter-proximal cleavage and polyadenylation. We find that upstream antisense RNAs are cleaved and polyadenylated at poly(A) sites (PASs) shortly after initiation. De novo motif analysis shows PAS signals and U1 small nuclear ribonucleoprotein (snRNP) recognition sites to be the most depleted and enriched sequences, respectively, in the sense direction relative to the upstream antisense direction. These U1 snRNP sites and PAS sites are progressively gained and lost, respectively, at the 59 end of coding genes during vertebrate evolution. Functional disruption of U1 snRNP activity results in a dramatic increase in promoter-proximal cleavage events in the sense direction with slight increases in the antisense direction. These data suggest that a U1–PAS axis characterized by low U1 snRNP recognition and a high density of PASs in the upstream antisense region reinforces promoter directionality by promoting early termination in upstream antisense regions, whereas proximal sense PAS signals are suppressed by U1 snRNP. We propose that the U1–PAS axis limits pervasive transcription throughout the genome.