Mammals express thousands of long noncoding (lnc) RNAs, a few of which are known to function in tissue development. However, the entire repertoire of lncRNAs in most tissues and species is not defined. Indeed, most lncRNAs are not conserved, raising questions about function. We used RNA sequencing to identify 1109 polyadenylated lncRNAs expressed in erythroblasts, megakaryocytes, and megakaryocyte-erythroid precursors of mice, and 594 in erythroblasts of humans. More than half of these lncRNAs were unannotated, emphasizing the opportunity for new discovery through studies of specialized cell types. Analysis of the mouse erythro-megakaryocytic polyadenylated lncRNA transcriptome indicates that ∼75% arise from promoters and 25% from enhancers, many of which are regulated by key transcription factors including GATA1 and TAL1. Erythroid lncRNA expression is largely conserved among 8 different mouse strains, yet only 15% of mouse lncRNAs are expressed in humans and vice versa, reflecting dramatic species-specificity. RNA interference assays of 21 abundant erythroid-specific murine lncRNAs in primary mouse erythroid precursors identified 7 whose knockdown inhibited terminal erythroid maturation. At least 6 of these 7 functional lncRNAs have no detectable expression in human erythroblasts, suggesting that lack of conservation between mammalian species does not predict lack of function.
All Science Journal Classification (ASJC) codes
- Cell Biology