Abstract
The number of expressed sequences greatly surpasses the estimated number of protein-coding genes in mammalian genomes. An evolutionary approach reveals that only 9% to 14% of human-expressed and mouse-expressed sequences are able to code for proteins. Clustering of these sequences using cross-species relationships suggests that millions of expressed sequences may correspond to only approximately 20,000 distinct protein-coding transcripts.
| Original language | English (US) |
|---|---|
| Pages (from-to) | 1278-1282 |
| Number of pages | 5 |
| Journal | Molecular biology and evolution |
| Volume | 21 |
| Issue number | 7 |
| DOIs | |
| State | Published - Jul 2004 |
All Science Journal Classification (ASJC) codes
- Ecology, Evolution, Behavior and Systematics
- Molecular Biology
- Genetics