Conserved sequence
Conserved sequences are similar or identical sequences which occur in DNA, and cause sequences in RNA, proteins and carbohydrates.
These sequences occur across species. This shows that the sequences has been maintained in evolution despite speciation. The further back up the phylogenetic tree a particular conserved sequence occurs, the more highly conserved it is. Since sequence information is normally transmitted from parents to progeny by genes, a conserved sequence implies that there is a conserved gene.
Conservation of a sequence happens when mutations in a highly conserved region lead to non-viable life forms, that is, a form which is eliminated through natural selection. In other words, the product of the gene is vital to life, and its function is destroyed by almost all changes (mutations) to the sequence.
Conserved nucleic acid sequences
The basic theory, widely agreed, is that highly conserved DNA sequences must have functional value, though the role for many of these highly conserved non-coding DNA sequences is not known. One recent study that eliminated four highly conserved non-coding DNA sequences in mice yielded viable mice with no significant phenotypic differences; the authors described their findings as "unexpected".[1] So there is clearly something here which is not understood.
Many regions of the DNA, including highly conserved DNA sequences, consist of repeated sequence elements. If only one of a set of a repeated sequences was removed, and the repetitions were not needed, then no difference would be seen in the mice. The paper did not report whether the eliminated sequences were repeated sequences.
Conserved protein sequences and structures
Highly conserved proteins are often required for cells to work or divide. Conservation of protein sequences is shown by the presence of identical amino acid residues at analogous parts of proteins. Conservation of protein structures is indicated by the presence of functionally equivalent, though not necessarily identical, amino acid residues and structures between analogous parts of proteins.
Shown below is an amino acid sequence alignment between two human zinc finger proteins. Conserved amino acid sequences are marked by strings of [math]\displaystyle{ \mathrm{*} }[/math] on the third line of the sequence alignment. As can be seen from this alignment, these two proteins contain a number of conserved amino acid sequences (represented by identical letters aligned between the two sequences).
Comparative genomics
The research field which studies the evolution and function of multigene families is called comparative genomics.[2]
Conserved Sequence Media
A multiple sequence alignment of five mammalian histone H1 proteins *Sequences are the amino acids for residues 120-180 of the proteins. Residues that are conserved across all sequences are highlighted in grey.
A sequence logo for the LexA-binding motif of gram-positive bacteria. As the adenosine at position 5 is highly conserved, it appears larger than other characters.
References
- ↑ Ahituv N. Zhu Y. & Visel A. et al 2007. Deletion of ultraconserved elements yields viable mice (2007). "Deletion of Ultraconserved Elements Yields Viable Mice". PLOS Biol. 5 (9): e234. doi:10.1371/journal.pbio.0050234. PMC 1964772. PMID 17803355.
- ↑ Klug, William S. et al 2010. Concepts of genetics. 10th ed, Pearson, p600. ISBN 0-321-79578-4