Description
This track shows the Repeat Masker annotations on the 26 Apr 2016 Mus musculus (house mouse)/GCA_001624185.1_129S1_SvImJ_v1 genome assembly.
This track was created by using Arian Smit's
RepeatMasker
program, which screens DNA sequences
for interspersed repeats and low complexity DNA sequences. The program
outputs a detailed annotation of the repeats that are present in the
query sequence (represented by this track), as well as a modified version
of the query sequence in which all the annotated repeats have been masked
(generally available on the
Downloads page). RepeatMasker uses the
Repbase Update library of repeats from the
Genetic
Information Research Institute (GIRI).
Repbase Update is described in Jurka (2000) in the References section below.
Display Conventions and Configuration
In full display mode, this track displays up to ten different classes of repeats:
- Short interspersed nuclear elements (SINE), which include ALUs
- Long interspersed nuclear elements (LINE)
- Long terminal repeat elements (LTR), which include retroposons
- DNA repeat elements (DNA)
- Simple repeats (micro-satellites)
- Low complexity repeats
- Satellite repeats
- RNA repeats (including RNA, tRNA, rRNA, snRNA, scRNA, srpRNA)
- Other repeats, which includes class RC (Rolling Circle)
- Unknown
The level of color shading in the graphical display reflects the amount of
base mismatch, base deletion, and base insertion associated with a repeat
element. The higher the combined number of these, the lighter the shading.
A "?" at the end of the "Family" or "Class" (for example, DNA?) signifies that
the curator was unsure of the classification. At some point in the future,
either the "?" will be removed or the classification will be changed.
Methods
UCSC has used the most current versions of the RepeatMasker software
and repeat libraries available to generate these data. Note that these
versions may be newer than those that are publicly available on the Internet.
Data are generated using the RepeatMasker -s flag. Additional flags
may be used for certain organisms. Repeats are soft-masked. Alignments may
extend through repeats, but are not permitted to initiate in them.
See the FAQ for more information.
Class profiles
- 1,476,998 - SINE
- 1,261,888 - Simple
- 959,292 - LINE
- 901,347 - LTR
- 157,697 - DNA
- 142,057 - Low_complexity
- 29,158 - Satellite
- 24,344 - Other
- 17,817 - RNA
Detail class profiles
- 1,476,926 - SINE
- 1,261,888 - Simple_repeat
- 959,292 - LINE
- 900,110 - LTR
- 155,826 - DNA
- 142,057 - Low_complexity
- 29,158 - Satellite
- 15,667 - Other
- 8,296 - Unknown
- 7,651 - scRNA
- 4,741 - tRNA
- 2,884 - snRNA
- 1,871 - DNA?
- 1,419 - rRNA
- 1,237 - LTR?
- 662 - RNA
- 460 - srpRNA
- 314 - RC
- 72 - SINE?
- 67 - RC?
Credits
Thanks to Arian Smit, Robert Hubley and GIRI for providing the tools and
repeat libraries used to generate this track.
References
Smit AFA, Hubley R, Green P. RepeatMasker Open-3.0.
http://www.repeatmasker.org. 1996-2010.
Repbase Update is described in:
Jurka J.
Repbase Update: a database and an electronic journal of repetitive elements.
Trends Genet. 2000 Sep;16(9):418-420.
PMID: 10973072
For a discussion of repeats in mammalian genomes, see:
Smit AF.
Interspersed repeats and other mementos of transposable elements in mammalian genomes.
Curr Opin Genet Dev. 1999 Dec;9(6):657-63.
PMID: 10607616
Smit AF.
The origin of interspersed repeats in the human genome.
Curr Opin Genet Dev. 1996 Dec;6(6):743-8.
PMID: 8994846
|
|