This function retrieves cytogenetic bands. If no cytogenetic information is available for the queried species then it will be omitted from in the returned value.
get_cytogenetic_bands( species_name = "homo_sapiens", verbose = FALSE, warnings = TRUE, progress_bar = TRUE )
species_name | The species name, i.e., the scientific name, all letters
lowercase and space replaced by underscore. Examples: |
---|---|
verbose | Whether to be chatty. |
warnings | Whether to print warnings. |
progress_bar | Whether to show a progress bar. |
A tibble
, each row being a cytogenetic band,
of 8 variables:
species_name
Ensembl species name: this is the name used internally
by Ensembl to uniquely identify a species by name. It is the scientific
name but formatted without capitalisation and spacing converted with an
underscore, e.g., 'homo_sapiens'
.
assembly_name
Assembly name.
cytogenetic_band
Name of the cytogenetic_band.
chromosome
Chromosome name.
start
Genomic start position of the cytogenetic band. Starts at 1.
end
Genomic end position of the cytogenetic band. End position is included in the band interval.
stain
Giemsa
stain results: Giemsa negative, 'gneg'
; Giemsa positive, of
increasing intensities, 'gpos25'
, 'gpos50'
, 'gpos75'
,
and 'gpos100'
; centromeric region, 'acen'
; heterochromatin,
either pericentric or telomeric, 'gvar'
; and short arm of
acrocentric chromosomes are coded as 'stalk'
.
strand
Strand.
# Get toplevel sequences for the human genome (default) get_cytogenetic_bands()#> # A tibble: 862 × 8 #> species_name assembly_name cytogenetic_band chromosome start end stain #> <chr> <chr> <chr> <chr> <int> <int> <chr> #> 1 homo_sapiens GRCh38 p11.1 Y 10300001 1.04e7 acen #> 2 homo_sapiens GRCh38 p11.2 Y 600001 1.03e7 gneg #> 3 homo_sapiens GRCh38 p11.31 Y 300001 6 e5 gpos… #> 4 homo_sapiens GRCh38 p11.32 Y 1 3 e5 gneg #> 5 homo_sapiens GRCh38 q11.1 Y 10400001 1.06e7 acen #> 6 homo_sapiens GRCh38 q11.21 Y 10600001 1.24e7 gneg #> 7 homo_sapiens GRCh38 q11.221 Y 12400001 1.71e7 gpos… #> 8 homo_sapiens GRCh38 q11.222 Y 17100001 1.96e7 gneg #> 9 homo_sapiens GRCh38 q11.223 Y 19600001 2.38e7 gpos… #> 10 homo_sapiens GRCh38 q11.23 Y 23800001 2.66e7 gneg #> # … with 852 more rows, and 1 more variable: strand <int># Get toplevel sequences for Mus musculus get_cytogenetic_bands('mus_musculus')#> # A tibble: 82 × 8 #> species_name assembly_name cytogenetic_band chromosome start end stain #> <chr> <chr> <chr> <chr> <int> <int> <chr> #> 1 mus_musculus GRCm39 "" MT 1 1.63e4 gneg #> 2 mus_musculus GRCm39 "" Y 1 9.15e7 gneg #> 3 mus_musculus GRCm39 "cen" 2 110001 1.56e6 acen #> 4 mus_musculus GRCm39 "cen" 2 1555001 3 e6 acen #> 5 mus_musculus GRCm39 "p" 2 1 1.1 e5 gneg #> 6 mus_musculus GRCm39 "q" 2 3000001 1.82e8 gneg #> 7 mus_musculus GRCm39 "cen" 4 110001 1.56e6 acen #> 8 mus_musculus GRCm39 "cen" 4 1555001 3 e6 acen #> 9 mus_musculus GRCm39 "p" 4 1 1.1 e5 gneg #> 10 mus_musculus GRCm39 "q" 4 3000001 1.57e8 gneg #> # … with 72 more rows, and 1 more variable: strand <int>