This function retrieves cytogenetic bands. If no cytogenetic information is available for the queried species then it will be omitted from in the returned value.

get_cytogenetic_bands(
  species_name = "homo_sapiens",
  verbose = FALSE,
  warnings = TRUE,
  progress_bar = TRUE
)

Arguments

species_name

The species name, i.e., the scientific name, all letters lowercase and space replaced by underscore. Examples: 'homo_sapiens' (human), 'ovis_aries' (Domestic sheep) or 'capra_hircus' (Goat).

verbose

Whether to be chatty.

warnings

Whether to print warnings.

progress_bar

Whether to show a progress bar.

Value

A tibble, each row being a cytogenetic band, of 8 variables:

species_name

Ensembl species name: this is the name used internally by Ensembl to uniquely identify a species by name. It is the scientific name but formatted without capitalisation and spacing converted with an underscore, e.g., 'homo_sapiens'.

assembly_name

Assembly name.

cytogenetic_band

Name of the cytogenetic_band.

chromosome

Chromosome name.

start

Genomic start position of the cytogenetic band. Starts at 1.

end

Genomic end position of the cytogenetic band. End position is included in the band interval.

stain

Giemsa stain results: Giemsa negative, 'gneg'; Giemsa positive, of increasing intensities, 'gpos25', 'gpos50', 'gpos75', and 'gpos100'; centromeric region, 'acen'; heterochromatin, either pericentric or telomeric, 'gvar'; and short arm of acrocentric chromosomes are coded as 'stalk'.

strand

Strand.

Examples

# Get toplevel sequences for the human genome (default) get_cytogenetic_bands()
#> # A tibble: 862 × 8 #> species_name assembly_name cytogenetic_band chromosome start end stain #> <chr> <chr> <chr> <chr> <int> <int> <chr> #> 1 homo_sapiens GRCh38 p11.1 Y 10300001 1.04e7 acen #> 2 homo_sapiens GRCh38 p11.2 Y 600001 1.03e7 gneg #> 3 homo_sapiens GRCh38 p11.31 Y 300001 6 e5 gpos… #> 4 homo_sapiens GRCh38 p11.32 Y 1 3 e5 gneg #> 5 homo_sapiens GRCh38 q11.1 Y 10400001 1.06e7 acen #> 6 homo_sapiens GRCh38 q11.21 Y 10600001 1.24e7 gneg #> 7 homo_sapiens GRCh38 q11.221 Y 12400001 1.71e7 gpos… #> 8 homo_sapiens GRCh38 q11.222 Y 17100001 1.96e7 gneg #> 9 homo_sapiens GRCh38 q11.223 Y 19600001 2.38e7 gpos… #> 10 homo_sapiens GRCh38 q11.23 Y 23800001 2.66e7 gneg #> # … with 852 more rows, and 1 more variable: strand <int>
# Get toplevel sequences for Mus musculus get_cytogenetic_bands('mus_musculus')
#> # A tibble: 82 × 8 #> species_name assembly_name cytogenetic_band chromosome start end stain #> <chr> <chr> <chr> <chr> <int> <int> <chr> #> 1 mus_musculus GRCm39 "" MT 1 1.63e4 gneg #> 2 mus_musculus GRCm39 "" Y 1 9.15e7 gneg #> 3 mus_musculus GRCm39 "cen" 2 110001 1.56e6 acen #> 4 mus_musculus GRCm39 "cen" 2 1555001 3 e6 acen #> 5 mus_musculus GRCm39 "p" 2 1 1.1 e5 gneg #> 6 mus_musculus GRCm39 "q" 2 3000001 1.82e8 gneg #> 7 mus_musculus GRCm39 "cen" 4 110001 1.56e6 acen #> 8 mus_musculus GRCm39 "cen" 4 1555001 3 e6 acen #> 9 mus_musculus GRCm39 "p" 4 1 1.1 e5 gneg #> 10 mus_musculus GRCm39 "q" 4 3000001 1.57e8 gneg #> # … with 72 more rows, and 1 more variable: strand <int>