Skip to contents

Filter the HGNC data set by a keyword to be looked up in the columns containing gene names or symbols. By default, it will look up in symbol, name, alias_symbol, alias_name, prev_symbol and prev_name. Note that this function in dive into list-columns and match return a hit result if at least one of the strings matches the keyword.

Usage

filter_by_keyword(
  tbl,
  keyword,
  cols = c("symbol", "name", "alias_symbol", "alias_name", "prev_symbol", "prev_name")
)

Arguments

tbl

A tibble containing the HGNC data set, typically obtained with import_hgnc_dataset().

keyword

A keyword or a regular expression to be used as search criterion.

cols

Columns to be looked up.

Value

A tibble of the HGNC data set filtered by observations matching the keyword.

Examples

if (FALSE) {
# Start by retrieving the HGNC data set
hgnc_tbl <- import_hgnc_dataset()

# Search for entries containing "TP53" in the HGNC data set
hgnc_tbl %>%
  filter_by_keyword('TP53') %>%
  dplyr::select(1:4)

# The same as above but restrict the search to the `symbol` column
hgnc_tbl %>%
  filter_by_keyword('TP53', cols = 'symbol') %>%
  dplyr::select(1:4)

# Match "TP53" exactly in the `symbol` column
hgnc_tbl %>%
  filter_by_keyword('^TP53$', cols = 'symbol') %>%
  dplyr::select(1:4)

# `filter_by_keyword()` is vectorised over `keyword`
hgnc_tbl %>%
  filter_by_keyword(c('^TP53$', '^PIK3CA$'), cols = 'symbol') %>%
  dplyr::select(1:4)
}