Statistics & downloads help

Statistics & files

The Statistics & downloads page contains tables that break down the number of approved symbol reports in the database by locus group and locus type. The tables also contain icons that let users download the data in text (tsv) or JSON format.

Above the tables, two drop-down menus let you select a specific species and chromosome. Selecting a chromosome updates the table statistics to show only the data for that chromosome.

Beneath the tables, we also provide text (tsv) and JSON files containing the complete VGNC dataset.
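As a rough illustration of working with the text (tsv) download, the sketch below reads tab-separated rows into dictionaries keyed by field name. It assumes the file starts with a header row naming the fields described in the next section. The sample rows and all of their values are invented for the example and stand in for a downloaded file.

```python
import csv
import io

# Invented sample standing in for a downloaded text (tsv) file.
# Assumption: the first row is a header naming the fields.
SAMPLE_TSV = (
    "vgnc_id\tsymbol\tname\tstatus\n"
    "VGNC:1\tGENE1\texample gene 1\tApproved\n"
    "VGNC:2\tGENE2\texample gene 2\tEntry Withdrawn\n"
)

def read_records(handle):
    """Return one dict per data row, keyed by the header field names."""
    return list(csv.DictReader(handle, delimiter="\t"))

records = read_records(io.StringIO(SAMPLE_TSV))
for record in records:
    print(record["vgnc_id"], record["symbol"], record["status"])
```

For a real download, the same function could be given a file opened with `open(path, newline="", encoding="utf-8")`, where `path` is wherever you saved the file.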

Fields within the text (tsv) and JSON files

vgnc_id

VGNC ID. A unique ID created by the VGNC for every approved symbol.

symbol

The VGNC approved gene symbol. Equates to the "APPROVED SYMBOL" field within the gene symbol report.

name

VGNC approved name for the gene. Equates to the "APPROVED NAME" field within the gene symbol report.

locus_group

A group name for a set of related locus types as defined by the HGNC (e.g. non-coding RNA).

locus_type

The locus type as defined by the HGNC (e.g. RNA, transfer).

status

Status of the symbol report, which can be either "Approved" or "Entry Withdrawn".
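The three fields above are enough to approximate the kind of breakdown shown in the statistics tables. The following is a minimal sketch, assuming records have already been loaded as dictionaries keyed by field name. The sample records are invented, and `count_approved` is a hypothetical helper, not part of any VGNC tool.

```python
from collections import Counter

# Invented records; only the three fields used here are shown.
records = [
    {"locus_group": "protein-coding gene",
     "locus_type": "gene with protein product",
     "status": "Approved"},
    {"locus_group": "non-coding RNA",
     "locus_type": "RNA, transfer",
     "status": "Approved"},
    {"locus_group": "non-coding RNA",
     "locus_type": "RNA, transfer",
     "status": "Entry Withdrawn"},
]

def count_approved(records):
    """Count approved records per (locus_group, locus_type) pair."""
    return Counter(
        (r["locus_group"], r["locus_type"])
        for r in records
        if r["status"] == "Approved"
    )

counts = count_approved(records)
for (group, locus_type), n in sorted(counts.items()):
    print(f"{group}\t{locus_type}\t{n}")
```

Withdrawn entries are skipped because the tables on the page count approved symbol reports.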

location

Cytogenetic location of the gene (e.g. 2q34).

location_sortable

Same as "location", but single-digit chromosomes are prefixed with a 0 so that they sort in correct numerical order (e.g. 02q34).
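The zero-padding described above can be sketched as follows. This is an illustration of the idea rather than the code used to generate the field. `sortable_location` is a hypothetical helper, the sample locations are invented, and leaving non-numeric chromosomes such as X unchanged is an assumption.

```python
import re

def sortable_location(location):
    """Prefix a single-digit chromosome number with 0 (e.g. 2q34 -> 02q34)."""
    # A leading digit that is not followed by another digit marks a
    # single-digit chromosome; any other value is returned unchanged.
    return re.sub(r"^(\d)(?!\d)", r"0\1", location)

locations = ["10q21", "2q34", "Xp11", "1p36"]

# A plain string sort puts chromosome 10 before chromosomes 1 and 2.
plain = sorted(locations)

# Sorting on the padded form restores numerical chromosome order.
padded = sorted(locations, key=sortable_location)

print(plain)
print(padded)
```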

alias_symbol

Other symbols used to refer to this gene, as seen in the "ALIAS SYMBOLS" field in the gene symbol report.

alias_name

Other names used to refer to this gene, as seen in the "ALIAS NAMES" field in the gene symbol report.

prev_symbol

Symbols previously approved by the VGNC for this gene. Equates to the "PREVIOUS SYMBOLS" field within the gene symbol report.

prev_name

Gene names previously approved by the VGNC for this gene. Equates to the "PREVIOUS NAMES" field within the gene symbol report.

gene_family

Name given to a gene family or group the gene has been assigned to. Equates to the "GENE FAMILY" field within the gene symbol report.

gene_family_id

ID used to designate a gene family or group the gene has been assigned to.

date_approved_reserved

The date the entry was first approved.

date_symbol_changed

The date the gene symbol was last changed.

date_name_changed

The date the gene name was last changed.

date_modified

Date the entry was last modified.

entrez_id

NCBI Gene ID. Found within the "GENE RESOURCES" section of the gene symbol report.

ensembl_gene_id

Ensembl gene ID. Found within the "GENE RESOURCES" section of the gene symbol report.
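The external IDs make it possible to look up a VGNC record starting from another resource's identifier. The sketch below builds such a lookup, assuming records are dictionaries keyed by field name and that a missing ID appears as an empty string. Both are assumptions, `index_by` is a hypothetical helper, and every ID value shown is a placeholder rather than a real identifier.

```python
# Invented records; the ID values are placeholders, not real identifiers.
records = [
    {"vgnc_id": "VGNC:1", "symbol": "GENE1",
     "entrez_id": "1001", "ensembl_gene_id": "ENS_PLACEHOLDER_1"},
    {"vgnc_id": "VGNC:2", "symbol": "GENE2",
     "entrez_id": "", "ensembl_gene_id": "ENS_PLACEHOLDER_2"},
]

def index_by(records, field):
    """Map each non-empty value of `field` to the record that carries it."""
    return {r[field]: r for r in records if r.get(field)}

by_entrez = index_by(records, "entrez_id")
by_ensembl = index_by(records, "ensembl_gene_id")

print(by_entrez["1001"]["vgnc_id"])
print(by_ensembl["ENS_PLACEHOLDER_2"]["symbol"])
```

Records without a value for the chosen field are left out of the lookup instead of being stored under an empty key.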