EMBL-EBI, the home for big data in biology, provides freely available data from life science experiments, performs basic research in computational biology and offers an extensive user training programme, supporting researchers in academia and industry.
The HGNC is responsible for approving unique symbols and names for human loci, including protein coding genes, ncRNA genes and pseudogenes, to allow unambiguous scientific communication. This database is searchable, and provides information about the function of a gene, and previous nomenclature.
NCBI provides a number of resources. Such as AceView which provides a curated, comprehensive and non-redundant sequence representation of all public mRNA sequences, as well as ClinVar which collects submissions from clinical testing labs, researchers, locus-specific databases, expert panels, and professional societies. Others include dbVar and dbSNP which are databases of genomic and SNP structural variation that allows users to search, view, and download structural variation data from submitted studies on several species. There is also GeneBank which is a collection of annotated genetic sequences, and Gene Expression Omnibus (GEO) which is a public functional genomics data repository. It also includes HapMap which helps researchers find genes associated with human disease and PubChem which is a database of chemical molecules and their activities against biological assays.