EMBL-EBI provides freely available data from life science experiments, performs basic research in computational biology and offers an extensive user training programme, supporting researchers in academia and industry.
Its resources include the European Genome-phenome Archive (EGA), which holds datasets from genomic studies, and the European Variation Archive (EVA), a database of genetic variation. Chemical Entities of Biological Interest (ChEBI) provides a list of molecular entities focused on ‘small’ chemical compounds. ArrayExpress is a database of functional genomics experiments that can be queried and whose data can be downloaded. EMBL-EBI also houses the IntAct database of molecular interaction data, as well as UniProt for protein sequence and functional information.
The modENCODE project continues the original ENCODE project, aiming to identify functional elements in the genomes of selected model organisms, specifically Drosophila melanogaster and Caenorhabditis elegans.
NCBI provides a number of resources, such as AceView, which provides a curated, comprehensive and non-redundant sequence representation of all public mRNA sequences, and ClinVar, which collects submissions on the clinical significance of genetic variants from clinical testing labs, researchers, locus-specific databases, expert panels, and professional societies. Others include dbVar and dbSNP, databases of genomic structural variation and of single nucleotide polymorphisms (SNPs) and other short variants respectively, which allow users to search, view, and download variation data from submitted studies on several species. There is also GenBank, a collection of annotated genetic sequences, and the Gene Expression Omnibus (GEO), a public functional genomics data repository. NCBI also hosts HapMap, which helps researchers find genes associated with human disease, and PubChem, a database of chemical molecules and their activities in biological assays.
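Several of the NCBI databases can also be searched programmatically through NCBI's Entrez E-utilities web service. The following is a minimal sketch of how a search request might be constructed; the helper name `build_esearch_url`, the choice of the `clinvar` database, and the example search term are illustrative assumptions, and the code only builds the request URL without contacting NCBI.

```python
from urllib.parse import urlencode

# Base address of the E-utilities "esearch" endpoint.
EUTILS_ESEARCH = "https://eutils.ncbi.nlm.nih.gov/entrez/eutils/esearch.fcgi"


def build_esearch_url(db, term, retmax=20):
    """Return an esearch URL for an Entrez database (hypothetical helper).

    db     -- Entrez database name, e.g. "clinvar", "snp" or "gds" (GEO DataSets)
    term   -- search expression, in the same syntax as the web search box
    retmax -- maximum number of record identifiers to return
    """
    params = {"db": db, "term": term, "retmode": "json", "retmax": retmax}
    # urlencode percent-escapes characters such as "[" and "]" in the term.
    return f"{EUTILS_ESEARCH}?{urlencode(params)}"


# Illustrative query: ClinVar records associated with the BRCA1 gene.
print(build_esearch_url("clinvar", "BRCA1[gene]", retmax=5))
```

The resulting URL can be opened in a browser or fetched with any HTTP client; the response lists matching record identifiers, which other E-utilities calls can then use to retrieve the records themselves.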