Skip to main content

Biological Database

Many public databases are available with information on proteins, small molecules, RNA, cells, and more. Many datasets are aggregates of other datasets and therefore overlap between databases can be high. While some datasets are extensively curated by hand, most are automated, and therefore redundancy within a dataset can also be high.

Copilot addresses these challenges by offering integrated search capabilities that access public databases, streamlining the process of navigating overlapping datasets and minimizing redundancy.

Protein Databases

Find a cytokine

Protein nameUniProtSequencePDB
Cytokine receptor-like factor 3Q8IUI8MRGAME ... 442-
Cytokine

UniProt is a comprehensive protein sequence and functional information database, widely used by the scientific community. The data consists of manual and automated curation, experimental and predicted information. More information here

Find human hemoglobin

Hemoglobin

Protein Data Bank (PDB) is a repository for experimental 3D structural data of biological molecules. The main focus is proteins, but includes nucleic acids, lipids, and small molecules. Experiments include crystallography, cryo-EM, NMR, and some other (e.g. SAXS). Data is manually entered by individual researchers. More information here

Fetch 1yvn

AlphaFold Database provides AI-predicted 3D structures of proteins, covering natural protein sequences, and providing structures for those that do no have experimental structural data. Data is generated and quality-controlled automatically. More information here

Load H2NHM8

Cyclin-dependent kinase 2

Small Molecule

PubChem is a small molecule database with chemical properties, biological activities, and other functional data. Information is automatically curated. More information here

Load molecule rapamycin

FoldSeek

FoldSeek is a computational tool for comparing and aligning protein structures. More information here

Find others like 3kwd

3kwd