4 docs tagged with "search"

Biological Database

Many public databases provide information on proteins, small molecules, RNA, cells, and more. Many datasets aggregate other datasets, so overlap between databases can be high. Some datasets are extensively curated by hand, but most are generated automatically, so redundancy within a single dataset can also be high.
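As a rough illustration of the overlap and redundancy described here, the sketch below merges records from two sources and reports identifiers that appear in more than one. The record layout (`id`, `source`), the function name `merge_records`, and the sample entries are hypothetical placeholders, not taken from any particular database.

```python
from collections import defaultdict


def merge_records(*datasets):
    """Group records from several datasets by a shared identifier.

    Returns a dict mapping each identifier to the set of sources that
    contain it. Repeated entries within one dataset collapse into one.
    """
    merged = defaultdict(set)
    for records in datasets:
        for record in records:
            merged[record["id"]].add(record["source"])
    return dict(merged)


# Hypothetical records: identifiers and source names are placeholders.
db_a = [
    {"id": "X1", "source": "db_a"},
    {"id": "X2", "source": "db_a"},
    {"id": "X1", "source": "db_a"},  # redundant entry within one dataset
]
db_b = [
    {"id": "X1", "source": "db_b"},  # overlaps with db_a
    {"id": "X3", "source": "db_b"},
]

merged = merge_records(db_a, db_b)

# Identifiers present in more than one source.
overlap = sorted(key for key, sources in merged.items() if len(sources) > 1)
print(overlap)
```

In practice, records from different databases rarely share a clean identifier, so a real merge usually needs a mapping step between identifier schemes first.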

Foldseek

Some types of search, such as search by sequence or structure similarity, require specialized tools. Foldseek is a fast and reliable tool for protein structure similarity search.
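A minimal sketch of reading structure-search hits follows. It assumes a tab-separated result file such as the one `foldseek easy-search` writes, with the column order listed in `COLUMNS`. That order is an assumption about the default output and should be checked against the Foldseek documentation for the version in use. The function name `parse_hits` and the sample line are hypothetical.

```python
import csv
import io

# Assumed default column order of the tabular output; verify for your version.
COLUMNS = [
    "query", "target", "fident", "alnlen", "mismatch", "gapopen",
    "qstart", "qend", "tstart", "tend", "evalue", "bits",
]


def parse_hits(handle):
    """Parse tab-separated hits into dicts, converting the score fields."""
    hits = []
    for row in csv.reader(handle, delimiter="\t"):
        if not row:
            continue  # skip blank lines
        hit = dict(zip(COLUMNS, row))
        hit["fident"] = float(hit["fident"])
        hit["evalue"] = float(hit["evalue"])
        hit["bits"] = float(hit["bits"])
        hits.append(hit)
    return hits


# Hypothetical result line, not real search output.
sample = "queryA\ttargetB\t0.85\t120\t18\t0\t1\t120\t3\t122\t1.5e-20\t250\n"

hits = parse_hits(io.StringIO(sample))

# Pick the hit with the highest bit score.
best = max(hits, key=lambda hit: hit["bits"])
print(best["target"], best["evalue"])
```

To read a real result file, pass an open file handle to `parse_hits` instead of the `io.StringIO` wrapper.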

Protein

Many public databases provide information on proteins. Many datasets aggregate other datasets, so overlap between databases can be high. Some datasets are extensively curated by hand, but most are generated automatically, so redundancy within a single dataset can also be high.

Small Molecule

Many public databases provide information on proteins, small molecules, RNA, cells, and more. Many datasets aggregate other datasets, so overlap between databases can be high. Some datasets are extensively curated by hand, but most are generated automatically, so redundancy within a single dataset can also be high.