Protein Domains & Families
Integrated search in PROSITE, Pfam, PRINTS and other family and domain databases. InterPro is a database of protein families, domains and functional sites in which identifiable features found in known proteins can be applied to unknown protein sequences.
http://www.ebi.ac.uk/InterProScan/
Conserved Domain Database Search @ NCBI
http://www.ncbi.nlm.nih.gov/Structure/cdd/wrpsb.cgi
http://www.ncbi.nlm.nih.gov/sites/entrez?db=cdd
http://www.ncbi.nlm.nih.gov/Structure/cdd/cdd.shtml
PANTHER version 6.1 contains 5547 protein families, divided into 24,582 functionally distinct protein subfamilies by expert biologist curators.
http://www.pantherdb.org/panther/
TIGRFAMs are protein families based on Hidden Markov Models or HMMs. Use this page to see the curated seed alignmet for each TIGRFAM, the full alignment of all family members and the cutoff scores for inclusion in each of the TIGRFAMs. Also use this page to search through the TIGRFAMs and HMMs for text in the TIGRFAMs Text Search or search for specific sequences in the TIGRFAMs Sequence Search.
http://www.tigr.org/TIGRFAMs/index.shtml
ProDom is a comprehensive set of protein domain families automatically generated from the SWISS-PROT and TrEMBL sequence databases
http://prodom.prabi.fr/prodom/current/html/home.php
The Pfam database is a large collection of protein families, each represented by multiple sequence alignments and hidden Markov models (HMMs).
http://pfam.sanger.ac.uk/
Find Pfam families within your sequence of interest.
http://pfam.sanger.ac.uk/search?tab=searchSequenceBlock
The Conserved Domain Architecture Retrieval Tool (CDART) performs similarity searches of the NCBI Entrez Protein Database based on domain architecture, defined as the sequential order of conserved domains in proteins.
http://www.ncbi.nlm.nih.gov/Structure/lexington/lexington.cgi?cmd=rps
PANDIT is a collection of multiple sequence alignments and phylogenetic trees covering many common protein domains.
http://www.ebi.ac.uk/goldman-srv/pandit/
AnDom helps to assign structual domains to protein sequences and to classify them according to SCOP.
http://coot.embl.de/AnDom/Usage.html
Proteins from complete genomes have been clustered by sequence similarity into groups - COGs, or in case of viruses, VOGs. Genome ProtMap maps each protein from a COG/VOG back to its genome, and displays all the genomic segments coding for members of this particular group of related proteins.
http://www.ncbi.nlm.nih.gov/sutils/protmap.cgi?cluster=COG4690E&result=map
The NCBI Entrez Protein Clusters database is a collection of Reference Sequence (RefSeq) proteins from the complete genomes of prokaryotes, plasmids, and organelles grouped and annotated based on sequence similarity and protein function.
http://www.ncbi.nlm.nih.gov/sites/entrez?db=proteinclusters
The CluSTr database offers an automatic classification of UniProt Knowledgebase and IPI proteins into groups of related proteins. The clustering is based on analysis of all pairwise comparisons between protein sequences. The database provides links to InterPro, which integrates information on protein families, domains and functional sites from PROSITE, PRINTS, Pfam, ProDom, SMART, TIGRFAMs, Gene3D, SUPERFAMILY, PIR Superfamily and PANTHER.
http://www.ebi.ac.uk/clustr/
PROSITE consists of documentation entries describing protein domains, families and functional sites as well as associated patterns and profiles to identify them
http://www.expasy.ch/prosite/
Scans a sequence against PROSITE or a pattern against the UniProt Knowledgebase (Swiss-Prot and TrEMBL)
http://www.expasy.ch/tools/scanprosite/
Implementation of the evolutionary trace method. The software expands on the evolutionary trace by allowing manipulation of the input data and parameters of analysis, and presents a number of novel tree inspired analysis of protein families.
http://www.cmpharm.ucsf.edu/~marcinj/JEvTrace/
Blocks are multiply aligned ungapped segments corresponding to the most highly conserved regions of proteins that can be searched here.
http://blocks.fhcrc.org/blocks/blocks_search.html
ava GUI for InterProScan (JIPS) is a grapical user interface for finding new motif/domains/fingerprints in repeated Interproscan searches. JIPS compares results from InterProScan searches performed with two versions of a InterPro database and highlights new motifs/Domains/fingerprints which are from the updated database. Results are displayed in an easy-to-use tabular format. JIPS also contains tools to assist with ortholog-based comparative studies of protein signatures.
http://athena.bioc.uvic.ca/workbench.php?tool=jips&db=
High-quality Automated and Manual Annotation of microbial Proteomes. HAMAP is a system, based on manual protein annotation, that identifies and semi-automatically annotates proteins that are part of well-conserved families or subfamilies: the HAMAP families. HAMAP is based on manually created family rules and is applied to bacterial, archaeal and plastid-encoded proteins.
http://www.expasy.ch/sprot/hamap/
SuperFamily (PIRSF) classification system. Based on the evolutionary relationships of whole proteins, this classification system allows annotation of both speci®c biological and generic biochemical functions.
http://pir.georgetown.edu/pirsf/
This tool allows you to search our library of functional diagnostic profiles with a protein sequence. A match to one or more of these profiles provides you with an effective multi-alignment of your query sequence to the best-matched profile's defining set of proteins along with its annotations.
http://bmerc-www.bu.edu/bioinformatics/profile_request.html
SUPERFAMILY is a database of structural and functional protein annotations for all completely sequenced organisms.
http://supfam.mrc-lmb.cam.ac.uk/SUPERFAMILY/
mkdom 2 is the program used to build the ProDom database.
http://prodom.prabi.fr/prodom/xdom/welcome.html