Patterns & Motifs Search
Sequence Pattern Discovery generates patterns from a collection of unaligned protein or DNA sequences
http://cbcsrv.watson.ibm.com/Tspd.html
Search your query sequence for protein motifs, rapidly compare your query protein sequence against all patterns stored in the PROSITE pattern database and determine what the function of an uncharacterised protein is.
http://www.ebi.ac.uk/ppsearch/
PRINTS is a compendium of protein fingerprints. A fingerprint is a group of conserved motifs used to characterise a protein family; its diagnostic power is refined by iterative scanning of a SWISS-PROT/TrEMBL composite. Usually the motifs do not overlap, but are separated along a sequence, though they may be contiguous in 3D-space. Fingerprints can encode protein folds and functionalities more flexibly and powerfully than can single motifs, full diagnostic potency deriving from the mutual context provided by motif neighbours.
http://www.bioinf.manchester.ac.uk/dbbrowser/PRINTS/
Search against FingerPRINTScan with a protein query sequence to identify the closest matching PRINTS sequence motif fingerprints in a protein sequence.
http://www.ebi.ac.uk/printsscan/
This tool allows the user to search for patterns conserved in sets of unaligned protein sequences. The user can specify what kind of patterns should be searched for, and how many sequences should match a pattern to be reported.
http://www.ebi.ac.uk/pratt/
PRATT is a tool to discover patterns that are conserved in a set of protein sequences. The patterns are reported using the PROSITE format.
http://www.expasy.org/tools/pratt/
Relationships between protein sequences and motifs. Hits is a free database devoted to protein domains. It is also a collection of tools for the investigation of the relationships between protein sequences and motifs described on them. These motifs are defined by an heterogeneous collection of predictors, which currently includes regular expressions, generalized profiles and hidden Markov models.
http://myhits.isb-sib.ch/cgi-bin/index
Motif scanning means finding all known motifs that occur in a sequence. This form lets you paste a protein sequence, select the collections of motifs to scan for, and launch the search.
http://myhits.isb-sib.ch/cgi-bin/motif_scan
Complex Pattern Search
http://www.dkfz.de/mga2/3of5/3of5.html
Eukaryotic Linear Motif resource for functional sites in proteins
http://elm.eu.org/
The Nine Amino Acid Transactivation Domain (9aaTAD) describes a nine amino acid-long motif that is common to the transactivation domains of many transcription factors ranging from Gal4 to p53 to NF-κB.
http://emb1.bcc.univie.ac.at/toolbox/9aatad/
Minimotif Miner (MnM) analyzes protein queries for the presence of short functional motifs that, in at least one protein, has been demonstrated to be involved in posttranslational modifications, binding to other proteins, nucleic acids, or small molecules, or proteins trafficking.
http://sms.engr.uconn.edu/servlet/SMSSearchServlet
PATTINPROT is a tool to scan a protein database of one or several sequences for one or several patterns.
http://npsa-pbil.ibcp.fr/cgi-bin/npsa_automat.pl?page=npsa_pattinprot.html
Protein Repeats
Rapid Automatic Detection and Alignment of Repeats in protein sequences. Tool to detect protein repeats.
http://www.ebi.ac.uk/Radar/
De novo tandem repeat detection and architecture modeling in protein sequences
http://jimcooperlab.mcdb.ucsb.edu/xstream/
TRUST is a method for ab-initio determination of internal repeats in proteins.
http://zeus.cs.vu.nl/programs/trustwww/
REPRO is able to recognise distant repeats in a single query sequence.
http://zeus.cs.vu.nl/programs/reprowww/
Searches a protein sequence for repeats. Currently implemented repeat families are Ankyrin, Armadillo, HAT, HEAT, HEAT_AAA, HEAT_ADB, HEAT_IMB, Kelch, Leucin Rich Repeats, PFTA, PFTB, RCC1, TPR, WD40.
http://www.embl-heidelberg.de/~andrade/papers/rep/search.html
Low Complexity Regions
Prediction of Low Complexity Regions
http://mendel.imp.ac.at/METHODS/seg.server.html
Protease & Cleavage Sites
Predicts potential protease and cleavage sites and sites cleaved by chemicals in a given protein sequence
http://www.expasy.ch/tools/peptidecutter/
Calculate masses of peptides and their post-translational modifications for a UniProtKB/Swiss-Prot or UniProtKB/TrEMBL entry or for a user sequence
http://www.expasy.ch/tools/peptide-mass.html
ProP 1.0 server predicts arginine and lysine propeptide cleavage sites in eukaryotic protein sequences using an ensemble of neural networks. Furin-specific prediction is the default. It is also possible to perform a general proprotein convertase (PC) prediction.
http://www.cbs.dtu.dk/services/ProP/
The NetPicoRNA World Wide Web server produces neural network predictions of cleavage sites of picornaviral proteases.
http://www.cbs.dtu.dk/services/NetPicoRNA/
NetCorona predicts coronavirus 3C-like proteinase (or protease) cleavage sites using artificial neural networks on amino acid sequences.
http://www.cbs.dtu.dk/services/NetCorona/