http://www.ncbi.nlm.nih.gov Conserved Domains Database banner graphic
NCBI Home Page NCBI Site Search page NCBI Guide that lists and describes the NCBI resources
Structure Home 3D Macromolecular Structures Conserved Domains
PubChem BioSystems
Conserved Domains and Protein Classification
RESOURCES
SEARCH HOW TO HELP NEWS FTP PUBLICATIONS DISCOVER
Resources
Conserved Domain Database (CDD)
is a protein annotation resource that consists of a collection of well-annotated multiple sequence alignment models for ancient domains and full-length proteins. These are available as position-specific score matrices (
PSSMs) for fast identification of conserved domains in protein sequences via
RPS-BLAST.
CDD content includes
NCBI-curated domains, which use
3D-structure information to explicitly define domain boundaries and provide insights into
sequence/structure/function relationships, as well as domain models imported from a number of
external source databases (
Pfam,
SMART,
COG,
PRK,
TIGRFAM).
Search How To Help News FTP Publications
CD-Search
&
Batch CD-Search
CD-Search is NCBI's interface to searching the Conserved Domain Database with
protein or nucleotide query sequences. It uses
RPS-BLAST, a variant of PSI-BLAST, to quickly scan a set of pre-calculated position-specific scoring matrices (
PSSMs) with a protein query. The
results of CD-Search are presented as an annotation of protein domains on the user query sequence (
illustrated example), and can be visualized as domain
multiple sequence alignments with
embedded user queries. High confidence associations between a query sequence and conserved domains are shown as
specific hits. The
CD-Search Help provides additional details, including information about
running CD-Search locally.
Batch CD-Search serves as both a web application and a
script interface for a conserved domain search on
multiple protein sequences, accepting up to 100,000 proteins in a single job. It enables you to view a
graphical display of the concise or full search result for any individual protein from your input list, or to
download the results for the complete set of proteins. The
Batch CD-Search Help provides additional details.
CD-Search (
Help &
FTP)
Batch CD-Search (
Help)
Publications
CDART: Domain Architectures Conserved Domain Architecture Retrieval Tool (CDART)
performs similarity searches of the
Entrez Protein database based on domain architecture, defined as the sequential order of conserved domains in protein queries. CDART finds protein similarities across significant evolutionary distances using sensitive domain profiles rather than direct sequence similarity. Proteins similar to the query are grouped and scored by architecture. You can search CDART directly with a query protein sequence, or, if a sequence of interest is already in the Entrez Protein database, simply retrieve the record, open its "
Links" menu, and select "
Domain Relatives" to see the precalculated CDART results (
illustrated example). Relying on domain profiles allows CDART to be fast and, because it relies on annotated functional domains, informative.
About Search Help FTP Publications
CDTree
is a helper application for your web browser that allows you to interactively view and examine conserved domain hierarchies curated at NCBI. CDTree works with Cn3D as its alignment viewer/editor, it is used in the CDD curation process and is a both
classification and research tool for functional annotation and the study of protein and protein domain families.
About Install Publications
How to use the conserved domain resources: examples
back to top
Identify the putative function of a protein sequence
Identify the amino acids in a protein sequence that are putatively involved in functions such as binding or catalysis, as mapped from conserved domain annotations to the query sequence
View a query protein sequence embedded within the multiple sequence alignment of a domain model
Interactively view the 3D structure of a conserved domain
Find other proteins with similar domain architecture
Interactively view the phylogenetic sequence tree for a conserved domain model of interest with or without a query sequence embedded
Highlights
What is a conserved domain? Thumbnail image for 3D structure of type-1 insulin-like growth-factor receptor (IGF-1R), viewed in the free Cn3D structure viewing program and colored by domain. Click on image to jump to a larger, annotated version in the CDD help document.
3-D structures and conserved core motifs: Thumbnail image for example of 3-dimensional structure: Cl- binding residues in Voltage-Gated Chloride Channel, cd00400. Click on image to jump to a larger, annotated version in the CDD help document.
Conserved features (binding and catalytic sites) Thumbnail image for examples of Conserved Features (Sites) in Voltage-Gated Chloride Channel, cd00400, including Cl- selectivity filter, pore-gating glutamate residue, Cl- binding residues, and dimer interface.. Click on image to jump to a larger, annotated version in the CDD help document.
Proteins with Similar Domain Architectures Thumbnail image showing the domain relatives for protein sequence NP_081086, mouse DNA mismatch repair protein Mlh1. Domain relatives are protein sequences that contain one or more of the conserved domains found in the query sequence, as identified by the Conserved Domain Architecture Retrieval Tool (CDART). Click on the image for an example of how to find domain relatives for a query protein.
Domain family hierarchies Thumbnail image of domain hierarchy showing divergence in a protein family based on phylogenetic relationships of protein sequences and functional properties. Click on image to jump to a larger, annotated version in the CDD help document.
| Revised 21 March 2013 | | Help Desk | Disclaimer | Privacy statement | Accessibility |