Are you sure you want to leave this community? Leaving the community will revoke any permissions you have been granted in this community.
SciCrunch Registry is a curated repository of scientific resources, with a focus on biomedical resources, including tools, databases, and core facilities - visit SciCrunch to register your resource.
A database of protein families, each represented by multiple sequence alignments and hidden Markov models (HMMs). Users can analyze protein sequences for Pfam matches, view Pfam family annotation and alignments, see groups of related families, look at the domain organization of a protein sequence, find the domains on a PDB structure, and query Pfam by keywords. There are two components to Pfam: Pfam-A and Pfam-B. Pfam-A entries are high quality, manually curated families that may automatically generate a supplement using the ADDA database. These automatically generated entries are called Pfam-B. Although of lower quality, Pfam-B families can be useful for identifying functionally conserved regions when no Pfam-A entries are found. Pfam also generates higher-level groupings of related families, known as clans (collections of Pfam-A entries which are related by similarity of sequence, structure or profile-HMM).
Proper citation: Pfam (RRID:SCR_004726) Copy
http://www.uniprot.org/taxonomy/
NEWT is the taxonomy database maintained by the UniProt group. It integrates taxonomy data compiled in the NCBI database and data specific to the UniProt Knowledgebase. Browse by hierarchy, List all, or Complete proteomes. Organisms are classified in a hierarchical tree structure. Our taxonomy database contains every node (taxon) of the tree. UniProtKB taxonomy data is manually curated: next to manually verified organism names, we provide a selection of external links, organism strains and viral host information. Species with protein sequences stored in the UniProt Knowledgebase are named according to UniProt nomenclature. We endeavour to maintain a list of manually curated species names for which protein sequence data is available. In particular, we have adopted a systematic convention for naming viral and bacterial strains and isolates. Links to external sites are chosen by the UniProt taxonomy team and show pictures and various scientific data of interest (taxonomy, biology, physiology,...).
Proper citation: NEWT (RRID:SCR_004477) Copy
Database of positive selection based on a rigorous branch-site specific likelihood test. Positive selection is detected using CODEML on all branches of animal gene trees.
Proper citation: Selectome: a Database of Positive Selection (RRID:SCR_004542) Copy
http://www.bioinformatics.org/go2msig/
THIS RESOURCE IS NO LONGER IN SERVICE, documented on April 24, 2020. Software tool as automated Gene Ontology based multi species gene set generator for gene set enrichment analysis. Used to generate gene sets required for Gene Set Enrichment Analysis for almost any organism for which GO term association data exists.
Gene set collections can be automatically created for wide variety of species.
Proper citation: GO2MSIG (RRID:SCR_018359) Copy
http://img.jgi.doe.gov/cgi-bin/m/main.cgi
Resource for analysis and annotation of genome and metagenome datasets in comprehensive comparative context. IMG provides users with tools for analyzing publicly available genome datasets and metagenome datasets.
Proper citation: IMG System (RRID:SCR_002965) Copy
Database on transcriptional regulation in Escherichia coli K-12 containing knowledge manually curated from original scientific publications, complemented with high throughput datasets and comprehensive computational predictions. Graphic and text-integrated environment with friendly navigation where regulatory information is always at hand. They provide integrated views to understand as well as organized knowledge in computable form. Users may submit data to make it publicly available.
Proper citation: RegulonDB (RRID:SCR_003499) Copy
http://www.bioinsilico.org/cgi-bin/CAPSDB/staticHTML/home
It is a structural classification of helix-cappings or caps compiled from protein structures. Caps extracted from protein structures have been structurally classified based on geometry and conformation and organized in a tree-like hierarchical classification where the different levels correspond to different properties of the caps. CASP-DB is fully browsable and searchable and is regularly updated. The regions of the polypeptide chain immediately preceding or following a helix are known as Nt- and Ct cappings, respectively. Cappings play a central role stabilizing helices due to lack of intrahelical hydrogen bonds in the first and last turn. Sequence patterns of amino acid type preferences have been derived for cappings but the structural motifs associated to them are still unclassified. CAPS-DB is a database of clusters of structural patterns of different capping types. The clustering algorithm is based in the geometry and the space conformation of these regions. CAPS-DB is a relational database that allows the user to search, browse, inspect and retrieve structural data associated to cappings. The contents of CAPS-DB might be of interest to a wide range of scientist covering different areas such as protein design and engineering, structural biology and bioinformatics. CapsDB v4.0 * PDB structures: 4591 * Number of clusters: 859 * Number of caps: 31452
Proper citation: CAPS Database (RRID:SCR_006862) Copy
http://scop.mrc-lmb.cam.ac.uk/scop/
The Structural Classification of Proteins (SCOP) database is a comprehensive ordering of all proteins of known structure, according to their evolutionary and structural relationships. Protein domains in SCOP are hierarchically classified into families, superfamilies, folds and classes. The continual accumulation of sequence and structural data allows more rigorous analysis and provides important information for understanding the protein world and its evolutionary repertoire. SCOP participates in a project that aims to rationalize and integrate the data on proteins held in several sequence and structure databases. As part of this project, starting with release 1.63, we have initiated a refinement of the SCOP classification, which introduces a number of changes mostly at the levels below superfamily. The pending SCOP reclassification will be carried out gradually through a number of future releases. In addition to the expanded set of static links to external resources, available at the level of domain entries, we have started modernization of the interface capabilities of SCOP allowing more dynamic links with other databases.
Proper citation: SCOP: Structural Classification of Proteins (RRID:SCR_007039) Copy
Database devoted to protein domains. It is also a collection of tools for the investigation of the relationships between protein sequences and motifs described on them.
Proper citation: MyHits (RRID:SCR_006757) Copy
http://bioinformatics.biol.uoa.gr/cuticleDB
A relational database containing all structural proteins of Arthropod cuticle identified to date. Many come from direct sequencing of proteins isolated from cuticle and from sequences from cDNAs that share common features with these authentic cuticular proteins. It also includes proteins from the five sequenced genomes where manual annotation has been applied to cuticular proteins: Anopheles gambiae, Apis mellifera, Bombyx mori, Drosophila melanogaster, and Nasonia vitripennis. Some sequences were confirmed as authentic cuticular proteins because protein sequencing revealed that they were present in cuticle; others were identified by sequence homology and other criteria. Entries provides information about whether sequences are putative or authentic cuticular proteins. CuticleDB was primarily designed to contain correct and full annotation of cuticular protein data. The database will be of help to future genome annotators. Users will be able to test hypotheses for the existence of known and also of yet unknown motifs in cuticular proteins. An analysis of motifs may contribute to understanding how proteins contribute to the physical properties of cuticle as well as to the precise nature of their interaction with chitin.
Proper citation: CuticleDB (RRID:SCR_007045) Copy
The Database of Protein Disorder (DisProt) is a curated database that provides information about proteins that lack fixed 3D structure in their putatively native states, either in their entirety or in part. Users can BLAST sequences, browse by protein name, or view by protein function and functional subclass.
Proper citation: DisProt - Database of Protein Disorder (RRID:SCR_007097) Copy
Genome wide map of putative transcription factor binding sites in Arabidopsis thaliana genome.Data in AthaMap is based on published transcription factor (TF) binding specificities available as alignment matrices or experimentally determined single binding sites.Integrated transcriptional and post transcriptional data.Provides web tools for analysis and identification of co-regulated genes. Provides web tools for database assisted identification of combinatorial cis-regulatory elements and the display of highly conserved transcription factor binding sites in Arabidopsis thaliana.
Proper citation: AthaMap (RRID:SCR_006717) Copy
A database and interactive web site for manipulating and displaying annotations on genomes. Features include: detailed views of the genome; use of a variety of premade or personally made glyphs ; customizable order and appearance of tracks by administrators and end-users; search by annotation ID, name, or comment; support of third party annotation using GFF formats; DNA and GFF dumps; connectivity to different databases, including BioSQL and Chado; and a customizable plug-in architecture (e.g. run BLAST, find oligonucleotides, design primers, etc.). GBrowse is distributed as source code for Macintosh OS X, UNIX and Linux platforms, and as pre-packaged binaries for Windows machines. It can be installed using the standard Perl module build procedure, or automated using a network-based install script. In order to use the net installer, you will need to have Perl 5.8.6 or higher and the Apache web server installed. The wiki portion accepts data submissions.
Proper citation: GBrowse (RRID:SCR_006829) Copy
A database of proteins identified by various 2-D PAGE and SDS-PAGE reference maps. Each SWISS-2DPAGE entry contains textual data on one protein, including mapping procedures, physiological and pathological information, experimental data (isoelectric point, molecular weight, amino acid composition, peptide masses) and bibliographical references. In addition to this textual data, SWISS-2DPAGE provides several 2-D PAGE and SDS-PAGE images showing the experimentally determined location of the protein, as well as a theoretical region computed from the sequence protein, indicating where the protein might be found in the gel. Using the database, users can locate these proteins on the 2-D PAGE maps or display the region of a 2-D PAGE map where one might expect to find a protein from UniProtKB/Swiss-Prot.
Proper citation: SWISS-2DPAGE (RRID:SCR_006946) Copy
http://www.agbase.msstate.edu/
A curated, open-source, web-accessible resource for functional analysis of agricultural plant and animal gene products. Our long-term goal is to serve the needs of the agricultural research communities by facilitating post-genome biology for agriculture researchers and for those researchers primarily using agricultural species as biomedical models. AgBase provides tools designed to assist with the analysis of proteomics data and tools to evaluate experimental datasets using the GO. Additional tools for sequence analysis are also provided. We use controlled vocabularies developed by the Gene Ontology (GO) Consortium to describe molecular function, biological process, and cellular component for genes and gene products in agricultural species. AgBase will also accept annotations from any interested party in the research communities. AgBase develops freely available tools for functional analysis, including tools for using GO. We appreciate any and all questions, comments, and suggestions. AgBase uses the NCBI Blast program for searches for similar sequences. And the Taxonomy Browser allows users to find the NCBI defined taxon ID for or taxon name for different organisms.
Proper citation: AgBase (RRID:SCR_007547) Copy
HubMed provides an interface to PubMed. Quick access to searches with a Firefox search plugin or a HubMed bookmarklet (drag to your browser''s bookmarks toolbar). Export citations in RIS, BibTeX, RDF and MODS formats, or directly to RefWorks. Unzip HubMed''s import filter into Endnote''s Filters folder for direct import into Endnote, or install the RIS Export plugin for direct import into ProCite, RefMan and older versions of Endnote. Use the Citation Finder to convert reference lists from PDFs into search results. Create lists of closely related papers using Rank Relations, then visualise and browse clusters of related papers using TouchGraph (requires Java). Graph occurrences of keywords in published papers over time. Tag and store annotated metadata for articles of interest.
Proper citation: HubMed (RRID:SCR_007296) Copy
COGEME is an ongoing BBSRC-funded study to construct a relational database of genomic information from phytopathogenic fungi. This site also hosts microarray data for Blumeria graminis. Expressed sequence tags (ESTs) obtained from eighteen species of plant pathogenic fungi, two species of phytopathogenic oomycete and three species of saprophytic fungi are included here. Hierarchical clustering software was used to classify together ESTs representing the same gene and produce a single contig, or consensus sequence. The unisequence set for each pathogen therefore represents a set of unique gene sequences, each one consisting of either a single EST or a contig sequence made from a group of ESTs. Unisequences were annotated based on top hits against the NCBI non-redundant protein database using blastx.
Proper citation: COGEME Phytopathogenic Fungi and Oomycete EST Database (RRID:SCR_007604) Copy
CATH is a hierarchical classification of protein domain structures, which clusters proteins at four major levels: Class (C), Architecture (A), Topology (T) and Homologous superfamily (H). The boundaries and assignments for each protein domain are determined using a combination of automated and manual procedures which include computational techniques, empirical and statistical evidence, literature review and expert analysis Users can search CATH by ID/Sequence/text. They can also browse CATH from the top of the hierarchy, or download CATH data.
Proper citation: CATH: Protein Structure Classification (RRID:SCR_007583) Copy
http://www.ebi.ac.uk/GenomeReviews/
THIS RESOURCE IS NO LONGER IN SERVICE, documented April 24, 2017. The Genome Reviews database provides an up-to-date, standardized and comprehensively annotated view of the genomic sequence of organisms with completely deciphered genomes. Currently, Genome Reviews contains the genomes of archaea, bacteria, bacteriophages and selected eukaryota. Genome Reviews is available as a MySQL relational database, or a flat file format derived from that in the EMBL Nucleotide Sequence Database. An Ensembl-style browser is now available for Genome Reviews, providing a zoomable graphical view of all chromosomes and plasmids represented in the database. The location and structure of all genes is shown and the distribution of features throughout the sequence is displayed.
Proper citation: Genome Reviews (RRID:SCR_007685) Copy
http://firedb.bioinfo.cnio.es/
A database of Protein Data Bank structures, ligands and annotated functional site residues. The database can be accessed by PDB codes or UniProt accession numbers as well as keywords. FireDB contains information on every chemical compound in the PDB, including their descriptions, the PDB structures in which the compounds are found and the amino acids that are in contact with the ligand.
Proper citation: FireDB (RRID:SCR_007655) Copy
Can't find your Tool?
We recommend that you click next to the search bar to check some helpful tips on searches and refine your search firstly. Alternatively, please register your tool with the SciCrunch Registry by adding a little information to a web form, logging in will enable users to create a provisional RRID, but it not required to submit.
Welcome to the NIF Resources search. From here you can search through a compilation of resources used by NIF and see how data is organized within our community.
You are currently on the Community Resources tab looking through categories and sources that NIF has compiled. You can navigate through those categories from here or change to a different tab to execute your search through. Each tab gives a different perspective on data.
If you have an account on NIF then you can log in from here to get additional features in NIF such as Collections, Saved Searches, and managing Resources.
Here is the search term that is being executed, you can type in anything you want to search for. Some tips to help searching:
You can save any searches you perform for quick access to later from here.
We recognized your search term and included synonyms and inferred terms along side your term to help get the data you are looking for.
If you are logged into NIF you can add data records to your collections to create custom spreadsheets across multiple sources of data.
Here are the sources that were queried against in your search that you can investigate further.
Here are the categories present within NIF that you can filter your data on
Here are the subcategories present within this category that you can filter your data on
If you have any further questions please check out our FAQs Page to ask questions and see our tutorials. Click this button to view this tutorial again.