Are you sure you want to leave this community? Leaving the community will revoke any permissions you have been granted in this community.
SciCrunch Registry is a curated repository of scientific resources, with a focus on biomedical resources, including tools, databases, and core facilities - visit SciCrunch to register your resource.
http://www.roselab.jhu.edu/coil/
The Protein Coil Library is a library of protein structure fragments derived from the Protein Data Bank (PDB). The fragments in this library are those fragments in the PDB that cannot be classified as either alpha-helix or beta-strand. Three-dimensional structures as well as side-chain and backbone torsion angles are stored in the database. The Protein Coil Library allows rapid and comprehensive access to non-alpha-helix and non-beta-strand fragments contained in the Protein Data Bank (PDB). The library contains both sequence and structure information together with calculated torsion angles for both the backbone and side chains. Several search options are implemented, including a query function that uses output from popular PDB-culling servers directly. Additionally, several popular searches are stored and updated for immediate access. The library is a useful tool for exploring conformational propensities, turn motifs, and a recent model of the unfolded state. The library stores the complete torsion angle descriptions for the fragments as well as the three dimensional structures of the fragments themselves. The goal of extracting and pre-calculating this data is to allow for more straightforward investigation of peptide structure without the background of secondary structure elements. In addition to searching by PDB ID, it is possible to download a particular size class, perform a batch search of PDB/chain ID''s, or download precompiled lists of PDB ID''s of interest (PDB Select, etc.). For users interested in browsing the entire database at once or maintaining their own locally-updated copy of the library, FTP access instructions are also provided. The files stored in the coil library FTP site or returned after a batch search are organized heirarchically by PDB ID. This is done to reduce filesystem access times and fascilitate searches using the UNIX find utility. At the lowest directory level in the heirarchy, files are further sorted by fragment length. As a result, the number of files in a particular directory is generally less then 50, yielding relatively fast access on UNIX/Linux filesystems. The heirarchical organization is based on the middle two letters of the PDB ID. For example, hen egg lysozyme, which has a PDB ID of 1HEL, will be located in the directory h/he/. At the final level, fragments of varying sizes are stored in directories that correspond to their fragment length. Again, using lysozyme as an example, any seven-residue fragments, if they exist, will reside in the directory h/he/7/. Similarly, seven-residue fragments from 2HEX and 1HE0 will also be in this location. Sponsors: The Protein Coil Library is funded by Johns Hopkins University.
Proper citation: The Protein Coil Library (RRID:SCR_008233) Copy
http://bibiserv.techfak.uni-bielefeld.de/HyPa/
THIS RESOURCE IS NO LONGER IN SERVICE, documented on July 15, 2013. It contains annotated structural elements characteristic for certain classes of structural and/or functional RNAs. These elements are described in a language specifically designed for this purpose. The language allows convenient specification of hybrid patterns, i.e. motifs consisting of sequence features and structural elements together with sequence similarity and thermodynamic constraints. A system that searches complex patterns (on nucleic-acid or protein level) in large biosequence-databases. As patterns, they allow hybrid patterns, which combine sequence similarity, structure similarity and arbitrary characteristics, like thermodynamic constraints. Applications are in the research of highly specific Protein/RNA-interactions or in the search of RNA-tertiary-structure-interactions. They developed a declarative pattern description language, which is implemented by known and new pattern-matching algorithms and an optimizing backtracking procedure. To achieve high efficiency when screening large data sets, the patterns are divided and queries are composed. The significance of patterns is estimated by a Monte-Carlo procedure. Complex results of queries are processed by a visualizing component. A library of biologically relevant patterns is developed and it is provided on the WWW together with the search-tool. The evaluation of the tool w.r.t. to the biosequence databases will in some cases mean to make laboratory-experiments, in order to check algorithmically developed functional hypothesis. Sponsors: This project is supported by a grant from the Deutsche Forschungsgemeinschaft. It is part of the special program on Computational Methods for the Analysis and Interpretation of large genomic data
Proper citation: Hybrid Pattern Library (RRID:SCR_008193) Copy
http://cmbi.bjmu.edu.cn/cmbidata/cgf/CGF_Database/cytokine.medic.kumamoto-u.ac.jp/
THIS RESOURCE IS NO LONGER IN SERVICE, documented on August 26, 2016. A collection of cDNA, gene and protein records of cytokines deposited in public databases provides various information about the cytokine members of vertebrates in other databases including NCBI GenBank, Swiss-Prot, UniGene, TIGR (The Institute for Genomic Research) Gene Indices, Ensembl, Entrez Gene, Mouse Genome Informatics (MGI) and Rat Genome Database (RGD). It also provides orthologous relationship of cytokine members and includes novel members identified in the databases.
Proper citation: Cytokine Family Database (RRID:SCR_008134) Copy
http://pbil.univ-lyon1.fr/acuts/ACUTS.html
THIS RESOURCE IS NO LONGER IN SERVICE, Documented on August 12, 2014. Database that identifies new regulatory elements in untranslated regions of protein-coding genes (5 prime flanks, 5 prime UTRs, introns, 3 prime UTRs and 3 prime flanks). The analyses is focused on genes from metazoan species (essentially vertebrates, insects and nematodes). Information on highly conserved regions (sequences, alignments, annotations, bibliographic references) are compiled. Currently 176 out of 326 detected highly conserved regions (HCRs) have been analyzed and incorporated in the database. You can also access the list of annotated conserved elements and the list of conserved elements that remain to be processed. Their approach is based on comparative sequence analysis, for the identification of phylogenetic footprints.
Proper citation: Ancient conserved untranslated sequences (RRID:SCR_008130) Copy
A database of binding affinities for the protein-ligand complexes in the Protein Data Bank (PDB). The PDBbind database is a collection of the experimentally measured binding affinities exclusively for the protein-ligand complexes available in the Protein Data Bank (PDB). It thus provides a link between energetic and structural information of those complexes and may be of great value to various molecular recognition studies. This site was last updated in 2007. The updated version of the resource is maintained by the Shanghai Institute of Organic Chemistry (http://www.pdbbind.org.cn).
Proper citation: Protein Data Bank Bind Database (RRID:SCR_008224) Copy
http://wwwmgs.bionet.nsc.ru/mgs/gnw/pdbsite/
Protein Data Bank (PDB) contains data on the spatial protein structures and their biologically active sites (i.e., ligand binding regions, enzyme catalytic centers, regions subjected to biochemical modifications, etc.). However, neither of the well known systems searching PDB does not provide the user with possibility to make the queries related with the active sites. A database PDBSITE storing the data on biologically active sites contained in the PDB database has been developed. PDBSITE accumulates amino acid content, structure features calculated by spatial protein structures, and physicochemical properties of sites and their spatial surroundings.
Proper citation: Protein Data Bank Site (RRID:SCR_008227) Copy
THIS RESOURCE IS NO LONGER IN SERVICE, documented on August 20,2019.The COG-database has become a powerful tool in the field of comparative genomics. The construction of this data-base is based on sequence homologies of proteins from different completely sequenced genomes. Highly homologous proteins are assigned to clusters of orthologous groups. The updated collection of orthologous protein sets for prokaryotes and eukaryotes is expected to be a useful platform for functional annotation of newly sequenced genomes, including those of complex eukaryotes, and genome-wide evolutionary studies. The availability of multiple, essentially complete genome sequences of prokaryotes and eukaryotes spurred both the demand and the opportunity for the construction of an evolutionary classification of genes from these genomes. Such a classification system based on orthologous relationships between genes appears to be a natural framework for comparative genomics and should facilitate both functional annotation of genomes and large-scale evolutionary studies. Here is a major update of the previously developed system for delineation of Clusters of Orthologous Groups of proteins (COGs) from the sequenced genomes of prokaryotes and unicellular eukaryotes and the construction of clusters of predicted orthologs for 7 eukaryotic genomes, which we named KOGs after eukaryotic orthologous groups. The COG collection currently consists of 138,458 proteins, which form 4873 COGs and comprise 75% of the 185,505 (predicted) proteins encoded in 66 genomes of unicellular organisms. The eukaryotic orthologous groups (KOGs) include proteins from 7 eukaryotic genomes: three animals (the nematode Caenorhabditis elegans, the fruit fly Drosophila melanogaster and Homo sapiens), one plant, Arabidopsis thaliana, two fungi (Saccharomyces cerevisiae and Schizosaccharomyces pombe), and the intracellular microsporidian parasite Encephalitozoon cuniculi. The current KOG set consists of 4852 clusters of orthologs, which include 59,838 proteins, or approximately 54% of the analyzed eukaryotic 110,655 gene products. Compared to the coverage of the prokaryotic genomes with COGs, a considerably smaller fraction of eukaryotic genes could be included into the KOGs; addition of new eukaryotic genomes is expected to result in substantial increase in the coverage of eukaryotic genomes with KOGs. Examination of the phyletic patterns of KOGs reveals a conserved core represented in all analyzed species and consisting of approximately 20% of the KOG set. This conserved portion of the KOG set is much greater than the ubiquitous portion of the COG set (approximately 1% of the COGs). In part, this difference is probably due to the small number of included eukaryotic genomes, but it could also reflect the relative compactness of eukaryotes as a clade and the greater evolutionary stability of eukaryotic genomes.
Proper citation: Phylogenetic Clusters of Orthologous Groups Ranking (RRID:SCR_008223) Copy
http://lpdb.chem.lsa.umich.edu/
The Ligand Protein Database is designed to allow the selection of complexes based on various properties of receptors and ligands for the design and parametrization of new scoring functions or to assess and improve existing ones. Moreover, for each complex, a continuum of ligand positions ranging from the crystallographic position to points on the surface of the protein receptor allows an assessment of the energetic behavior of particular scoring functions. Access to the database is password protected. To obtain access to the LPDB, complete a form, available online, have it signed by your research advisor, and fax the completed form back to the attention of Professor Charles L. Brooks III, (858) 784-8688. There is no fee for academic use of the LPDB. We are currently working out details for licensing to our colleagues in industry. Please contact Professor Brooks to obtain current information on access to the LPDB.
Proper citation: LPDB: Ligand-Protein DataBase (RRID:SCR_008172) Copy
The aim of the PEROXISOME database (PeroxisomeDB) is to gather, organize and integrate curated information on peroxisomal genes, their encoded proteins, their molecular function and metabolic pathway they belong to, and their related disorders. PeroxisomeDB contains the complete peroxisomal proteome of Homo sapiens (encoded by 85 genes) and Saccharomyces cerevisiae (encoded by 61 genes). Now, we have included 34 new organism genomes with the acquisition of 2426 new peroxisomal homolog proteins. PeroxisomeDB 2.0 integrates the peroxisomal metabolome of whole microbody family by the new incorporation of the glycosome proteomes of trypanosomatids and the glyoxysome proteome of Arabidopsis thaliana. The site also provides a Peroxisome Metabolome of peroxisomal genes and proteins, their molecular interactions and metabolic pathways, tools for comparative genomics, predictive tools. Sponsors: Preoxisome Database is funded by Institut de Gntique et deBiologie Molculaire et Cellulaire.
Proper citation: Peroxisome Database (RRID:SCR_008352) Copy
http://cssb.biology.gatech.edu/skolnick/files/gpcr/gpcr.html
THIS RESOURCE IS NO LONGER IN SERVICE, documented on August 19,2019.Database of tertiary structural modeling results of threading assembly refinement (TASSER) method for all 907 G protein-coupled receptors (GPCRs) in human genome. All sequences were collected from GPCR database http://www.gpcr.org/7tm/ and http://www.expasy.org/cgi-bin/lists?7tmrlist.txt. Unlike traditional homology modeling approaches, TASSER modeling does not require solved homologous template structures; moreover, it often refines the structures closer to native. G protein-coupled receptors (GPCRs), encoded by about 5% of human genes, comprise the largest family of integral membrane proteins and act as cell surface receptors responsible for the transduction of endogenous signal into a cellular response. Although tertiary structural information is crucial for function annotation and drug design, there are few experimentally determined GPCR structures. To address this issue, we employ the recently developed threading assembly refinement (TASSER) method to generate structure predictions for all 907 putative GPCRs in the human genome. Unlike traditional homology modeling approaches, TASSER modeling does not require solved homologous template structures; moreover, it often refines the structures closer to native. These features are essential for the comprehensive modeling of all human GPCRs when close homologous templates are absent. Based on a benchmarked confidence score, approximately 820 predicted models should have the correct folds. The majority of GPCR models share the characteristic seven-transmembrane helix topology, but 45 ORFs are predicted to have different structures. This is due to GPCR fragments that are predominantly from extracellular or intracellular domains as well as database annotation errors. Our preliminary validation includes the automated modeling of bovine rhodopsin, the only solved GPCR in the Protein Data Bank. With homologous templates excluded, the final model built by TASSER has a global C(alpha) root-mean-squared deviation from native of 4.6 angstroms, with a root-mean-squared deviation in the transmembrane helix region of 2.1 angstroms. Models of several representative GPCRs are compared with mutagenesis and affinity labeling data, and consistent agreement is demonstrated. Structure clustering of the predicted models shows that GPCRs with similar structures tend to belong to a similar functional class even when their sequences are diverse. These results demonstrate the usefulness and robustness of the in silico models for GPCR functional analysis. Sponsors: GPCR is funded by the University at Buffalo, Buffalo, New York.
Proper citation: Structure modeling of 907 G protein coupled receptors in the human genome (RRID:SCR_008351) Copy
http://mitointeractome.kobic.kr/
Database that gathers data on interactions in the mitochondrial proteome that has been used to construct a network for the aging process in humans and to identify interactions that influence this process, since mitochondria is a major source of cellular reactive oxygen species that accumulate during aging. It will: # aid in increasing our understanding of the molecular functions and interaction networks of mitochondrial proteins, # help in identifying new target proteins for experimental research using predicted protein-protein interaction information, and # help in identifying biomarkers for diagnosis and new molecular targets for drug development related to mitochondria. How is MitoInteractome different? * Provides protein-protein interaction information with graphical display. * Applies newly added new mitochondrial protein information by using BLAST incorporated in Mitointeractome * Shows correlation of mutation with their impact * Provides specific pathway information to aid study of their impact * Contains SNP Information
Proper citation: MitoInteractome (RRID:SCR_010225) Copy
http://www.grenoble.prabi.fr/obiwarehouse/unipathway
A manually curated database of enzyme-catalyzed and spontaneous chemical reactions. It provides a hierarchical representation of metabolic pathways and a controlled vocabulary for pathway annotation in UniProtKB. UniPathway data are cross-linked to existing metabolic resources such as ChEBI/Rhea, KEGG and MetaCyc. Users may do a quick search, browse pathway, browse compound, or browse organism.
Proper citation: UniPathway (RRID:SCR_010513) Copy
http://www.informatics.jax.org/
Community model organism database for laboratory mouse and authoritative source for phenotype and functional annotations of mouse genes. MGD includes complete catalog of mouse genes and genome features with integrated access to genetic, genomic and phenotypic information, all serving to further the use of the mouse as a model system for studying human biology and disease. MGD is a major component of the Mouse Genome Informatics.Contains standardized descriptions of mouse phenotypes, associations between mouse models and human genetic diseases, extensive integration of DNA and protein sequence data, normalized representation of genome and genome variant information. Data are obtained and integrated via manual curation of the biomedical literature, direct contributions from individual investigators and downloads from major informatics resource centers. MGD collaborates with the bioinformatics community on the development and use of biomedical ontologies such as the Gene Ontology (GO) and the Mammalian Phenotype (MP) Ontology.
Proper citation: Mouse Genome Database (RRID:SCR_012953) Copy
Open-access database of antibodies against human proteins developed through collaboration between Antibodypedia AB and the Nature Publishing Group. It aims to provide the scientific community and antibody distributors alike with information on the effectiveness of specific antibodies in specific applications--to help scientists select the right antibody for the right application. Antibodypedia's mission is to promote the functional understanding of the human proteome and expedite analysis of potential biomarkers discovered through clinical efforts. To this end, they have developed an open-access, curated, searchable database containing annotated and scored affinity reagents to aid users in selecting antibodies tailored to specific biological and biomedical assays. They envisage Antibodypedia as a virtual repository of validated antibodies against all human, and ultimately most model-organism, proteins. Such a tool will be exploitable to identify affinity reagents to document protein expression patterns in normal and pathological states and to purify proteins alone and in complex for structural and functional analyses. They hope to promote characterization of the roles and interplay of proteins and complexes in human health and disease. They encourage commercial providers to submit information regarding their inventory of antibodies with links to quality control data. Independent users can submit their own application-specific experimental data using standard validation criteria (supportive or non-supportive) developed with the assistance of an international advisory board recruited from academic research institutions. Users can also comment on specific antibodies without submitting validation data.
Proper citation: Antibodypedia (RRID:SCR_012782) Copy
SYFPEITHI is a database comprising more than 7000 peptide sequences known to bind class I and class II MHC molecules. The entries are compiled from published reports only. It contains a collection of MHC class I and class II ligands and peptide motifs of humans and other species, such as apes, cattle, chicken, and mouse, for example, and is continuously updated. Searches for MHC alleles, MHC motifs, natural ligands, T-cell epitopes, source proteins/organisms and references are possible. Hyperlinks to the EMBL and PubMed databases are included. In addition, ligand predictions are available for a number of MHC allelic products. The database is based on previous publications on T-cell epitopes and MHC ligands. It contains information on: -Peptide sequences -anchor positions -MHC specificity -source proteins, source organisms -publication references Since the number of motifs continuously increases, it was necessary to set up a database which facilitates the search for peptides and allows the prediction of T-cell epitopes. The prediction is based on published motifs (pool sequencing, natural ligands) and takes into consideration the amino acids in the anchor and auxiliary anchor positions, as well as other frequent amino acids. The score is calculated according to the following rules: The amino acids of a certain peptide are given a specific value depending on whether they are anchor, auxiliary anchor or preferred residue. Ideal anchors will be given 10 points, unusual anchors 6-8 points, auxiliary anchors 4-6 and preferred residues 1-4 points. Amino acids that are regarded as having a negative effect on the binding ability are given values between -1 and -3. Sponsors: SYFPEITHI is supported by DFG-Sonderforschungsbereich 685 and theEuropean Union: EU BIOMED CT95-1627, BIOTECH CT95-0263, and EU QLQ-CT-1999-00713.
Proper citation: SYFPEITHI: A Database for MHC Ligands and Peptide Motifs (RRID:SCR_013182) Copy
http://epsf.bmad.bii.a-star.edu.sg/cube/db/html/home.html
Cube-DB is a database of pre-evaluated conservation and specialization scores for residues in paralogous proteins belonging to multi-member families of human proteins. Protein family classification follows (largely) the classification suggested by HUGO Gene Nomenclature Committee. Sets of orhtologous protein sequences were generated by mutual-best-hit strategy using full vertebrate genomes available in Ensembl. The scores, described on documentation page, are assigned to each individual residue in a protein, and presented in the form of a table (html or downloadable xls formats) and mapped, when appropriate, onto the related structure (Jmol, Pymol, Chimera).
Proper citation: Cube-DB (RRID:SCR_013233) Copy
http://www.alzforum.org/res/com/ant/
The Alzheimer Research Forum is the web''s most dynamic scientific community dedicated to understanding Alzheimer''s disease and related disorders. It also contains a database of providers of antibodies directed against several hundred molecules and proteins of relevant to research on Alzheimer and other neurodegenerative diseases. The web site reports on the latest scientific findings, from basic research to clinical trials; creates and maintains public databases of essential research data and reagents, and produces discussion forums to promote debate, speed the dissemination of new ideas, and break down barriers across the numerous disciplines that can contribute to the global effort to cure Alzheimer''s disease. The ARF team of professional science writers and editors, information technology experts, web developers and producers all work closely with our distinguished and diverse Advisory Board to ensure a high-quality of information and services. We very much welcome our readers'' participation in all aspects of the web site. Sponsors: The Alzheimer Research Forum is an independent nonprofit organization. It is supported by grants and individual donations.
Proper citation: Alzforum Antibody Directory for Neuroscience Research (RRID:SCR_013601) Copy
http://research.bioinformatics.udel.edu/iptmnet/
A protein database which connects multiple disparate bioinformatics tools and systems text mining, data mining, analysis and visualization tools, and databases and ontologies.
Proper citation: iPTMnet (RRID:SCR_014416) Copy
http://web.mit.edu/glycomics/gt/gtdb.shtml
A pathway-based graphical interface for navigating the glycoenzyme database. The goal of the project is to define the paradigms by which carbohydrate binding proteins function in cellular communication. These pages are divided into six categories: -Glycosphingolipid: Sub-categories are Isogloboseries, Globoseries, Neo-lactoseries, Lactoseries and Ganglioseries - N-linked: Sub-categories are High-mannose, Hybrid and Complex -Mucin -Terminal Core 1 -Other O-linked -Terminal All: Includes all potential terminal structures for each glycan category
Proper citation: Glycosylation Pathways Database (RRID:SCR_013486) Copy
http://xin.cz3.nus.edu.sg/group/trmp/trmp.asp
The Therapeutically Relevant Multiple Pathways Database is designed to provide information about such multiple pathways and related therapeutic targets described in the literatures, the targeted disease conditions, and the corresponding drugs/ligands directed at each of these targets. This database currently contains 11 entries of multiple pathways, 97 entries of individual pathways, 120 targets covering 72 disease conditions along with 120 sets of drugs directed at each of these targets. Each entry can be retrieved through multiple methods including multiple pathway name, individual pathway name and disease name. Additional information provided include protein name, synonyms, Swissprot AC number, species, gene name and location, protein sequence (AASEQ) and gene sequence (NTSEQ) as well as potential therapeutic implications while applicable. Cross-links to other databases are provided which include Genecard, GDB, Locuslink, NCBI, KEGG, OMIM, SwissProt to facilitate the access of more detailed information about various aspects of the particular target or non-target protein. Queries can be submitted by entering or selecting the required information in any one or combination of the fields in the form. User can specify full name or any part of the name in a text field, or choose one item from an selection field. Sponsors: TRMP is supported by the National University of Singapore.
Proper citation: Therapeutically Relevant Multiple Pathways Database (RRID:SCR_013471) Copy
Can't find your Tool?
We recommend that you click next to the search bar to check some helpful tips on searches and refine your search firstly. Alternatively, please register your tool with the SciCrunch Registry by adding a little information to a web form, logging in will enable users to create a provisional RRID, but it not required to submit.
Welcome to the NIF Resources search. From here you can search through a compilation of resources used by NIF and see how data is organized within our community.
You are currently on the Community Resources tab looking through categories and sources that NIF has compiled. You can navigate through those categories from here or change to a different tab to execute your search through. Each tab gives a different perspective on data.
If you have an account on NIF then you can log in from here to get additional features in NIF such as Collections, Saved Searches, and managing Resources.
Here is the search term that is being executed, you can type in anything you want to search for. Some tips to help searching:
You can save any searches you perform for quick access to later from here.
We recognized your search term and included synonyms and inferred terms along side your term to help get the data you are looking for.
If you are logged into NIF you can add data records to your collections to create custom spreadsheets across multiple sources of data.
Here are the sources that were queried against in your search that you can investigate further.
Here are the categories present within NIF that you can filter your data on
Here are the subcategories present within this category that you can filter your data on
If you have any further questions please check out our FAQs Page to ask questions and see our tutorials. Click this button to view this tutorial again.