Are you sure you want to leave this community? Leaving the community will revoke any permissions you have been granted in this community.
SciCrunch Registry is a curated repository of scientific resources, with a focus on biomedical resources, including tools, databases, and core facilities - visit SciCrunch to register your resource.
ooTFD (object-oriented Transcription Factors Database) is a successor to TFD, the original Transcription Factors Database. This database is aimed at capturing information regarding the polypeptide interactions which comprise and define the properties of transcription factors. ooTFD contains information about transcription factor binding sites, as well as composite relationships within transcription factors, which frequently occur as multisubunit proteins that form a complex interface to cellular processes outside the transcription machinery through protein-protein interactions. ooTFD contains information represented in TFD but also allows the representation of containment, composite, and interaction relationships between transcription factor polypeptides. It is designed to represent information about all transcription factors, both eukaryotic and prokaryotic, basal as well as regulatory factors, and multiprotein complexes as well as monomers.
Proper citation: object-oriented Transcription Factors Database (RRID:SCR_002435) Copy
http://cubic.bioc.columbia.edu/db/LOC3d/
THIS RESOURCE IS NO LONGER IN SERVICE, documented on July 16, 2013. LOC3d is a database of predicted subcellular localization for eukaryotic proteins of known 3-D structure taken from the Protein Databank. Subcellular localization is currently predicted using four different methods: predictNLS (nuclear localization signal), LOChom (using homology), LOCkey (using keywords) and LOC3d (neural network based prediction). The reported localization is based on the method which predicts localization of a given protein with the highest confidence. LOCtree is a novel system of support vector machines (SVMs) that predict the subcellular localization of proteins, and DNA-binding propensity for nuclear proteins, by incorporating a hierarchical ontology of localization classes modeled onto biological processing pathways. Biological similarities are incorporated from the description of cellular components provided by the gene ontology consortium (GO). GO definitions have been simplified and tailored to the problem of protein sorting. Technically the ontology has been implemented using a decision tree with SVMs as the nodes. LOCtree, was extremely successful at learning evolutionary similarities among subcellular localization classes and was significantly more accurate than other traditional networks at predicting subcellular localization. Whenever available, LOCtree also reports predictions based on the following: 1) Nuclear localization signals found by PredictNLS, 2) Localization inferred using Prosite motifs and Pfam domains found in the protein, and 3) SWISS-PROT keywords associated with a protein. Localization is inferred in the last two cases using the entropy-based LOCkey algorithm. Additional information can be found in the LOCtree manuscript and associated PredictNLS and LOCkey publications.
Proper citation: Database oDatabase of Predicted Subcellular Localization for Eukaryotic PDB Chainsf Predicted Subcellular Localization for Eukaryotic PDB Chains (RRID:SCR_002831) Copy
http://eyesite.cryst.bbk.ac.uk/
THIS RESOURCE IS NO LONGER IN SERVICE. Documented on January 14,2026. An information and modeling database for families of proteins that function in the eye. Homologues are collected from all species and clustered according to tissue type, function and sequence similarity. A principal feature of the site is structural annotations, which range from experimentally solved structures to close structural neighbors to distant structure predictions. Many pre-generated homology models are provided. Other features include domain architecture analysis and pre-generated sequence alignments, and the site is extensively linked to other bioinformatic resources on the web.
Proper citation: EyeSite (RRID:SCR_002669) Copy
Bioinformatics and cheminformatics database that combines detailed drug (i.e. chemical, pharmacological and pharmaceutical) data with comprehensive drug target (i.e. sequence, structure, and pathway) information.
Proper citation: DrugBank (RRID:SCR_002700) Copy
THIS RESOURCE IS NO LONGER IN SERVICE, documented August 23, 2016. ELISA is an online database that combines functional annotation with structure and sequence homology modeling to place proteins into sequence-structure-function neighborhoods. The atomic unit of the database is a set of sequences and structural templates that those sequences encode. A graph that is built from the structural comparison of these templates is called PDUG (protein domain universe graph). It introduces a method of functional inference through a probabilistic calculation done on an arbitrary set of PDUG nodes. Further, all PDUG structures are mapped onto all fully sequenced proteomes allowing an easy interface for evolutionary analysis and research into comparative proteomics. ELISA is the first database with applicability to evolutionary structural genomics explicitly in mind.
Proper citation: Evolutionary Lineage Inferred from Structural Analysis (RRID:SCR_002343) Copy
http://www.ncbi.nlm.nih.gov/gene
Database for genomes that have been completely sequenced, have active research community to contribute gene-specific information, or that are scheduled for intense sequence analysis. Includes nomenclature, map location, gene products and their attributes, markers, phenotypes, and links to citations, sequences, variation details, maps, expression, homologs, protein domains and external databases. All entries follow NCBI's format for data collections. Content of Entrez Gene represents result of curation and automated integration of data from NCBI's Reference Sequence project (RefSeq), from collaborating model organism databases, and from many other databases available from NCBI. Records are assigned unique, stable and tracked integers as identifiers. Content is updated as new information becomes available.
Proper citation: Entrez Gene (RRID:SCR_002473) Copy
http://pir.georgetown.edu/resid/
A comprehensive collection of annotations and structures for protein modifications including amino-terminal, carboxyl-terminal and peptide chain cross-link post-translational modifications. It provides: systematic and alternate names, atomic formulas and masses, enzyme activities generating the modifications, keywords, literature citations, Gene Ontology cross-references, Protein Information Resource (PIR) and SWISS-PROT protein sequence database feature table annotations, structure diagrams and molecular models. Each RESID Database entry presents a chemically unique modification and shows how that modification is currently annotated in the protein sequence databases, Swiss-Prot and the Protein Information Resource (PIR). The RESID Database provides a table of corresponding equivalent feature annotations that is used in the UniProt project, an international effort to combine the resources of the Swiss-Prot, TrEMBL and PIR. As an annotation tool, the RESID Database is used in standardizing and enhancing modification descriptions in the feature tables of Swiss-Prot entries.
Proper citation: RESID (RRID:SCR_003505) Copy
THIS RESOURCE IS NO LONGER IN SERVICE, documented May 10, 2017. A pilot effort that has developed a centralized, web-based biospecimen locator that presents biospecimens collected and stored at participating Arizona hospitals and biospecimen banks, which are available for acquisition and use by researchers. Researchers may use this site to browse, search and request biospecimens to use in qualified studies. The development of the ABL was guided by the Arizona Biospecimen Consortium (ABC), a consortium of hospitals and medical centers in the Phoenix area, and is now being piloted by this Consortium under the direction of ABRC. You may browse by type (cells, fluid, molecular, tissue) or disease. Common data elements decided by the ABC Standards Committee, based on data elements on the National Cancer Institute''s (NCI''s) Common Biorepository Model (CBM), are displayed. These describe the minimum set of data elements that the NCI determined were most important for a researcher to see about a biospecimen. The ABL currently does not display information on whether or not clinical data is available to accompany the biospecimens. However, a requester has the ability to solicit clinical data in the request. Once a request is approved, the biospecimen provider will contact the requester to discuss the request (and the requester''s questions) before finalizing the invoice and shipment. The ABL is available to the public to browse. In order to request biospecimens from the ABL, the researcher will be required to submit the requested required information. Upon submission of the information, shipment of the requested biospecimen(s) will be dependent on the scientific and institutional review approval. Account required. Registration is open to everyone., documented September 2, 2016. Relational database containing the compositions of multi-protein complexes in the nucleus of budding yeast and human cells. Its content is limited to information curated from the proteomics literature and primarily comprises of components of the general transcription and DNA repair machinery. In addition to database browsing and searching capabilities, the PINdb web portal also includes user-friendly interactive tools for comparative analysis of the composition of multiple protein complexes and for clustering and visualizing network of protein complexes. Currently, PINdb contains mostly protein complexes that may be involved in gene transcription. To facilitate comparative analyses and identification of protein complexes, the compositional information is integrated with standardized gene nomenclature, annotation and protein sequences from public databases. The PINdb web interface provides a number of tools for (1) comparison of protein complexes, (2) search for a protein complex by its published name or by a partial list of its components and (3) browsing specific subsets or a functional classification of the complexes.
Proper citation: PINdb (RRID:SCR_003348) Copy
http://bioinformatics.charite.de/synsysnet/
THIS RESOURCE IS NO LONGER IN SERVICE. Documented on August 19,2025. A curated database for synaptic proteins that provides adequate definitions of pre- and post-synaptic proteins, proteins present in sub-domains of the synapse, e.g. the synaptic vesicle and associated proteins, lipid rafts and postsynaptic density. In addition to data that was and will be gathered from the experiments conducted within SynSys - A European expertise Network on building the synapse, they have extracted and manually curated all relevant data on these proteins from other sources and provided an ontology for these. Novel splice forms are being identified that can be matched with proteomics data. Information on proteins, their 3D structure, binding small molecules Protein-Protein-Interactions (PPIs) and Compound-Protein-Interactions are integrated. Proteins or compounds can be searched and Interactive Networks can be visualized. The point Diseases present neurological diseases, to illustrate the role of SynSysNet in the medication.
Proper citation: SynSysNet (RRID:SCR_003180) Copy
http://www.lipidmaps.org/data/proteome/LMPD.php
Database of lipid related proteins representing human and mouse proteins involved in lipid metabolism. Collection of lipid related genes and proteins contains data for genes and proteins from Homo sapiens, Mus musculus, Rattus norvegicus, Saccharomyces cerevisiae, Caenorhabditis elegans, Escherichia coli, Macaca mulata, Drosophila melanogaster, Arabidopsis thaliana and Danio rerio.
Proper citation: LIPID MAPS Proteome Database (RRID:SCR_003062) Copy
http://www.ncbi.nlm.nih.gov/protein
Databases of protein sequences and 3D structures of proteins. Collection of sequences from several sources, including translations from annotated coding regions in GenBank, RefSeq and TPA, as well as records from SwissProt, PIR, PRF, and PDB.
Proper citation: NCBI Protein Database (RRID:SCR_003257) Copy
http://biocomputer.bio.cuhk.edu.hk/T3DB/
Database aimed to annotate all bacterial Type III Secretion System (T3SS) related structure, effector, regulator, and auxiliary genes.
Proper citation: Type-III-Secretion-System related database (RRID:SCR_002941) Copy
http://integrativebiology.org/
Database for molecular interaction information integrated with various other bio-entity information, including pathways, diseases, gene ontology (GO) terms, species and molecular types. The information is obtained from several manually curated databases and automatic extraction from literature. There are protein-protein interaction, gene/protein regulation and protein-small molecule interaction information stored in the database. The interaction information is linked with relevant GO terms, pathway, disease and species names. Interactions are also linked to the PubMed IDs of the corresponding abstracts the interactions were obtained from. Manually curated molecular interaction information was obtained from BioGRID, IntAct, NCBI Gene, and STITCH database. Pathway related information was obtained from KEGG database, Pathway Interaction database and Reactome. Disease information was obtained from PharmGKB and KEGG database. Gene ontology terms and related information was obtained from Gene Ontology database and GOA database.
Proper citation: Integrated Molecular Interaction Database (RRID:SCR_003546) Copy
http://www.ebi.ac.uk/goldman-srv/pandit
PANDIT is a collection of multiple sequence alignments and phylogenetic trees covering many common protein domains. It contains: * the seed protein sequence alignments from the Pfam-A (curated families) database (version 17.0) * nucleotide sequence alignments derived from sequences available for the above and using the protein alignments as "templates"; * protein sequence alignments restricted to the family members for which nucleotide sequences are available * inferred phylogenetic trees for each alignment The data in PANDIT and the dataset's development have been frozen owing to a lack of funding support. The existing data, version 17.0 corresponding to Pfam 17.0, remain stable and, we hope, useful. The entire database is also available for download as a flatfile from this website.
Proper citation: PANDIT : Protein and Associated Nucleotide Domains with Inferred Trees (RRID:SCR_003321) Copy
http://www.ncbi.nlm.nih.gov/structure
Database of three-dimensional structures of macromolecules that allows the user to retrieve structures for specific molecule types as well as structures for genes and proteins of interest. Three main databases comprise Structure-The Molecular Modeling Database; Conserved Domains and Protein Classification; and the BioSystems Database. Structure also links to the PubChem databases to connect biological activity data to the macromolecular structures. Users can locate structural templates for proteins and interactively view structures and sequence data to closely examine sequence-structure relationships. * Macromolecular structures: The three-dimensional structures of biomolecules provide a wealth of information on their biological function and evolutionary relationships. The Molecular Modeling Database (MMDB), as part of the Entrez system, facilitates access to structure data by connecting them with associated literature, protein and nucleic acid sequences, chemicals, biomolecular interactions, and more. It is possible, for example, to find 3D structures for homologs of a protein of interest by following the Related Structure link in an Entrez Protein sequence record. * Conserved domains and protein classification: Conserved domains are functional units within a protein that act as building blocks in molecular evolution and recombine in various arrangements to make proteins with different functions. The Conserved Domain Database (CDD) brings together several collections of multiple sequence alignments representing conserved domains, in addition to NCBI-curated domains that use 3D-structure information explicitly to define domain boundaries and provide insights into sequence/structure/function relationships. * Small molecules and their biological activity: The PubChem project provides information on the biological activities of small molecules and is a component of NIH''''s Molecular Libraries Roadmap Initiative. PubChem includes three databases: PCSubstance, PCBioAssay, and PCCompound. The PubChem data are linked to other data types (illustrated example) in the Entrez system, making it possible, for example, to retrieve information about a compound and then Link to its biological activity data, retrieve 3D protein structures bound to the compound and interactively view their active sites, and find biosystems that include the compound as a component. * Biological Systems: A biosystem, or biological system, is a group of molecules that interact directly or indirectly, where the grouping is relevant to the characterization of living matter. The NCBI BioSystems Database provides centralized access to biological pathways from several source databases and connects the biosystem records with associated literature, molecular, and chemical data throughout the Entrez system. BioSystem records list and categorize components (illustrated example), such as the genes, proteins, and small molecules involved in a biological system. The companion FLink icon FLink tool, in turn, allows you to input a list of proteins, genes, or small molecules and retrieve a ranked list of biosystems.
Proper citation: NCBI Structure (RRID:SCR_004218) Copy
http://anya.igsb.anl.gov/Geneways/GeneWays.html
System for automatically extracting, analzying, visualizing and integrating molecular pathway data from the research literature. System focuses on interactions between molecular substances and actions, providing a graphical consensus view on the collected information. GeneWays is designed as open platform, allowing researchers to query, review and critique integrated information.
Proper citation: GeneWays (RRID:SCR_000572) Copy
https://www.expasy.org/resources/uniprotkb-swiss-prot
Curated component of UniProtKB (produced by the UniProt consortium). It contains hundreds of thousands of protein descriptions, including function, domain structure, subcellular location, post-translational modifications and functionally characterized variants.
Proper citation: UniProtKB/Swiss-Prot (RRID:SCR_021164) Copy
Database of protein structure predictions by AlphaFold that are freely and openly available to global scientific community. Included are nearly all catalogued proteins known to science. Provides programmatic access to and interactive visualization of predicted atomic coordinates, per residue and pairwise model confidence estimates and predicted aligned errors.
Proper citation: AlphaFold Protein Structure Database (RRID:SCR_023662) Copy
http://www.glycosciences.de/glycocd/
Manually curated, comprehensive repository of clusters of differentiation (CDs) which are a) defined as distinct oligosaccharide sequences as part of either glycoproteins and/or glycosphingolipids and b) defined as proteins which have carbohydrate recognition sites (CRDs) or as carbohydrate binding lectins. The data base is generated by exhaustive search of literature and other online data banks related to carbohydrates and proteins. This data bank is the beginning of an effort to provide concise, relevant information of carbohydrate-related CDs in a user- friendly manner. For users convenience the data bank under menu browse of GlycoCD is arranged in two section namely carbohydrate recognition CDs (CRD CD) and glycan CD. The carbohydrate recognition CD part is the collection of proteins which recognize glycan structures by means of the CRDs. Glycan CD is the part in which CDs are summarized which characterize specific oligosaccharide structures. The GlycoCD databank has been developed with the aim to assist the immunologist, cell biologist as well as the clinician who wants to keep up with the present knowledge in this field of glycobiology.
Proper citation: Glyco-CD (RRID:SCR_001574) Copy
Commercial antibody vendor which supplies antibodies and other products to life science researchers.
Proper citation: Novus Biologicals (RRID:SCR_004286) Copy
Can't find your Tool?
We recommend that you click next to the search bar to check some helpful tips on searches and refine your search firstly. Alternatively, please register your tool with the SciCrunch Registry by adding a little information to a web form, logging in will enable users to create a provisional RRID, but it not required to submit.
Welcome to the NIF Resources search. From here you can search through a compilation of resources used by NIF and see how data is organized within our community.
You are currently on the Community Resources tab looking through categories and sources that NIF has compiled. You can navigate through those categories from here or change to a different tab to execute your search through. Each tab gives a different perspective on data.
If you have an account on NIF then you can log in from here to get additional features in NIF such as Collections, Saved Searches, and managing Resources.
Here is the search term that is being executed, you can type in anything you want to search for. Some tips to help searching:
You can save any searches you perform for quick access to later from here.
We recognized your search term and included synonyms and inferred terms along side your term to help get the data you are looking for.
If you are logged into NIF you can add data records to your collections to create custom spreadsheets across multiple sources of data.
Here are the sources that were queried against in your search that you can investigate further.
Here are the categories present within NIF that you can filter your data on
Here are the subcategories present within this category that you can filter your data on
If you have any further questions please check out our FAQs Page to ask questions and see our tutorials. Click this button to view this tutorial again.