Are you sure you want to leave this community? Leaving the community will revoke any permissions you have been granted in this community.
SciCrunch Registry is a curated repository of scientific resources, with a focus on biomedical resources, including tools, databases, and core facilities - visit SciCrunch to register your resource.
| Resource Name | Proper Citation | Abbreviations | Resource Type |
Description |
Keywords | Resource Relationships | |||||||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
|
Combinatorial Extension (CE) Resource Report Resource Website 1+ mentions |
Combinatorial Extension (CE) (RRID:SCR_007585) | CE | data or information resource, database | CE is a databases of alignments for all polypeptide chains. A representative set of proteins is available and kept current with the PDB, a method for calculating pairwise structure alignments. CE aligns two polypeptide chains using characteristics of their local geometry as defined by vectors between C alpha positions. Matches are termed aligned fragment pairs (AFPs). Heuristics are used in defining a set of optimal paths joining AFPs with gaps as needed. The path with the best RMSD is subject to dynamic programming to achieve an optimal alignment. For specific families of proteins additional characteristics are used to weight the alignment. Complete details are described in the paper (PDF format). Databases of alignments for all polypeptide chains and a representative set of proteins is available and kept current with the PDB | polypeptide, polypeptide chain, protein, protein alignment, protein structure |
is listed by: 3DVC is related to: Research Collaboratory for Structural Bioinformatics Protein Data Bank (RCSB PDB) has parent organization: University of California at San Diego; California; USA |
nif-0000-02648 | http://cl.sdsc.edu/ce.html | SCR_007585 | Combinatorial extension | 2026-02-11 10:57:38 | 5 | |||||||
|
COG Resource Report Resource Website 1000+ mentions |
COG (RRID:SCR_007139) | COG, COG Cluster, COG Function, COG Pathway | data or information resource, database | A database for phylogenetic classification for proteins encoded in complete genomes. Clusters of Orthologous Groups of proteins (COGs) were delineated by comparing protein sequences encoded in complete genomes, representing major phylogenetic lineages. Each COG consists of individual proteins or groups of paralogs from at least 3 lineages and thus corresponds to an ancient conserved domain. Please be aware that COGs hasn't been updated in many years and will not be. | ortholog, protein, cog, conserved protein sequence, unicellular cluster, genome, order, class, phyla, eukaryotic cluster, gold standard |
is listed by: OMICtools is related to: MLTreeMap is related to: ProOpDB is related to: Conserved Domain Database has parent organization: NCBI is parent organization of: Clusters of Orthologous Groups Analysis Ontology |
PMID:12969510 PMID:9381173 |
OMICS_01688, nif-0000-02672 | SCR_007139 | COG Database, Clusters of Orthologous Groups of proteins, COGs, COGs - Clusters of Orthologous Groups of proteins, COGs - Phylogenetic classification of proteins encoded in complete genomes, COG Cluster, COG Pathway, COG Function | 2026-02-11 10:57:30 | 1117 | |||||||
|
DisProt - Database of Protein Disorder Resource Report Resource Website 100+ mentions |
DisProt - Database of Protein Disorder (RRID:SCR_007097) | data or information resource, database | The Database of Protein Disorder (DisProt) is a curated database that provides information about proteins that lack fixed 3D structure in their putatively native states, either in their entirety or in part. Users can BLAST sequences, browse by protein name, or view by protein function and functional subclass. | protein, protein structure, bio.tools, FASEB list |
is listed by: bio.tools is listed by: Debian has parent organization: Temple University; Pennsylvania; USA |
nif-0000-02754, r3d100010561, biotools:disprot | https://bio.tools/disprot https://doi.org/10.17616/R3NG75 |
http://divac.ist.temple.edu/disprot | SCR_007097 | DisProt | 2026-02-11 10:57:29 | 204 | |||||||
|
Genomic Distribution of structural Superfamilies Resource Report Resource Website 1+ mentions |
Genomic Distribution of structural Superfamilies (RRID:SCR_007670) | data or information resource, database | Genomic Distribution of structural Superfamilies identifies and classifies evolutionary related proteins at the superfamily level in whole genome databases. GenDiS has been curated in direct correspondence with SCOP and represents 4001 highly resolved domains in 1194 structural superfamilies across protein sequence databases. Sequences showing reliable homology to entries in SCOP and PASS2 databases have been obtained from the non-redundant protein sequence database and aligned. Similar alignments of the superfamily members are provided in the genome level. GenDiS provides a platform for cross genome comparison at the superfamily level. GenDis relates proteins sequence information across all strata of taxonomy. One may navigate through the database to obtain structural homologues across different levels in taxonomic classification. The nomenclature of the various genomes and their hierarchy is in direct correspondence with the taxonomy database maintained at the NCBI. Sequence homologues for the various structural members are obtained from the non-redundant protein sequence database employing sensitive sequence search methods. Multiple approaches such as PSI-BLAST, HMMsearch of the HMMer suite and an interacting motif constrained PHI-BLAST have been employed to identify homologues in the sequence databases. | protein, protein superfamily | nif-0000-02874 | SCR_007670 | GenDiS | 2026-02-11 10:57:40 | 2 | ||||||||||
|
Colibri Resource Report Resource Website 100+ mentions |
Colibri (RRID:SCR_007606) | Colibri | data or information resource, database | Database dedicated to the analysis of the genome of Escherichia coli. Its purpose is to collate and integrate various aspects of the genomic information from E. coli, the paradigm of Gram-negative bacteria. Colibri provides a complete dataset of DNA and protein sequences derived from the paradigm strain E. coli K-12, linked to the relevant annotations and functional assignments. It allows one to easily browse through these data and retrieve information, using various criteria (gene names, location, keywords, etc.). The data contained in Colibri originates from two major sources of information, the reference genomic DNA sequence from the E. coli Genome Project and the feature annotations from the EcoGene data collection., THIS RESOURCE IS NO LONGER IN SERVICE. Documented on September 16,2025. | dna, genome, protein, escherichia coli, escherichia coli genome, escherichia coli protein |
is related to: EcoGene has parent organization: Pasteur Institute |
CNRS ; French Ministry of Higher Education and Research |
PMID:8246843 | THIS RESOURCE IS NO LONGER IN SERVICE | nif-0000-02676 | SCR_007606 | 2026-02-11 10:57:37 | 229 | ||||||
|
DB-PABP: a database of polyanion binding proteins Resource Report Resource Website |
DB-PABP: a database of polyanion binding proteins (RRID:SCR_007603) | data or information resource, database | THIS RESOURCE IS NO LONGER IN SERVICE, documented August 23, 2016. DB-PABP is an attempt to document the publicly available experimentally determined polyanion binding proteins (PABPs). The purpose of the database is to provide life scientists who are interested in PA/PABP interactions with a comprehensive data repository, as well as computer scientists with a publicly available dataset to perform knowledge discovery and datamining studies. The database is manually curated. It uses protein annotations from NCBI protein database and literature information is retrieved from PubMed. Whenever applicable, links to NCBI protein database and PubMed are provided so users may access additional information available in these public databases. | pabp, polyanion, polyanion binding protein, polyanion binding protein interaction, polyanion interactions, protein | has parent organization: University of Kansas; Kansas; USA | THIS RESOURCE IS NO LONGER IN SERVICE | nif-0000-02724 | SCR_007603 | DB-PABP | 2026-02-11 10:57:42 | 0 | ||||||||
|
dbPTM: An informational repository of proteins and post-translational modifications Resource Report Resource Website 100+ mentions |
dbPTM: An informational repository of proteins and post-translational modifications (RRID:SCR_007619) | data or information resource, database | dbPTM is a database that compiles information on protein post-translational modifications (PTM) such as the modified sites, solvent accessibility of surrounding amino acids, protein secondary and tertiary structures, protein domains, and protein variations. The version 2.0 of dbPTM integrates the experimentally validated PTM sites with referable literatures from Swiss-Prot, Phospho.ELM, O-GLYCBASE, and UbiProt. In all of the collected PTM information, about 25 types of PTM with enough experimentally validated sites are trained the profile hidden Markov models (HMMs) to detect the potential PTM sites with 100% specificity against Swiss-Prot proteins. To help users investigating more detail in each type of PTM, the substrate peptide specificity such as positional amino acid frequency, solvent accessibility and secondary structure surrounding the modified sites are also provided. Moreover, the information of orthologous protein clusters is provided to users for analyzing whether the PTM sites located in the evolutionary conserved regions or not., THIS RESOURCE IS NO LONGER IN SERVICE. Documented on September 16,2025. | protein, protein post-translational modification, ptm | THIS RESOURCE IS NO LONGER IN SERVICE | nif-0000-02730 | SCR_007619 | dbPTM | 2026-02-11 10:57:37 | 112 | |||||||||
|
FireDB Resource Report Resource Website 1+ mentions |
FireDB (RRID:SCR_007655) | FireDB | data or information resource, database | A database of Protein Data Bank structures, ligands and annotated functional site residues. The database can be accessed by PDB codes or UniProt accession numbers as well as keywords. FireDB contains information on every chemical compound in the PDB, including their descriptions, the PDB structures in which the compounds are found and the amino acids that are in contact with the ligand. | protein, protein structure, pdb, bio.tools |
uses: Research Collaboratory for Structural Bioinformatics Protein Data Bank (RCSB PDB) is listed by: bio.tools is listed by: Debian has parent organization: Spanish National Cancer Research Center |
nif-0000-02839, biotools:firedb | https://bio.tools/firedb | SCR_007655 | 2026-02-11 10:57:43 | 7 | ||||||||
|
Functional Coverage of the Proteome Resource Report Resource Website 1+ mentions |
Functional Coverage of the Proteome (RRID:SCR_007654) | data or information resource, database | FCP is a publicly accessible web tool dedicated to analyzing the current state and trends of available proteome structures along the classification schemes of enzymes and nuclear receptors. It offers both graphical and quantitative data on the degree of functional coverage in that portion of the proteome by existing structures and on the bias observed in the distribution of those structures among proteins. Users can choose to search the website based on structures or ligands, and can also sort by enzyme or receptor. Users can also view data based on structural and population (species) filters. | enzyme, nuclear receptor, protein, proteome, proteome structure | has parent organization: Pompeu Fabra University; Barcelona; Spain | nif-0000-02834 | SCR_007654 | FCP | 2026-02-11 10:57:38 | 2 | |||||||||
|
Human PAML Browser Resource Report Resource Website 1+ mentions |
Human PAML Browser (RRID:SCR_007715) | data or information resource, database | THIS RESOURCE IS NO LONGER IN SERVICE, documented on July 16, 2013. It provides access to the results of tests for positive selection in 14,000 human genes. Multiple alignments of protein-coding regions of genes from human and other mammals were extracted from whole-genome alignments available from UC-Santa Cruz. Each gene was analyzed using the maximum likelihood tests of selection using PAML. Branch, site, and branch+site tests were performed, each with at least one matching null model. | positive selection, protein | has parent organization: Case Western Reserve University; Ohio; USA | THIS RESOURCE IS NO LONGER IN SERVICE | nif-0000-02996 | SCR_007715 | Human PAML Browser | 2026-02-11 10:57:39 | 1 | ||||||||
|
SIMAP Resource Report Resource Website 10+ mentions |
SIMAP (RRID:SCR_007927) | SIMAP | data or information resource, database | It provides a database based on a pre-computed similarity matrix covering the similarity space formed by >4 million amino acid sequences from public databases and completely sequenced genomes. The database is capable of handling very large datasets and is updated incrementally. For sequence similarity searches and pairwise alignments, we implemented a grid-enabled software system, which is based on FASTA heuristics and the Smith Waterman algorithm. SimpleSIMAP and AdvancedSIMAP retrieve homologs for given protein sequences that need to be contained in the SIMAP database. While SimpleSIMAP provides only selected parameters and preconfigured search spaces, the AdvancedSIMAP allows the user to specify search space, filtering and sorting parameters in a flexible manner. Both types of queries result in lists of homologs that are linked in turn to their homologs. So the web interfaces allow users to explore quickly and interactively the protein world by homology. Sponsors: SIMAP is supported by the Department of Genome Oriented Bioinformatics of the Technische Universitt Mnchen and the Institute for Bioinformatics of the GSF-National Research Center for Environment and Health. | genome, alignment, amino acid, homolog, matrix, protein, protein domain and protein classification databases, sequence | SCR_007927 | The Similarity Matrix of Proteins, Similarity Matrix of Proteins | 2026-02-11 10:57:42 | 14 | ||||||||||
|
Systematic Platform for Identifying Mutated Proteins (SysPIMP) Resource Report Resource Website 1+ mentions |
Systematic Platform for Identifying Mutated Proteins (SysPIMP) (RRID:SCR_007954) | SysPIMP | data or information resource, database | A database ofhuman disease-related mutated proteins identified by mass-spectrometry (MS). For achieving this goal, we collected human mutated sequences known to be related to diseases till now. After surveying mutated sequence sources: PMD, OMIM, SwissProt polymorphism, HGMD, etc, we found that currently HGMD contains the largest human gene mutation information. However, because, for academic users, HGMD does not provide with whole data download service, we decided to systematically extract and curate mutation information from PMD, OMIM, SwissProt, MSIPI database to form SysPIMP and provide it free for academic users. | human disease, mutation, protein |
has parent organization: Shanghai Jiao Tong University; Shanghai; China has parent organization: Chinese Academy of Sciences; Beijing; China |
nif-0000-03527 | SCR_007954 | Systematic Platform for Identifying Mutated Proteins | 2026-02-11 10:57:51 | 2 | ||||||||
|
SYSTERS Resource Report Resource Website 1+ mentions |
SYSTERS (RRID:SCR_007955) | data or information resource, database | SYSTERS is a database of protein sequences grouped into homologous families and superfamilies. The SYSTERS project aims to provide a meaningful partitioning of the whole protein sequence space by a fully automatic procedure. A refined two-step algorithm assigns each protein to a family and a superfamily. The sequence data underlying SYSTERS release 4 now comprise several protein sequence databases derived from completely sequenced genomes (ENSEMBL, TAIR, SGD and GeneDB), in addition to the comprehensive Swiss-Prot/TrEMBL databases. To augment the automatically derived results, information from external databases like Pfam and Gene Ontology are added to the web server. Furthermore, users can retrieve pre-processed analyses of families like multiple alignments and phylogenetic trees. New query options comprise a batch retrieval tool for functional inference about families based on automatic keyword extraction from sequence annotations. A new access point, PhyloMatrix, allows the retrieval of phylogenetic profiles of SYSTERS families across organisms with completely sequenced genomes. Gene, Human, Vertebrate, Genome, Human ORFs | family, gene, genome, human, human orfs, protein, superfamily, vertebrate | has parent organization: Max Planck Institute for Molecular Genetics; Berlin; Germany | nif-0000-03528 | SCR_007955 | SYSTERS | 2026-02-11 10:57:43 | 7 | |||||||||
|
SUPERFAMILY Resource Report Resource Website 100+ mentions |
SUPERFAMILY (RRID:SCR_007952) | data or information resource, database | SUPERFAMILY is a database of structural and functional protein annotations for all completely sequenced organisms. The SUPERFAMILY annotation is based on a collection of hidden Markov models, which represent structural protein domains at the SCOP superfamily level. A superfamily groups together domains which have an evolutionary relationship. The annotation is produced by scanning protein sequences from over 1,700 completely sequenced genomes against the hidden Markov models. | protein, hmm, hidden markov model, genome, structure, homology, model, FASEB list |
is listed by: 3DVC is related to: DBD: Transcription factor prediction database has parent organization: University of Bristol; Bristol; United Kingdom |
PMID:11697912 | nif-0000-03511 | http://supfam.org, http://stash.mrc-lmb.cam.ac.uk/SUPERFAMILY | SCR_007952 | Superfamily - HMM library and genome assignments server | 2026-02-11 10:57:43 | 325 | |||||||
|
Invitrogen iPath Resource Report Resource Website 50+ mentions |
Invitrogen iPath (RRID:SCR_008120) | data or information resource, database | THIS RESOURCE IS NO LONGER IN SERVICE, documented on August 26, 2016. LINNEA Pathways is a user-friendly comprehensive online resource for gene- or protein-based scientific research. It is based on a total of 248 signaling and metabolic human biological pathway maps created for Invitrogen by GeneGo. The current version of iPath features 225 maps displaying human regulatory and metabolic pathways established in experimental literature produced by MetaCore from GeneGo, Inc. The map objects (proteins, genes, EC functions, and compounds) are connected via metabolic transformations and physical protein interactions, which were assembled by the GeneGo team of experienced annotators, geneticists, and biochemists. The pathways are organized in a vertical fashion following the general signaling path from signaling molecules and membrane receptors, via signal transduction cascades, to transcription factors and their gene targets. Following the natural organization of cellular machinery with highly interconnected pathways and modules, many maps are linked together via hyperlinked box symbols. Such linkage allows the reconstruction of a big picture view of human cell biology., THIS RESOURCE IS NO LONGER IN SERVICE. Documented on September 16,2025. | gene, genetic, metabolic, molecule, pathway, protein, receptor, research, signaling | THIS RESOURCE IS NO LONGER IN SERVICE | nif-0000-20864 | SCR_008120 | iPath | 2026-02-11 10:57:45 | 88 | |||||||||
|
Search Tool for Interactions of Chemicals Resource Report Resource Website 500+ mentions |
Search Tool for Interactions of Chemicals (RRID:SCR_007947) | STITCH | data or information resource, database | Database to explore known and predicted interactions of chemicals and proteins. It integrates information about interactions from metabolic pathways, crystal structures, binding experiments and drug-target relationships. Inferred information from phenotypic effects, text mining and chemical structure similarity is used to predict relations between chemicals. STITCH further allows exploring the network of chemical relations, also in the context of associated binding proteins. Each proposed interaction can be traced back to the original data sources. The database contains interaction information for over 68,000 different chemicals, including 2200 drugs, and connects them to 1.5 million genes across 373 genomes and their interactions contained in the STRING database. | drug-target relationship, chemical, chemical-protein interaction, chemical relationship, crystal structure, metabolic pathway interaction, protein, interaction, small molecule, drug, interaction network, FASEB list |
is listed by: OMICtools is related to: Integrated Molecular Interaction Database has parent organization: European Molecular Biology Laboratory |
BMBF ; European Union FP6 EMBO ; ProBioC |
PMID:22075997 PMID:19897548 PMID:18084021 |
r3d100012165, OMICS_01589, nif-0000-03499 | https://doi.org/10.17616/R3606X https://doi.org/10.17616/R3606X |
SCR_007947 | STITCH: Chemical-Protein Interactions | 2026-02-11 10:57:43 | 754 | |||||
|
Intergrated Transcription Factor Platform Resource Report Resource Website 10+ mentions |
Intergrated Transcription Factor Platform (RRID:SCR_008119) | data or information resource, database | ITFP is an integrated transcription factor (TF) platform, which included abundant TFs and targets message of mammalian. Support vector machine (SVM) algorithm combined with error-correcting output coding (ECOC) algorithm was utilized to identify and classify transcription factor from protein sequence of Human, Mouse and Rat. For transcription factor targets, a reverse engineering method named ARACNE was used to derive potential interaction pairs between transcription factor and downstream regulated gene from Human, Mouse and Rat gene expression profile data. Detailed information of gene expression profile data can be found in help page. Moreover, all data provided by the platform is free for non-commercial users and can be downloaded through links on help page. | expression, gene, human, message, mouse, protein, rat, sequence, target, transcription factor | has parent organization: Fudan University; Shanghai; China | nif-0000-20862 | SCR_008119 | ITFP | 2026-02-11 10:57:55 | 25 | |||||||||
|
Interaction Reference Index Web Interface Resource Report Resource Website 10+ mentions |
Interaction Reference Index Web Interface (RRID:SCR_008118) | data or information resource, database | iRefWeb is an interface to a relational database containing the latest build of the interaction Reference Index (iRefIndex) which integrates protein interaction data from nine different interaction databases: BioGRID, BIND, CORUM, DIP, HPRD, INTACT, MINT, MPPI, MPACT and OPHID. Integration is achieved through a rigorously documented procedure for mapping protein IDs across databases, enabling systematic backtracking of the links used to establish the identity of the interaction partners. The iRefWeb interface groups interaction records from the different databases into a single non-redundant view. In particular iRefWeb facilitates comparing interaction records as seen by the various source databases relative to the PubMeds they were annotated from. iRefWeb is one of several views of the iRefIndex resource. Data are also available in a tab-delimited plain-text format (PSI-MITAB) as well as planned releases of a PSI-XML formatted version and a Cytoscape plugin. Further details about the iRefIndex project as well as data downloads are available from here . The method used to build iRefIndex is described in a recent publication. | bind, biogrid, corum, dip, hprd, intact, interaction, mint, mpact, mppi, ophid, protein | nif-0000-20861, r3d100012725 | https://doi.org/10.17616/R31B8K https://doi.org/10.17616/R31B8K |
SCR_008118 | iRefWeb | 2026-02-11 10:57:45 | 27 | |||||||||
|
Ingenuity Pathways Knowledge Base Resource Report Resource Website 1000+ mentions |
Ingenuity Pathways Knowledge Base (RRID:SCR_008117) | data or information resource, database | A horizontally and vertically structured database that pulls scientific and medical information and describes it consistently using the Ingenuity Ontology. The Knowledge Base pulls information from journals, public molecular content databases, and textbooks. Data is curated and and integrated into the Knowledge Base . | gene, life science, molecule, protein, research, software, ontology, knowledge base, FASEB list |
is used by: Ingenuity Pathway Analysis is listed by: SoftCite has parent organization: QIAGEN |
Available to the research community, Commercial | nif-0000-20858 | SCR_008117 | Ingenuity Knowledge Base | 2026-02-11 10:57:45 | 4627 | ||||||||
|
BoLA Nomenclature: International Society for Animal Genetics Resource Report Resource Website 10+ mentions |
BoLA Nomenclature: International Society for Animal Genetics (RRID:SCR_008142) | data or information resource, database | This website is intended to be the definitive source of information on the bovine major histocompatibility complex - its genes, proteins and polymorphism. Its purpose is to collate data on the Bovine Leucocyte Antigens (BoLA) and provide a forum for the analysis and nomenclature of polymorphisms in the genes and proteins of the bovine MHC. The BoLA nomenclature committee is a standing committee of the International Society for Animal Genetics. Its purpose is to collate data on the Bovine Leucocyte Antigens (BoLA) and provide a forum for the analysis and nomenclature of polymorphisms in the genes and proteins of the bovine MHC. The information gathered here is based on the BoLA workshop reports, which are published in Animal Genetics and the European Journal of Immunogenetics. The workshop report data are reproduced with the permission of the publishers Blackwell Science, and other text on the site is used with the permission of CRC Press. | gene, genetic, animal, antigen, bovine, complex, histocompatibility, immunogenetic, leucocyte, nomenclature, polymorphism, protein, journal article | has parent organization: University of Edinburgh; Scotland; United Kingdom | nif-0000-20967 | http://www.projects.roslin.ac.uk/bola/bolahome.html | SCR_008142 | International Society for Animal Genetics | 2026-02-11 10:57:45 | 16 |
Can't find your Tool?
We recommend that you click next to the search bar to check some helpful tips on searches and refine your search firstly. Alternatively, please register your tool with the SciCrunch Registry by adding a little information to a web form, logging in will enable users to create a provisional RRID, but it not required to submit.
Welcome to the NIF Resources search. From here you can search through a compilation of resources used by NIF and see how data is organized within our community.
You are currently on the Community Resources tab looking through categories and sources that NIF has compiled. You can navigate through those categories from here or change to a different tab to execute your search through. Each tab gives a different perspective on data.
If you have an account on NIF then you can log in from here to get additional features in NIF such as Collections, Saved Searches, and managing Resources.
Here is the search term that is being executed, you can type in anything you want to search for. Some tips to help searching:
If you are logged into NIF you can add data records to your collections to create custom spreadsheets across multiple sources of data.
Here are the facets that you can filter the data by.
If you have any further questions please check out our FAQs Page to ask questions and see our tutorials. Click this button to view this tutorial again.