Are you sure you want to leave this community? Leaving the community will revoke any permissions you have been granted in this community.
SciCrunch Registry is a curated repository of scientific resources, with a focus on biomedical resources, including tools, databases, and core facilities - visit SciCrunch to register your resource.
http://cbl-gorilla.cs.technion.ac.il/
A tool for identifying and visualizing enriched GO terms in ranked lists of genes. It can be run in one of two modes: * Searching for enriched GO terms that appear densely at the top of a ranked list of genes or * Searching for enriched GO terms in a target list of genes compared to a background list of genes.
Proper citation: GOrilla: Gene Ontology Enrichment Analysis and Visualization Tool (RRID:SCR_006848) Copy
http://murphylab.web.cmu.edu/services/SLIF/
SLIF finds fluorescence microscope images in on-line journal articles, and indexes them according to cell line, proteins visualized, and resolution. Images can be accessed via the SLIF Web database. SLIF takes on-line papers and scans them for figures that contain fluorescence microscope images (FMIs). Figures typically contain multiple FMIs, to SLIF must segment these images into individual FMIs. When the FMI images are extracted, annotations for the images (for instance, names of proteins and cell-lines) are also extracted from the accompanying caption text. Protein annotation are also used to link to external databases, such as the Gene Ontology DB. The more detailed process includes: segmentation of images into panels; panel classification, to find FMIs; segmentation of the caption, to find which portions of the caption apply to which panels; text-based entity extraction; matching of extracted entities to database entries; extraction of panel labels from text and figures; and alignment of the text segments to the panels. Extracted FMIs are processed to find subcellular location features (SLFs), and the resulting analyzed, annotated figures are stored in a database, which is accessible via SQL queries.
Proper citation: Subcellular Location Image Finder (RRID:SCR_006723) Copy
http://wwwmgs.bionet.nsc.ru/mgs/programs/panalyst/
WebProAnalyst provides web-accessible analysis for scanning the quantitative structure-activity relationships in protein families. It searches for a sequence region, whose substitutions are correlated with variations in the activities of a homologous protein set, the so-called activity modulating sites. WebProAnalyst allows users to search for the key physicochemical characteristics of the sites that affect the changes in protein activities. It enables the building of multiple linear regression and neural networks models that relate these characteristics to protein activities. WebProAnalyst implements multiple linear regression analysis, back propagation neural networks and the Structure-Activity Correlation/Determination Coefficient (SACC/SADC). A back propagation neural network is implemented as a two-layered network, one layer as input, the other as output (Rumelhart et al, 1986). WebProAnalyst uses alignment of amino acid sequences and data on protein activity (pK, Km, ED50, among others). The input data are the numerical values for the physicochemical characteristics of a site in the multiple alignment given by a slide window. The output data are the predicted activity values. The current version of WebProAnalyst handles a single activity for a single protein. The SACC/SADC may be defined as an estimate of the strongest multiple correlation between the physicochemical characteristics of a site in a multiple alignment and protein activities. The SACC/SADC coefficient makes possible the calculation of the possible highest correlation achievable for the quantitative relationship between the physicochemical properties of sites and protein activities. The SACC/SADC is a convenient means for an arrangement of positions by their functional significance. WebProAnalyst outputs a list of multiple alignment positions, the respective correlation values, also regression analysis parameters for the relationships between the amino acid physicochemical characteristics at these positions and the protein activity values.
Proper citation: Webproanalyst (RRID:SCR_008348) Copy
http://www.glycosciences.de/tools/carp/
Service that generates Ramachandran-like plots of carbohydrate linkage torsions in pdb-files. The Ramachandran Plot, where backbone torsion angles are plotted against each other, is a frequently used tool to evaluate the quality of a protein 3D structure. For carbohydrate structures, linkage torsions can be evaluated in a similar way. Preferred Phi/Psi values of the torsion angles of glycosidic bonds depend strongly on the types of monosaccharides involved in the linkage, the kind of linkage (1-3, 1-4, etc) as well as the degree of branching of the structure. CARP analyses carbohydrate data given in PDB files using the pdb2linucs algorithm. For each different linkage type a separate plot is generated. The user can choose between two sources for plot background information for comparison: data obtained from PDB provided by GlyTorsion or from GlycoMapsDB. GlycoMapsDB provides calculated conformational maps, which show energetically preferred regions for a specific linkage, while PDB data are based on experimentally solved structures. For seldom occuring linkages, however, PDB data are often rare, so maybe not sufficient background information for comparison will be available from this source., THIS RESOURCE IS NO LONGER IN SERVICE. Documented on September 16,2025.
Proper citation: CARP (RRID:SCR_009021) Copy
http://www.ebi.ac.uk/Tools/sss/wublast/
Tool to find regions of sequence similarity within selected protein databases quickly, with minimum loss of sensitivity.
Proper citation: WU-BLAST (RRID:SCR_011824) Copy
http://www.cdtdb.brain.riken.jp/CDT/Top.jsp
Transcriptomic information (spatiotemporal gene expression profile data) on the postnatal cerebellar development of mice (C57B/6J & ICR). It is a tool for mining cerebellar genes and gene expression, and provides a portal to relevant bioinformatics links. The mouse cerebellar circuit develops through a series of cellular and morphological events, including neuronal proliferation and migration, axonogenesis, dendritogenesis, and synaptogenesis, all within three weeks after birth, and each event is controlled by a specific gene group whose expression profile must be encoded in the genome. To elucidate the genetic basis of cerebellar circuit development, CDT-DB analyzes spatiotemporal gene expression by using in situ hybridization (ISH) for cellular resolution and by using fluorescence differential display and microarrays (GeneChip) for developmental time series resolution. The CDT-DB not only provides a cross-search function for large amounts of experimental data (ISH brain images, GeneChip graph, RT-PCR gel images), but also includes a portal function by which all registered genes have been provided with hyperlinks to websites of many relevant bioinformatics regarding gene ontology, genome, proteins, pathways, cell functions, and publications. Thus, the CDT-DB is a useful tool for mining potentially important genes based on characteristic expression profiles in particular cell types or during a particular time window in developing mouse brains.
Proper citation: Cerebellar Development Transcriptome Database (RRID:SCR_013096) Copy
https://rostlab.org/owiki/index.php/PredictNLS
Software automated tool for analysis and determination of Nuclear Localization Signals (NLS). Predicts that your protein is nuclear or finds out whether your potential NLS is found in our database. The program also compiles statistics on the number of nuclear/non-nuclear proteins in which your potential NLS is found. Finally, proteins with similar NLS motifs are reported, and the experimental paper describing the particular NLS are given.
Proper citation: PredictNLS (RRID:SCR_003133) Copy
http://www.elsevier.com/online-tools/pathway-studio/biological-database
THIS RESOURCE IS NO LONGER IN SERVICE. Documented on January 5, 2023. MedScan is a fast and flexible biomedical information extraction technology. It uses dictionaries to identify individual biomedical terms (proteins, cellular processes, small molecules, diseases, etc) referred to in literature articles, and applies advanced natural language processing techniques to detect the relationships within the article and extract these terms and the relationships; the overall process of detection, identification, extraction and assembling, is termed Information Harvesting. Information extracted by MedScan represents the multiple aspects of protein function, including protein modification, cellular localization, protein-protein interactions, gene expression regulation, molecular transport and synthesis, as well as association with diseases, and regulation of various cellular processes. This scope can be broadened by modifying information extraction rules and the dictionaries. Dictionaries can be assembled on any topic or area that is represented in the literature you wish to harvest. High-throughput data generation methodologies like microarray gene expression require new approaches for gathering information for data analysis. For the best results, computational approaches used for high-throughput data analysis require that biological information from the literature be a coherent and integrated part of the analysis software itself. Pathway Studio meets this challenge through its MedScan Technology and underlying ResNet database. All editions of Pathway Studio contain MedScan Technology to harvest information from the literature and to save this information in the Pathway Studio ResNet database ready for data analysis. MedScan is more than a web search engine. Indeed, the output of a Google search can be channeled into MedScan for example. Web searches, like Google, are excellent at finding items as a result of a query. A quick look at the output list usually locates the item for which you are looking. This approach however, is not well suited for information and knowledge gathering. Also, once information is gathered, where do you put it for later computational use? MedScan meets this challenge for the area of biomedical literature and biomedical online information. PubMed meets the needs for a central repository of biomedical literature. Researchers can go to PubMed and search for any topic and articles of interest, much like a web type of search. However, just like a web type of search, PubMed also provides a list of all the hits with a link to the articles. If a single article, or even just a few, are sought, this search approach is useful. Alternatively, MedScan will list all the articles of interest but additionally scans the text for relationships, highlights these relationships in the articles and then lists these relationships and the biological molecules and processes involved in the relationships in separate tables. The tables of relationships can be viewed graphically in Pathway Studio and can be saved into the ResNet database for use in experimental data analysis.
Proper citation: MedScan (RRID:SCR_003314) Copy
http://rostlab.org/services/nlsdb/
A database of nuclear localization signals (NLSs) and of nuclear proteins targeted to the nucleus by NLS motifs. NLSs are short stretches of residues mediating transport of nuclear proteins into the nucleus. The database contains 114 experimentally determined NLSs that were obtained through an extensive literature search. Using "in silico mutagenesis" this set was extended to 308 experimental and potential NLSs. This final set matched over 43% of all known nuclear proteins and matches no currently known non-nuclear protein. NLSdb contains over 6000 predicted nuclear proteins and their targeting signals from the PDB and SWISS-PROT/TrEMBL databases. The database also contains over 12 500 predicted nuclear proteins from six entirely sequenced eukaryotic proteomes (Homo sapiens, Mus musculus, Drosophila melanogaster, Caenorhabditis elegans, Arabidopsis thaliana and Saccharomyces cerevisiae). NLS motifs often co-localize with DNA-binding regions. This observation was used to also annotate over 1500 DNA-binding proteins. From this site you can: * Query NLSdb * Find out how to use NLSdb * Browse the entries in NLSdb * Find out if your protein has an NLS using PredictNLS * Predict subcellular localization of your protein using LOCtree
Proper citation: NLSdb: a database of nuclear localization signals (RRID:SCR_003273) Copy
http://abi.inf.uni-tuebingen.de/Services/MultiLoc2
An extensive high-performance subcellular protein localization prediction system that incorporates phylogenetic profiles and Gene Ontology terms to yield higher accuracies compared to its previous version. Moreover, it outperforms other prediction systems in two benchmarks studies. A downloadable version of MultiLoc2 for local use is also available.
Proper citation: MultiLoc (RRID:SCR_003151) Copy
Database to catalog experimentally determined interactions between proteins combining information from a variety of sources to create a single, consistent set of protein-protein interactions that can be downloaded in a variety of formats. The data were curated, both, manually and also automatically using computational approaches that utilize the the knowledge about the protein-protein interaction networks extracted from the most reliable, core subset of the DIP data. Because the reliability of experimental evidence varies widely, methods of quality assessment have been developed and utilized to identify the most reliable subset of the interactions. This CORE set can be used as a reference when evaluating the reliability of high-throughput protein-protein interaction data sets, for development of prediction methods, as well as in the studies of the properties of protein interaction networks. Tools are available to analyze, visualize and integrate user's own experimental data with the information about protein-protein interactions available in the DIP database. The DIP database lists protein pairs that are known to interact with each other. By interact they mean that two amino acid chains were experimentally identified to bind to each other. The database lists such pairs to aid those studying a particular protein-protein interaction but also those investigating entire regulatory and signaling pathways as well as those studying the organization and complexity of the protein interaction network at the cellular level. Registration is required to gain access to most of the DIP features. Registration is free to the members of the academic community. Trial accounts for the commercial users are also available.
Proper citation: Database of Interacting Proteins (DIP) (RRID:SCR_003167) Copy
Database of images of putative biological pathways, macromolecular structures, gene families, and cellular relationships. It is of use to those who are working with large sets of genes or proteins using cDNA arrays, functional genomics, or proteomics. The rationale for this collection is that: # Except in a few cases, information on most biological pathways in higher eukaryotes is non-existent, incomplete, or conflicting. # Similar biological pathways differ by tissue context, developmental stages, stimulatory events, or for other complex reasons. This database allows comparisons of different variations of pathways that can be tested empirically. # The goal of this database is to use images created directly by biomedical scientists who are specialists in a particular biological system. It is specifically designed to NOT use average, idealized or redrawn pathways. It does NOT use pathways defined by computer algorithm or information search approaches. # Information on biological pathways in higher eukaryotes generally resides in the images and text of review papers. Much of this information is not easily accessible by current medical reference search engines. # All images are attributable to the original authors. All pathways or other biological systems described are graphic representations of natural systems. Each pathway is to be considered a work in progress. Each carries some degree of error or incompleteness. The end user has the ultimate responsibility to determine the scientific correctness and validity in their particular biological system. Image/pathway submissions are welcome.
Proper citation: Biological Biochemical Image Database (RRID:SCR_003474) Copy
http://pir.georgetown.edu/pirwww/dbinfo/pirsf.shtml
A SuperFamily classification system, with rules for functional site and protein name, to facilitate the sensible propagation and standardization of protein annotation and the systematic detection of annotation errors. The PIRSF concept is being used as a guiding principle to provide comprehensive and non-overlapping clustering of UniProtKB sequences into a hierarchical order to reflect their evolutionary relationships. The PIRSF classification system is based on whole proteins rather than on the component domains; therefore, it allows annotation of generic biochemical and specific biological functions, as well as classification of proteins without well-defined domains. There are different PIRSF classification levels. The primary level is the homeomorphic family, whose members are both homologous (evolved from a common ancestor) and homeomorphic (sharing full-length sequence similarity and a common domain architecture). At a lower level are the subfamilies which are clusters representing functional specialization and/or domain architecture variation within the family. Above the homeomorphic level there may be parent superfamilies that connect distantly related families and orphan proteins based on common domains. Because proteins can belong to more than one domain superfamily, the PIRSF structure is formally a network. The FTP site provides free download for PIRSF.
Proper citation: PIRSF (RRID:SCR_003352) Copy
Centralized, standards compliant, public data repository for proteomics data, including protein and peptide identifications, post-translational modifications and supporting spectral evidence. Originally it was developed to provide a common data exchange format and repository to support proteomics literature publications. This remit has grown with PRIDE, with the hope that PRIDE will provide a reference set of tissue-based identifications for use by the community. The future development of PRIDE has become closely linked to HUPO PSI. PRIDE encourages and welcomes direct user submissions of protein and peptide identification data to be published in peer-reviewed publications. Users may Browse public datasets, use PRIDE BioMart for custom queries, or download the data directly from the FTP site. PRIDE has been developed through a collaboration of the EMBL-EBI, Ghent University in Belgium, and the University of Manchester.
Proper citation: Proteomics Identifications (PRIDE) (RRID:SCR_003411) Copy
http://wiki.c2b2.columbia.edu/honiglab_public/index.php/Main_Page
Laboratory portal, including software, web-based tools, databases and data sets, related to their research that focuses on the development and application of biophysical and bioinformatics methods aimed at understanding the structural and energetic origins of protein-protein, protein-nucleic acid, and protein-membrane interactions. Their work includes fundamental theoretical research, the development of software tools, and applications to problems of biological importance. In this regard they maintain an active collaborative computational and experimental research program on the molecular basis of cell-cell adhesion. Other problems of current interest include protein structure prediction, the organization of protein sequence/structure space, the prediction of protein function based on protein structure, the structural origins of specificity in protein-DNA interactions, RNA function and, more generally, the electrostatic properties of biological macromolecules.
Proper citation: Honig Lab (RRID:SCR_003410) Copy
Collection of pathways and pathway annotations. The core unit of the Reactome data model is the reaction. Entities (nucleic acids, proteins, complexes and small molecules) participating in reactions form a network of biological interactions and are grouped into pathways (signaling, innate and acquired immune function, transcriptional regulation, translation, apoptosis and classical intermediary metabolism) . Provides website to navigate pathway knowledge and a suite of data analysis tools to support the pathway-based analysis of complex experimental and computational data sets.
Proper citation: Reactome (RRID:SCR_003485) Copy
http://mimi.ncibi.org/MimiWeb/main-page.jsp
MiMi Web gives you an easy to use interface to a rich NCIBI data repository for conducting your systems biology analyses. This repository includes the MiMI database, PubMed resources updated nightly, and text mined from biomedical research literature. The MiMI database comprehensively includes protein interaction information that has been integrated and merged from diverse protein interaction databases and other biological sources. With MiMI, you get one point of entry for querying, exploring, and analyzing all these data. MiMI provides access to the knowledge and data merged and integrated from numerous protein interactions databases and augments this information from many other biological sources. MiMI merges data from these sources with deep integration into its single database with one point of entry for querying, exploring, and analyzing all these data. MiMI allows you to query all data, whether corroborative or contradictory, and specify which sources to utilize. MiMI displays results of your queries in easy-to-browse interfaces and provides you with workspaces to explore and analyze the results. Among these workspaces is an interactive network of protein-protein interactions displayed in Cytoscape and accessed through MiMI via a MiMI Cytoscape plug-in. MiMI gives you access to more information than you can get from any one protein interaction source such as: * Vetted data on genes, attributes, interactions, literature citations, compounds, and annotated text extracts through natural language processing (NLP) * Linkouts to integrated NCIBI tools to: analyze overrepresented MeSH terms for genes of interest, read additional NLP-mined text passages, and explore interactive graphics of networks of interactions * Linkouts to PubMed and NCIBI's MiSearch interface to PubMed for better relevance rankings * Querying by keywords, genes, lists or interactions * Provenance tracking * Quick views of missing information across databases. Data Sources include: BIND, BioGRID, CCSB at Harvard, cPath, DIP, GO (Gene Ontology), HPRD, IntAct, InterPro, IPI, KEGG, Max Delbreuck Center, MiBLAST, NCBI Gene, Organelle DB, OrthoMCL DB, PFam, ProtoNet, PubMed, PubMed NLP Mining, Reactome, MINT, and Finley Lab. The data integration service is supplied under the conditions of the original data sources and the specific terms of use for MiMI. Access to this website is provided free of charge. The MiMI data is queryable through a web services api. The MiMI data is available in PSI-MITAB Format. These files represent a subset of the data available in MiMI. Only UniProt and RefSeq identifiers are included for each interactor, pathways and metabolomics data is not included, and provenance is not included for each interaction. If you need access to the full MiMI dataset please send an email to mimi-help (at) umich.edu.
Proper citation: Michigan Molecular Interactions (RRID:SCR_003521) Copy
http://portal.ncibi.org/gateway/mimiplugin.html
The Cytoscape MiMI Plugin is an open source interactive visualization tool that you can use for analyzing protein interactions and their biological effects. The Cytoscape MiMI Plugin couples Cytoscape, a widely used software tool for analyzing bimolecular networks, with the MiMI database, a database that uses an intelligent deep-merging approach to integrate data from multiple well-known protein interaction databases. The MiMI database has data on 119,880 molecules, 330,153 interactions, and 579 complexes. By querying the MiMI database through Cytoscape you can access the integrated molecular data assembled in MiMI and retrieve interactive graphics that display protein interactions and details on related attributes and biological concepts. You can interact with the visualization by expanding networks to the next nearest neighbors and zooming and panning to relationships of interest. You also can perceptually encode nodes and links to show additional attributes through color, size and the visual cues. You can edit networks, link out to other resources and tools, and access information associated with interactions that has been mined and summarized from the research literature information through a biology natural language processing database (BioNLP) and a multi-document summarization system, MEAD. Additionally, you can choose sub-networks of interest and use SAGA, a graph matching tool, to match these sub-networks to biological pathways.
Proper citation: MiMI Plugin for Cytoscape (RRID:SCR_003424) Copy
http://core.biotech.hawaii.edu/Bioinformatics.htm
THIS RESOURCE IS NO LONGER IN SERVCE, documented January 28, 2019. Core Facility provides the software and support for computer assisted protein and DNA sequence analysis and database access. The Genetics Computer Group GCG-Wisconsin package is currently available on PBRC's UNIX platform that is accessible via modem or direct connection. The package can be accessed via three interfaces: the command-line interface (UNIX C-shell), the web-based interface (SeqWeb) and the X-Windows based graphics interface (SeqLab). Applications in the package include sequence editing, alignment, comparison, primer design, restriction analysis, mapping, data presentation, database browsing, etc. In addition to local databases, access to remote databases (BLAST) is integrated into the package. The local databases are updated quarterly. Databases available include GenBank, EMBL, PIR-Protein, SWISS-PROT and Restriction Enzymes (REBASE).
Proper citation: GCG/SeqWeb (RRID:SCR_003454) Copy
https://services.healthtech.dtu.dk/
Center for Biological Sequence Analysis of the Technical University of Denmark conducts basic research in the field of bioinformatics and systems biology and directs its research primarily towards topics related to the elucidation of the functional aspects of complex biological mechanisms. A large number of computational methods have been produced, which are offered to others via WWW servers. Several data sets are also available. The center also has experimental efforts in gene expression analysis using DNA chips and data generation in relation to the physical and structural properties of DNA. The on-line prediction services at CBS are available as interactive input forms. Most of the servers are also available as stand-alone software packages with the same functionality. In addition, for some servers, programmatic access is provided in the form of SOAP-based Web Services. The center also educates engineering students in biotechnology and systems biology and offers a wide range of courses in bioinformatics, systems biology, human health, microbiology and nutrigenomics.
Proper citation: DTU Center for Biological Sequence Analysis (RRID:SCR_003590) Copy
Can't find your Tool?
We recommend that you click next to the search bar to check some helpful tips on searches and refine your search firstly. Alternatively, please register your tool with the SciCrunch Registry by adding a little information to a web form, logging in will enable users to create a provisional RRID, but it not required to submit.
Welcome to the NIF Resources search. From here you can search through a compilation of resources used by NIF and see how data is organized within our community.
You are currently on the Community Resources tab looking through categories and sources that NIF has compiled. You can navigate through those categories from here or change to a different tab to execute your search through. Each tab gives a different perspective on data.
If you have an account on NIF then you can log in from here to get additional features in NIF such as Collections, Saved Searches, and managing Resources.
Here is the search term that is being executed, you can type in anything you want to search for. Some tips to help searching:
You can save any searches you perform for quick access to later from here.
We recognized your search term and included synonyms and inferred terms along side your term to help get the data you are looking for.
If you are logged into NIF you can add data records to your collections to create custom spreadsheets across multiple sources of data.
Here are the sources that were queried against in your search that you can investigate further.
Here are the categories present within NIF that you can filter your data on
Here are the subcategories present within this category that you can filter your data on
If you have any further questions please check out our FAQs Page to ask questions and see our tutorials. Click this button to view this tutorial again.