Are you sure you want to leave this community? Leaving the community will revoke any permissions you have been granted in this community.
SciCrunch Registry is a curated repository of scientific resources, with a focus on biomedical resources, including tools, databases, and core facilities - visit SciCrunch to register your resource.
A free archive and distribution service for unpublished preprints in the life sciences allowing authors to make their findings immediately available to the scientific community and receive feedback on draft manuscripts before they are submitted to journals. An article may be posted prior to, or concurrently with, submission to a journal but should not be posted if it has already been published. Once an article is published in a journal, bioRxiv will update the preprint with a link to the published version.
Proper citation: bioRxiv (RRID:SCR_003933) Copy
A free and open source distributed version control system designed to handle everything from small to very large projects with speed and efficiency. Git is easy to learn and has a tiny footprint with lightning fast performance. It outclasses SCM tools like Subversion, CVS, Perforce, and ClearCase with features like cheap local branching, convenient staging areas, and multiple workflows.
Proper citation: Git (RRID:SCR_003932) Copy
http://pubchem.ncbi.nlm.nih.gov/
Collection of information about chemical structures and biological properties of small molecules and siRNA reagents hosted by the National Center for Biotechnology Information (NCBI).
Proper citation: PubChem (RRID:SCR_004284) Copy
The European resource for the collection, organization and dissemination of data on biological macromolecular structures. In collaboration with the other worldwide Protein Data Bank (wwPDB) partners - the Research Collaboratory for Structural Bioinformatics (RCSB) and BioMagResBank (BMRB) in the USA and the Protein Data Bank of Japan (PDBj) - they work to collate, maintain and provide access to the global repository of macromolecular structure data. The main objectives of the work at PDBe are: * to provide an integrated resource of high-quality macromolecular structures and related data and make it available to the biomedical community via intuitive user interfaces. * to maintain in-house expertise in all the major structure-determination techniques (X-ray, NMR and EM) in order to stay abreast of technical and methodological developments in these fields, and to work with the community on issues of mutual interest (such as data representation, harvesting, formats and standards, or validation of structural data). * to provide high-quality deposition and annotation facilities for structural data as one of the wwPDB deposition sites. Several sophisticated tools are also available for the structural analysis of macromolecules.
Proper citation: PDBe - Protein Data Bank in Europe (RRID:SCR_004312) Copy
http://discovery.hsci.harvard.edu/
An online database of curated cancer stem cell (CSC) experiments coupled to the Galaxy analytical framework. Driven by a need to improve our understanding of molecular processes that are common and unique across cancer stem cells (CSCs), the SCDE allows users to consistently describe, share and compare CSC data at the gene and pathway level. The initial focus has been on carefully curating tissue and cancer stem cell-related experiments from blood, intestine and brain to create a high quality resource containing 53 public studies and 1098 assays. The experimental information is captured and stored in the multi-omics Investigation/Study/Assay (ISA-Tab) format and can be queried in the data repository. A linked Galaxy framework provides a comprehensive, flexible environment populated with novel tools for gene list comparisons against molecular signatures in GeneSigDB and MSigDB, curated experiments in the SCDE and pathways in WikiPathways. Investigation/Study/Assay (ISA) infrastructure is the first general-purpose format and freely available desktop software suite targeted to experimentalists, curators and developers and that: * assists in the reporting and local management of experimental metadata (i.e. sample characteristics, technology and measurement types, sample-to-data relationships) from studies employing one or a combination of technologies; * empowers users to uptake community-defined minimum information checklists and ontologies, where required; * formats studies for submission to a growing number of international public repositories endorsing the tools, currently ENA (genomics), PRIDE (proteomics) and ArrayExpress (transcriptomics). Galaxy allows you to do analyses you cannot do anywhere else without the need to install or download anything. You can analyze multiple alignments, compare genomic annotations, profile metagenomic samples and much much more. Best of all, Galaxy''''s history system provides a complete analyses record that can be shared. Every history is an analysis workflow, which can be used to reproduce the entire experiment. The code for this Galaxy instance is available for download from BitBucket.
Proper citation: Stem Cell Discovery Engine (RRID:SCR_004453) Copy
A web server dedicated to the reconstruction of phylogenetic trees, reticulation networks and to the inference of horizontal gene transfer (HGT) events.
Proper citation: Tree and reticulogram REConstruction (RRID:SCR_004497) Copy
https://github.com/alyssafrazee/derfinder
R package for differential expression analysis of RNA-seq data.
Proper citation: DER Finder (RRID:SCR_004250) Copy
Curated, relational database containing sequence, classification, structural, functional and evolutionary information about transport systems from variety of living organisms based on IUBMB-approved transporter classification (TC) system. Descriptions, TC numbers, and examples of over 600 families of transport proteins are provided. TC system is analogous to Enzyme Commission (EC) system for classification of enzymes, except that it incorporates both functional and phylogenetic information. TCDB users may submit their own sequenced proteins and descriptions for inclusion into database. The software tools used are all freely available for download. These programs are used for analysis of Protein and DNA sequences. Programs require UNIX server to run.
Proper citation: Transporter Classification Database (RRID:SCR_004490) Copy
http://blast.ncbi.nlm.nih.gov/Blast.cgi
Web search tool to find regions of similarity between biological sequences. Program compares nucleotide or protein sequences to sequence databases and calculates statistical significance. Used for identifying homologous sequences.
Proper citation: NCBI BLAST (RRID:SCR_004870) Copy
http://noble.gs.washington.edu/proj/philius/
Web server that predicts protein transmembrane topology and signal peptides. Hidden Markov models (HMM) have been successfully applied to the tasks of transmembrane protein topology prediction and signal peptide prediction. They expand upon this work by making use of the more powerful class of dynamic Bayesian networks (DBN). Their model, Philius, is inspired by a previously published HMM, Phobius, and combines a signal peptide sub-model with a transmembrane sub-model. They introduce a two-stage DBN decoder which combines the power of posterior decoding with the grammar constraints of Viterbi-style decoding. Philius also provides protein type, segment, and topology confidence metrics to aid in the interpretation of the predictions.
Proper citation: Philius (RRID:SCR_004625) Copy
http://phenom.ccbr.utoronto.ca/index.jsp
Database of morphological phenotypes caused by mutation of essential genes in Saccharomyces cerevisiae, it allows storing, retrieving, visualizing and data mining the quantitative single-cell measurements extracted from micrographs of the temperature-sensitive (ts) mutant cells. PhenoM allows users to rapidly search and retrieve raw images and their quantified morphological data for genes of interest. The database also provides several data-mining tools, including a PhenoBlast module for phenotypic comparison between mutant strains and a Gene Ontology module for functional enrichment analysis of gene sets showing similar morphological alterations. About one-fifth of the genes in the budding yeast are essential for haploid viability and cannot be functionally assessed using standard genetic approaches such as gene deletion. To facilitate genetic analysis of essential genes, we and others have assembled collections of yeast strains expressing temperature-sensitive (ts) alleles of essential genes. To explore the phenotypes caused by essential gene mutation we used a panel of genetically engineered fluorescent markers to explore the morphology of cells in the ts strain collection using high-throughput microscopy. The database contains quantitative measurements of 1,909,914 cells and 78,194 morphological images for 775 temperature-sensitive mutants spanning 491 different essential genes in permissive temperature (26* C) and restrictive temperature (32* C). The morphological images were generated by high-content screening (HCS) technology.
Proper citation: PhenoM - Phenomics of yeast Mutants (RRID:SCR_006970) Copy
http://tritrypdb.org/tritrypdb/
An integrated genomic and functional genomic database providing access to genome-scale datasets for kinetoplastid parasites, and supporting a variety of complex queries driven by research and development needs. Currently, TriTrypDB integrates datasets from Leishmania braziliensis, L. infantum, L. major, L. tarentolae, Trypanosoma brucei and T. cruzi. Users may examine individual genes or chromosomal spans in their genomic context, including syntenic alignments with other kinetoplastid organisms. Data within TriTrypDB can be interrogated utilizing a sophisticated search strategy system that enables a user to construct complex queries combining multiple data types. All search strategies are stored, allowing future access and integrated searches. ''''User Comments'''' may be added to any gene page, enhancing available annotation; such comments become immediately searchable via the text search, and are forwarded to curators for incorporation into the reference annotation when appropriate. TriTrypDB provides programmatic access to its searches, via REST Web Services. The result of a web service request is a list of records (genes, ESTs, etc) in either XML or JSON format. REST services can be executed in a browser by typing a specific URL. TriTrypDB and its continued development are possible through the collaborative efforts between EuPathDB, GeneDB and colleagues at the Seattle Biomedical Research Institute (SBRI).
Proper citation: TriTrypDB (RRID:SCR_007043) Copy
http://bioinfo.cau.edu.cn/agriGO/
A web-based tool and database for the gene ontology analysis. Its focus is on agricultural species and is user-friendly. The agriGO is designed to provide deep support to agricultural community in the realm of ontology analysis. Compared to other available GO analysis tools, unique advantages and features of agriGO are: # The agriGO especially focuses on agricultural species. It supports 45 species and 292 datatypes currently. And agriGO is designed as an user-friendly web server. # New tools including PAGE (Parametric Analysis of Gene set Enrichment), BLAST4ID (Transfer IDs by BLAST) and SEACOMPARE (Cross comparison of SEA) were developed. The arrival of these tools provides users with possibilities for data mining and systematic result exploration and will allow better data analysis and interpretation. # The exploratory capability and result visualization are enhanced. Results are provided in different formats: HTML tables, tabulated text files, hierarchical tree graphs, and flash bar graphs. # In agriGO, PAGE and SEACOMPARE can be used to carry out cross-comparisons of results derived from different data sets, which is very important when studying multiple groups of experiments, such as in time-course research. Platform: Online tool, THIS RESOURCE IS NO LONGER IN SERVICE. Documented on September 16,2025.
Proper citation: agriGO (RRID:SCR_006989) Copy
http://www.imgt.org/IMGTindex/IMGTgene-db.html
IMGT/GENE-DB is the comprehensive IMGT genome database for immunoglobulin (IG) and T cell receptor (TR) genes from human and mouse, and, in development, from other vertebrates. IMGT/GENE-DB is the international reference for the IG and TR gene nomenclature and works in close collaboration with the HUGO Nomenclature Committee, Mouse Genome Database and genome committees for other species. IMGT/GENE-DB allows a search of IG and TR genes by locus, group and subgroup, which are CLASSIFICATION concepts of IMGT-ONTOLOGY. Short cuts allow the retrieval gene information by gene name or clone name. Direct links with configurable URL give access to information usable by humans or programs. An IMGT/GENE-DB entry displays accurate gene data related to genome (gene localization), allelic polymorphisms (number of alleles, IMGT reference sequences, functionality, etc.) gene expression (known cDNAs), proteins and structures (Protein displays, IMGT Colliers de Perles). It provides internal links to the IMGT sequence databases and to the IMGT Repertoire Web resources, and external links to genome and generalist sequence databases. IMGT/GENE-DB manages the IMGT reference directory used by the IMGT tools for IG and TR gene and allele comparison and assignment, and by the IMGT databases for gene data annotation., THIS RESOURCE IS NO LONGER IN SERVICE. Documented on September 16,2025.
Proper citation: IMGT/GENE-DB (RRID:SCR_006964) Copy
Integrative database of germ-line V genes from the immunoglobulin loci of human and mouse. It presents V gene sequences extracted from the EMBL nucleotide sequence database and Ensembl together with links to the respective source sequences. Based on the properties of the source sequences, V genes are classified into 3 different classes: * Class 1: genomic and rearranged evidence * Class 2: genomic evidence only * Class 3: rearranged evidence only This allows careful sequence quality validation by the user. References to other immunological databases ( KABAT, IMGT/LIGM and VBASE ) are given to provide all public annotation data for each V gene. The VBASE2 database can be accessed either by the Direct Query interface or by the DNAPLOT Query interface. The Sequences given by the user are aligned with DNAPLOT against the VBASE2 database. Direct Query allows to enter sequence IDs and names (Field 1), choose species, locus, V gene family and class (Field 2) or search for 100% sequences (Field 3). At the DNAPLOT Query, the sequences given by the user are aligned with DNAPLOT against the VBASE2 database. The DNAPLOT program offers V gene nucleotide sequence alignment referring to the IMGT V gene unique numbering. The Quick Search can be used either for Direct Query to search for sequence IDs and V gene names or for DNAPLOT Query for up to 5 sequences. The new Fab Analysis allows you to align Fab, scFab, scAb or scFv sequences with DNAPLOT against the VBASE2 database, where both heavy and light chain are analyzed.
Proper citation: VBASE2 (RRID:SCR_007082) Copy
http://gene3d.biochem.ucl.ac.uk/Gene3D/
A large database of CATH protein domain assignments for ENSEMBL genomes and Uniprot sequences. Gene3D is a resource of form studying proteins and the component domains. Gene3D takes CATH domains from Protein Databank (PDB) structures and assigns them to the millions of protein sequences with no PDB structures using Hidden Markov models. Assigning a CATH superfamily to a region of a protein sequence gives information on the gross 3D structure of that region of the protein. CATH superfamilies have a limited set of functions and so the domain assignment provides some functional insights. Furthermore most proteins have several different domains in a specific order, so looking for proteins with a similar domain organization provides further functional insights. Strict confidence cut-offs are used to ensure the reliability of the domain assignments. Gene3D imports functional information from sources such as UNIPROT, and KEGG. They also import experimental datasets on request to help researchers integrate there data with the corpus of the literature. The website allows users to view descriptions for both single proteins and genes and large protein sets, such as superfamilies or genomes. Subsets can then be selected for detailed investigation or associated functions and interactions can be used to expand explorations to new proteins. The Gene3D web services provide programmatic access to the CATH-Gene3D annotation resources and in-house software tools. These services include Gene3DScan for identifying structural domains within protein sequences, access to pre-calculated annotations for the major sequence databases, and linked functional annotation from UniProt, GO and KEGG., THIS RESOURCE IS NO LONGER IN SERVICE. Documented on September 16,2025.
Proper citation: Gene3D (RRID:SCR_007672) Copy
http://www.ensembl.org/info/docs/tools/vep/index.html
Data analysis service to predict the functional consequences of known and unknown variants.
Proper citation: Variant Effect Predictor (RRID:SCR_007931) Copy
Gene Expression Atlas is a semantically enriched database of meta-analysis based summary statistics over a curated subset of ArrayExpress Archive, servicing queries for condition-specific gene expression patterns as well as broader exploratory searches for biologically interesting genes/samples. The EBI Gene Expression Atlas Blog discusses ideas, features and problems of creating a large scale meta-analytical atlas of gene expression from publicly available microarray data. Atlas REST API provides all the results available in the main web application in a pragmatic, easy to use form - simple HTTP GET queries as input and either JSON or XML formats as output. Gene Expression Atlas goals: 1. Provision of a statistically robust framework for integration of gene expression experiment results across different platforms at a meta-analytical level 2. A simple interface for identifying strong differential expression candidate genes in conditions of interest 3. Integration of ontologies for high quality annotation of gene and sample attributes 4. Construction of new gene expression summarized views, with a view to analysis of putative signaling pathway targets, discovery of correlated gene expression patterns and the identification of condition/tissue-specific patterns of gene expression.
Proper citation: Gene Expression Atlas (RRID:SCR_007989) Copy
http://etest.vbi.vt.edu/etblast3/
eTBLAST is a unique search engine for searching biomedical literature. Our service is very different from PubMed. While PubMed searches for keywords, our search engine lets you input an entire paragraph and returns MEDLINE abstracts that are similar to it. This is something like PubMed''s Related Articles feature, only better because it runs on your unique set of interests. For example, input the abstract of an unpublished paper or a grant proposal into our engine, and with the touch of a button you''ll be able to find every abstract in MEDLINE dealing with your topic. No more guessing whether your set of keywords has found all the right papers. No more sorting through hundreds of papers you don''t care about to find the handful you were looking for--our search engine does it for you. When most people use PubMed to search MEDLINE they pick one or two keywords to describe their topic, then browse through a long list of results. When they find a paper that looks interesting they click on its Related Articles, in hopes of finding more papers like that one. If they find another relevant paper, they explore it''s related articles--and so on. This process of culling long lists of documents by hand makes literature searching tedious and time consuming. We make it easier for you by providing better results the first time, and then allowing you to automatically combine the papers you care about for a second round. Our Iterate feature allows you to checkmark the abstracts you found interesting in the first round and combine them all to create a new query. It''s like rolling several Related Articles lists into one. * We sort our results by relevance, while PubMed sorts by date. * We save you the time and effort of creating a complicated query. * We let you iterate your search over several good papers to narrow your focus. * We provide you the full MEDLINE abstract in our results, and a link to the PubMed page. * We can send your results straight to your email so you never lose a reference or forget where you found it. * This absolutely free service is provided by the University of Texas Southwestern Medical Center. No registration necessary!
Proper citation: eTBlast (RRID:SCR_008188) Copy
http://lowelab.ucsc.edu/tRNAscan-SE
Web server to search for tRNA genes in genomic sequence. If you would like to run tRNAscan-SE locally, you can get the UNIX source code (gzip''d tar file).
Proper citation: tRNAscan-SE (RRID:SCR_008637) Copy
Can't find your Tool?
We recommend that you click next to the search bar to check some helpful tips on searches and refine your search firstly. Alternatively, please register your tool with the SciCrunch Registry by adding a little information to a web form, logging in will enable users to create a provisional RRID, but it not required to submit.
Welcome to the NIF Resources search. From here you can search through a compilation of resources used by NIF and see how data is organized within our community.
You are currently on the Community Resources tab looking through categories and sources that NIF has compiled. You can navigate through those categories from here or change to a different tab to execute your search through. Each tab gives a different perspective on data.
If you have an account on NIF then you can log in from here to get additional features in NIF such as Collections, Saved Searches, and managing Resources.
Here is the search term that is being executed, you can type in anything you want to search for. Some tips to help searching:
You can save any searches you perform for quick access to later from here.
We recognized your search term and included synonyms and inferred terms along side your term to help get the data you are looking for.
If you are logged into NIF you can add data records to your collections to create custom spreadsheets across multiple sources of data.
Here are the sources that were queried against in your search that you can investigate further.
Here are the categories present within NIF that you can filter your data on
Here are the subcategories present within this category that you can filter your data on
If you have any further questions please check out our FAQs Page to ask questions and see our tutorials. Click this button to view this tutorial again.