Are you sure you want to leave this community? Leaving the community will revoke any permissions you have been granted in this community.
SciCrunch Registry is a curated repository of scientific resources, with a focus on biomedical resources, including tools, databases, and core facilities - visit SciCrunch to register your resource.
A question answer forum for scientists, focusing on methods in bioinformatics, computational genomics and biological data analysis. They welcome detailed and specific posts, written clearly and simply.
Proper citation: BioStar (RRID:SCR_002580) Copy
THIS RESOURCE IS NO LONGER IN SERVICE, Documented on March 24, 2014. A resource for gene expression studies, storing highly curated MIAME-compliant studies (i.e. experiments) employing a variety of technologies such as filter arrays, 2-channel microarrays, Affymetrix chips, SAGE, MPSS and RT-PCR. Data were available for querying and downloading based on the MGED ontology, publications or genes. Both public and private studies (the latter viewable only by users having appropriate logins and permissions) were available from this website. Specific details on protocols, biomaterials, study designs, etc., are collected through a user-friendly suite of web annotation forms. Software has been developed to generate MAGE-ML documents to enable easy export of studies stored in RAD to any other database accepting data in this format. RAD is part of a more general Genomics Unified Schema (http://gusdb.org), which includes a richly annotated gene index (http://allgenes.org), thus providing a platform that integrates genomic and transcriptomic data from multiple organisms. NOTE: Due to changes in technology and funding, the RAD website is no longer available. RAD as a schema is still very much active and incorporated in the GUS (Genomics Unified Schema) database system used by CBIL (EuPathDB, Beta Cell Genomics) and others. The schema for RAD can be viewed along with the other GUS namespaces through our Schema Browser.
Proper citation: RNA Abundance Database (RRID:SCR_002771) Copy
A disease / disorder relationships explorer and a sample of a map-oriented scientific work. It uses the Human Disease Network dataset and allows intuitive knowledge discovery by mapping its complexity. The Human Disease Network (official) dataset, a poster of the data and related book (Biology - The digital era, ISBN: 978-2-271-06779-1) are available. This kind of data has a network-like organization, and relations between elements are at least as important as the elements themselves. More data could be integrated to this prototype and could eventually bring closer phenotype and genotype. Results should be visual, but also printable. Creating posters can enhance collaborative work. It facilitates discussion and sharing of ideas about the data. This website initiative is an invitation to think about the benefits of networks exploration but above all it tries to outline future designs of scientific information systems.
Proper citation: Diseasome (RRID:SCR_002792) Copy
THIS RESOURCE IS NO LONGER IN SERVICE. Documented on January 14,2026. Integrated database of genomic, expression and protein data for Drosophila, Anopheles, C. elegans and other organisms. You can run flexible queries, export results and analyze lists of data. FlyMine presents data in categories, with each providing information on a particular type of data (for example Gene Expression or Protein Interactions). Template queries, as well as the QueryBuilder itself, allow you to perform searches that span data from more than one category. Advanced users can use a flexible query interface to construct their own data mining queries across the multiple integrated data sources, to modify existing template queries or to create your own template queries. Access our FlyMine data via our Application Programming Interface (API). We provide client libraries in the following languages: Perl, Python, Ruby and & Java API
Proper citation: FlyMine (RRID:SCR_002694) Copy
https://sites.google.com/site/jingyijli/SLIDE.zip
Software package that takes exon boundaries and RNA-Seq data as input to discern the set of mRNA isoforms that are most likely to present in an RNA-Seq sample. It is based on a linear model with a design matrix that models the sampling probability of RNA-Seq reads from different mRNA isoforms. To tackle the model unidentifiability issue, SLIDE uses a modified Lasso procedure for parameter estimation. Compared with deterministic isoform assembly algorithms (e.g., Cufflinks), SLIDE considers the stochastic aspects of RNA-Seq reads in exons from different isoforms and thus has increased power in detecting more novel isoforms. Another advantage of SLIDE is its flexibility of incorporating other transcriptomic data such as RACE, CAGE, and EST into its model to further increase isoform discovery accuracy. SLIDE can also work downstream of other RNA-Seq assembly algorithms to integrate newly discovered genes and exons. Besides isoform discovery, SLIDE sequentially uses the same linear model to estimate the abundance of discovered isoforms.
Proper citation: SLIDE (RRID:SCR_005137) Copy
http://compbio.cs.brown.edu/projects/gasv/
Software tool combining both paired read and read depth signals into probabilistic model which can analyze multiple alignments of reads. Used to find structural variation in both normal and cancer genomes using data from variety of next-generation sequencing platforms. Used to predict structural variants directly from aligned reads in SAM/BAM format.Combines read depth information along with discordant paired read mappings into single probabilistic model two common signals of structural variation. When multiple alignments of read are given, GASVPro utilizes Markov Chain Monte Carlo procedure to sample over the space of possible alignments.
Proper citation: GASVPro (RRID:SCR_005259) Copy
http://bioportal.bioontology.org/annotator
A Web service that annotates textual metadata (e.g. journal abstract) with relevant ontology concepts. NCBO uses this Web service to annotate resources in the NCBO Resource Index. They also provide this Web service as a stand-alone service for users. This Web service can be accessed through BioPortal or used directly in your software. Currently, the annotation workflow is based on syntactic concept recognition (using concept names and synonyms) and on a set of semantic expansion algorithms that leverage the semantics in ontologies (e.g., is_a relations). Their service methodology leverages ontologies to create annotations of raw text and returns them using semantic web standards.
Proper citation: NCBO Annotator (RRID:SCR_005329) Copy
http://bowtie-bio.sourceforge.net/index.shtml
Software ultrafast memory efficient tool for aligning sequencing reads. Bowtie is short read aligner.
Proper citation: Bowtie (RRID:SCR_005476) Copy
http://llama.mshri.on.ca/gofish/GoFishWelcome.html
Software program, available as a Java applet online or to download, allows the user to select a subset of Gene Ontology (GO) attributes, and ranks genes according to the probability of having all those attributes., THIS RESOURCE IS NO LONGER IN SERVICE. Documented on September 16,2025.
Proper citation: GoFish (RRID:SCR_005682) Copy
https://sourceforge.net/p/obo/mailman/message/59165700/
A structured controlled vocabulary of the anatomy of Drosophila melanogaster. These ontologies are query-able reference sources for information on Drosophila anatomy and developmental stages. They also provide controlled vocabularies for use in annotation and classification of data related to Drosophila anatomy, such as gene expression, phenotype and images. They were originally developed by FlyBase, who continue to maintain them and have used them for over 200,000 annotations of phenotypes and expression. Extensive use of synonyms means that, given a suitably sophisticated autocomplete, users can find relevant content by searching with almost any anatomical term they find in the literature. These ontologies are developed in the web ontology language OWL2. Their extensive formalization in OWL can be used to drive sophisticated query systems.
Proper citation: Drosophila anatomy and development ontologies (RRID:SCR_001607) Copy
https://repository.niddk.nih.gov/study/21
Data and biological samples were collected by this consortium organizing international efforts to identify genes that determine an individual risk of type 1 diabetes. It originally focused on recruiting families with at least two siblings (brothers and/or sisters) who have type 1 diabetes (affected sibling pair or ASP families). The T1DGC completed enrollment for these families in August 2009. They completed enrollment of trios (father, mother, and a child with type 1 diabetes), as well as cases (people with type 1 diabetes) and controls (people with no history of type 1 diabetes) from populations with a low prevalence of this disease in January 2010. T1DGC Data and Samples: Phenotypic and genotypic data as well as biological samples (DNA, serum and plasma) for T1DGC participants have been deposited in the NIDDKCentral Repositories for future research.
Proper citation: Type 1 Diabetes Genetics Consortium (RRID:SCR_001557) Copy
Community standard for pathway data sharing. Standard language that aims to enable integration, exchange, visualization and analysis of biological pathway data. Supports data exchange between pathway data groups and thus reduces complexity of interchange between data formats by providing accepted standard format for pathway data. Open and collaborative effort by community of researchers, software developers, and institutions. BioPAX is defined in OWL DL and is represented in RDF/XML format.Uses W3C standard Web Ontology Language, OWL.
Proper citation: Biological Pathways Exchange (RRID:SCR_001681) Copy
Multi-organism, publicly accessible compendium of peptides identified in a large set of tandem mass spectrometry proteomics experiments. Mass spectrometer output files are collected for human, mouse, yeast, and several other organisms, and searched using the latest search engines and protein sequences. All results of sequence and spectral library searching are subsequently processed through the Trans Proteomic Pipeline to derive a probability of correct identification for all results in a uniform manner to insure a high quality database, along with false discovery rates at the whole atlas level. The raw data, search results, and full builds can be downloaded for other uses. All results of sequence searching are processed through PeptideProphet to derive a probability of correct identification for all results in a uniform manner ensuring a high quality database. All peptides are mapped to Ensembl and can be viewed as custom tracks on the Ensembl genome browser. The long term goal of the project is full annotation of eukaryotic genomes through a thorough validation of expressed proteins. The PeptideAtlas provides a method and a framework to accommodate proteome information coming from high-throughput proteomics technologies. The online database administers experimental data in the public domain. You are encouraged to contribute to the database.
Proper citation: PeptideAtlas (RRID:SCR_006783) Copy
Curated protein-protein and genetic interaction repository of raw protein and genetic interactions from major model organism species, with data compiled through comprehensive curation efforts.
Proper citation: Biological General Repository for Interaction Datasets (BioGRID) (RRID:SCR_007393) Copy
Next generation sequencing and genotyping services provided to investigators working to discover genes that contribute to disease. On-site statistical geneticists provide insight into analysis issues as they relate to study design, data production and quality control. In addition, CIDR has a consulting agreement with the University of Washington Genetics Coordinating Center (GCC) to provide statistical and analytical support, most predominantly in the areas of GWAS data cleaning and methods development. Completed studies encompass over 175 phenotypes across 530 projects and 620,000 samples. The impact is evidenced by over 380 peer-reviewed papers published in 100 journals. Three pathways exist to access the CIDR genotyping facility: * NIH CIDR Program: The CIDR contract is funded by 14 NIH Institutes and provides genotyping and statistical genetic services to investigators approved for access through competitive peer review. An application is required for projects supported by the NIH CIDR Program. * The HTS Facility: The High Throughput Sequencing Facility, part of the Johns Hopkins Genetic Resources Core Facility, provides next generation sequencing services to internal JHU investigators and external scientists on a fee-for-service basis. * The JHU SNP Center: The SNP Center, part of the Johns Hopkins Genetic Resources Core Facility, provides genotyping to internal JHU investigators and external scientists on a fee-for-service basis. Data computation service is included to cover the statistical genetics services provided for investigators seeking to identify genes that contribute to human disease. Human Genotyping Services include SNP Genome Wide Association Studies, SNP Linkage Scans, Custom SNP Studies, Cancer Panel, MHC Panels, and Methylation Profiling. Mouse Genotyping Services include SNP Scans and Custom SNP Studies.
Proper citation: Center for Inherited Disease Research (RRID:SCR_007339) Copy
https://www.mc.vanderbilt.edu/victr/dcc/projects/acc/index.php/Main_Page
A national consortium formed to develop, disseminate, and apply approaches to research that combine DNA biorepositories with electronic medical record (EMR) systems for large-scale, high-throughput genetic research. The consortium is composed of seven member sites exploring the ability and feasibility of using EMR systems to investigate gene-disease relationships. Themes of bioinformatics, genomic medicine, privacy and community engagement are of particular relevance to eMERGE. The consortium uses data from the EMR clinical systems that represent actual health care events and focuses on ethical issues such as privacy, confidentiality, and interactions with the broader community.
Proper citation: eMERGE Network: electronic Medical Records and Genomics (RRID:SCR_007428) Copy
http://go.princeton.edu/cgi-bin/GOTermFinder
The Generic GO Term Finder finds the significant GO terms shared among a list of genes from an organism, displaying the results in a table and as a graph (showing the terms and their ancestry). The user may optionally provide background information or a custom gene association file or filter evidence codes. This tool is capable of batch processing multiple queries at once. GO::TermFinder comprises a set of object-oriented Perl modules GO::TermFinder can be used on any system on which Perl can be run, either as a command line application, in single or batch mode, or as a web-based CGI script. This implementation, developed at the Lewis-Sigler Institute at Princeton, depends on the GO-TermFinder software written by Gavin Sherlock and Shuai Weng at Stanford University and the GO:View module written by Shuai Weng. It is made publicly available through the GMOD project. The full source code and documentation for GO:TermFinder are freely available from http://search.cpan.org/dist/GO-TermFinder/. Platform: Online tool, Windows compatible, Mac OS X compatible, Linux compatible, Unix compatible
Proper citation: Generic GO Term Finder (RRID:SCR_008870) Copy
An information extracting and processing package for biological literature that can be used online or installed locally via a downloadable software package, http://www.textpresso.org/downloads.html Textpresso's two major elements are (1) access to full text, so that entire articles can be searched, and (2) introduction of categories of biological concepts and classes that relate two objects (e.g., association, regulation, etc.) or describe one (e.g., methods, etc). A search engine enables the user to search for one or a combination of these categories and/or keywords within an entire literature. The Textpresso project serves the biological and biomedical research community by providing: * Full text literature searches of model organism research and subject-specific articles at individual sites. Major elements of these search engines are (1) access to full text, so that the entire content of articles can be searched, and (2) search capabilities using categories of biological concepts and classes that relate two objects (e.g., association, regulation, etc.) or identify one (e.g., cell, gene, allele, etc). The search engines are flexible, enabling users to query the entire literature using keywords, one or more categories or a combination of keywords and categories. * Text classification and mining of biomedical literature for database curation. They help database curators to identify and extract biological entities and facts from the full text of research articles. Examples of entity identification and extraction include new allele and gene names and human disease gene orthologs; examples of fact identification and extraction include sentence retrieval for curating gene-gene regulation, Gene Ontology (GO) cellular components and GO molecular function annotations. In addition they classify papers according to curation needs. They employ a variety of methods such as hidden Markov models, support vector machines, conditional random fields and pattern matches. Our collaborators include WormBase, FlyBase, SGD, TAIR, dictyBase and the Neuroscience Information Framework. They are looking forward to collaborating with more model organism databases and projects. * Linking biological entities in PDF and online journal articles to online databases. They have established a journal article mark-up pipeline that links select content of Genetics journal articles to model organism databases such as WormBase and SGD. The entity markup pipeline links over nine classes of objects including genes, proteins, alleles, phenotypes, and anatomical terms to the appropriate page at each database. The first article published with online and PDF-embedded hyperlinks to WormBase appeared in the September 2009 issue of Genetics. As of January 2011, we have processed around 70 articles, to be continued indefinitely. Extension of this pipeline to other journals and model organism databases is planned. Textpresso is useful as a search engine for researchers as well as a curation tool. It was developed as a part of WormBase and is used extensively by C. elegans curators. Textpresso has currently been implemented for 24 different literatures, among them Neuroscience, and can readily be extended to other corpora of text.
Proper citation: Textpresso (RRID:SCR_008737) Copy
http://hymenopteragenome.org/beebase/
Gene sequences and genomes of Bombus terrestris, Bombus impatiens, Apis mellifera and three of its pathogens, that are discoverable and analyzed via genome browsers, blast search, and apollo annotation tool. The genomes of two additional species, Apis dorsata and A. florea are currently under analysis and will soon be incorporated.BeeBase is an archive and will not be updated. The most up-to-date bee genome data is now available through the navigation bar on the HGD Home page.
Proper citation: BeeBase (RRID:SCR_008966) Copy
An infrastructure for managing of diverse computational biology resources - data, software tools and web-services. The iTools design, implementation and meta-data content reflect the broad NCBC needs and expertise (www.NCBCs.org).
Proper citation: iTools (RRID:SCR_009626) Copy
Can't find your Tool?
We recommend that you click next to the search bar to check some helpful tips on searches and refine your search firstly. Alternatively, please register your tool with the SciCrunch Registry by adding a little information to a web form, logging in will enable users to create a provisional RRID, but it not required to submit.
Welcome to the NIF Resources search. From here you can search through a compilation of resources used by NIF and see how data is organized within our community.
You are currently on the Community Resources tab looking through categories and sources that NIF has compiled. You can navigate through those categories from here or change to a different tab to execute your search through. Each tab gives a different perspective on data.
If you have an account on NIF then you can log in from here to get additional features in NIF such as Collections, Saved Searches, and managing Resources.
Here is the search term that is being executed, you can type in anything you want to search for. Some tips to help searching:
You can save any searches you perform for quick access to later from here.
We recognized your search term and included synonyms and inferred terms along side your term to help get the data you are looking for.
If you are logged into NIF you can add data records to your collections to create custom spreadsheets across multiple sources of data.
Here are the sources that were queried against in your search that you can investigate further.
Here are the categories present within NIF that you can filter your data on
Here are the subcategories present within this category that you can filter your data on
If you have any further questions please check out our FAQs Page to ask questions and see our tutorials. Click this button to view this tutorial again.