Are you sure you want to leave this community? Leaving the community will revoke any permissions you have been granted in this community.
SciCrunch Registry is a curated repository of scientific resources, with a focus on biomedical resources, including tools, databases, and core facilities - visit SciCrunch to register your resource.
A comprehensive analysis and visualization software package for gene expression experiments that provides: a number of clustering and analysis techniques; integrated gene expression and analysis result visualizations, integration with the Gene Expression Omnibus; and an optional data sharing architecture. GO is used to assign functional enrichment scores to clusters, using a combination of specially developed techniques and general statistical methods. These results can be explored using the in built ontology browsing tool or through the generated web pages. SeqExpress also supports numerous data transformation, projection, visualization, file export/import, searching, integration (with R), and clustering options.
Proper citation: SeqExpress (RRID:SCR_007075) Copy
http://www.ncbi.nlm.nih.gov/guide/sitemap/
The National Center for Biotechnology Information''s listing of resources. Sort by alphabetical character, Databases, Downloads, Submissions, Tools and How-To; or by Topic: Chemicals & Bioassays; Data & Software; DNA & RNA; Domains & Structures; Genes & Expression; Genetics & Medicine; Genomes & Maps; Homology; Literature; Proteins; Sequence Analysis; Taxonomy; Training & Tutorials; Variation.
Proper citation: NCBI Resource List (RRID:SCR_005628) Copy
http://www.sgn.cornell.edu/bulk/input.pl?modeunigene
Allows users to download Unigene or BAC information using a list of identifiers or complete datasets with FTP., THIS RESOURCE IS NO LONGER IN SERVICE. Documented on September 16,2025.
Proper citation: Sol Genomics Network - Bulk download (RRID:SCR_007161) Copy
THIS RESOURCE IS NO LONGER IN SERVICE. Documented on January 11, 2023. Archiving services, insertional site analysis, pharmacology and toxicology resources, and reagent repository for academic investigators and others conducting gene therapy research. Databases and educational resources are open to everyone. Other services are limited to gene therapy investigators working in academic or other non-profit organizations. Stores reserve or back-up clinical grade vector and master cell banks. Maintains samples from any gene therapy related Pharmacology or Toxicology study that has been submitted to FDA by U.S. academic investigator that require storage under Good Laboratory Practices. For certain gene therapy clinical trials, FDA has required post-trial monitoring of patients, evaluating clinical samples for evidence of clonal expansion of cells. To help academic investigators comply with this FDA recommendation, the NGVB offers assistance with clonal analysis using LAM-PCR and LM-PCR technology.
Proper citation: National Gene Vector Biorepository (RRID:SCR_004760) Copy
http://linux1.softberry.com/spldb/SpliceDB.html
Database of canonical and non-canonical mammalian splice sites. The information about verified splice site sequences for canonical and non-canonical sites is presented with the supporting evidence. Weight matrices were built for the major splice groups, which can be incorporated into gene prediction programs.
Proper citation: SpliceDB (RRID:SCR_006262) Copy
http://aws.amazon.com/1000genomes/
A dataset containing the full genomic sequence of 1,700 individuals, freely available for research use. The 1000 Genomes Project is an international research effort coordinated by a consortium of 75 companies and organizations to establish the most detailed catalogue of human genetic variation. The project has grown to 200 terabytes of genomic data including DNA sequenced from more than 1,700 individuals that researchers can now access on AWS for use in disease research free of charge. The dataset containing the full genomic sequence of 1,700 individuals is now available to all via Amazon S3. The data can be found at: http://s3.amazonaws.com/1000genomes The 1000 Genomes Project aims to include the genomes of more than 2,662 individuals from 26 populations around the world, and the NIH will continue to add the remaining genome samples to the data collection this year. Public Data Sets on AWS provide a centralized repository of public data hosted on Amazon Simple Storage Service (Amazon S3). The data can be seamlessly accessed from AWS services such Amazon Elastic Compute Cloud (Amazon EC2) and Amazon Elastic MapReduce (Amazon EMR), which provide organizations with the highly scalable compute resources needed to take advantage of these large data collections. AWS is storing the public data sets at no charge to the community. Researchers pay only for the additional AWS resources they need for further processing or analysis of the data. All 200 TB of the latest 1000 Genomes Project data is available in a publicly available Amazon S3 bucket. You can access the data via simple HTTP requests, or take advantage of the AWS SDKs in languages such as Ruby, Java, Python, .NET and PHP. Researchers can use the Amazon EC2 utility computing service to dive into this data without the usual capital investment required to work with data at this scale. AWS also provides a number of orchestration and automation services to help teams make their research available to others to remix and reuse. Making the data available via a bucket in Amazon S3 also means that customers can crunch the information using Hadoop via Amazon Elastic MapReduce, and take advantage of the growing collection of tools for running bioinformatics job flows, such as CloudBurst and Crossbow.
Proper citation: 1000 Genomes Project and AWS (RRID:SCR_008801) Copy
http://montana.eagle-i.net/i/0000012b-00be-4e65-df3b-3fdc80000000
THIS RESOURCE IS NO LONGER IN SERVICE. Documented on October 27, 2023. Core for Microarray analysis, Database development, Systems biology analysis, Genome assembly, Pathway data analysis, Expression data analysis, Metagenomics analysis. To maintain equipment and software for bioinformatic research, promote bioinformatics education on the MSU campus, and provide training and support to biologists implementing bioinformatics tools in their research.
Proper citation: Montana State University Bioinformatics Core Facility (RRID:SCR_009937) Copy
http://www.khri.med.umich.edu/research/lesperance_lab/low_freq.php
This web site lists the disease causing mutations and polymorphisms found in the Wolfram syndrome (WFS1) gene. Sponsors: This resource is supported by the University of Michigan at Ann Arbor.
Proper citation: Human Genetics Laboratory: WFS1 Gene Mutation and Polymorphism Database (RRID:SCR_001113) Copy
A tool for performing multi-cluster gene functional enrichment analyses on large scale data (microarray experiments with many time-points, cell-types, tissue-types, etc.). It facilitates co-analysis of multiple gene lists and yields as output a rich functional map showing the shared and list-specific functional features. The output can be visualized in tabular, heatmap or network formats using built-in options as well as third-party software. It uses the hypergeometric test to obtain functional enrichment achieved via the gene list enrichment analysis option available in ToppGene.
Proper citation: ToppCluster (RRID:SCR_001503) Copy
http://ww2.sanbi.ac.za/Dbases.html
THIS RESOURCE IS NO LONGER IN SERVICE, documented August 23, 2016. The STACKdb is knowledgebase generated by processing EST and mRNA sequences obtained from GenBank through a pipeline consisting of masking, clustering, alignment and variation analysis steps. The STACK project aims to generate a comprehensive representation of the sequence of each of the expressed genes in the human genome by extensive processing of gene fragments to make accurate alignments, highlight diversity and provide a carefully joined set of consensus sequences for each gene. The STACK project is comprised of the STACKdb human gene index, a database of virtual human transcripts, as well as stackPACK, the tools used to create the database. STACKdb is organized into 15 tissue-based categories and one disease category. STACK is a tool for detection and visualization of expressed transcript variation in the context of developmental and pathological states. The data system organizes and reconstructs human transcripts from available public data in the context of expression state. The expression state of a transcript can include developmental state, pathological association, site of expression and isoform of expressed transcript. STACK consensus transcripts are reconstructed from clusters that capture and reflect the growing evidence of transcript diversity. The comprehensive capture of transcript variants is achieved by the use of a novel clustering approach that is tolerant of sub-sequence diversity and does not rely on pairwise alignment. This is in contrast with other gene indexing projects. STACK is generated at least four times a year and represents the exhaustive processing of all publicly available human EST data extracted from GenBank. This processed information can be explored through 15 tissue-specific categories, a disease-related category and a whole-body index
Proper citation: Sequence Tag Alignment and Consensus Knowledgebase Database (RRID:SCR_002156) Copy
http://www.projects.roslin.ac.uk/sheepmap/front.html
THIS RESOURCE IS NO LONGER IN SERVICE, documented August 23, 2016. The project aims to apply genome mapping research to sheep, utilizing previous research in sheep (in other countries) and in other species (in the UK and abroad) to the benefit of the UK sheep industry. The project itself uses existing breeding structures, knowledge of the sheep genome and experimental resources. It has three main aims: i) To use the Suffolk, Texel and Charollais Sire Referencing Schemes to detect and verify quantitative trait loci (QTLs) for growth and carcass composition traits ii) To investigate candidate genes and/or chromosomal regions for associations with production traits. iii) To investigate approaches for optimizing future genotyping strategies within the sire referencing schemes for practical and cost effective application of marker-assisted selection By using commercial breeding populations for the research, immediate application of beneficial results is possible. Potential benefits include increased genetic progress through marker assisted selection which utilizes the genotype information, correction of possible parentage errors (ultimately leading to additional genetic progress) and opportunities for using marker information for product certification. The project will benefit the UK sheep industry by the use of Marker Assisted Selection (MAS) utilizing QTL or gene variants identified in the project. Additional benefits may arise from parentage verification and correction of errors e.g. misallocation of lamb to ewe. In the longer term, opportunities may exist to use markers for quality control, tracing products to their source. The major advantage of the design of this project is that the results are immediately applicable to the breeding schemes within which the QTLs and/or genes are detected. The time lag in the application of the results that is often seen with experimental populations is minimized. The project requires close involvement with the Sire Reference Schemes, in return for their assistance the results have immediate benefit to animals within these groups.
Proper citation: UK Sheep Genome Mapping Project (RRID:SCR_002272) Copy
http://www.nactem.ac.uk/facta/
Text mining tool to discover associations between biomedical concepts from MEDLINE articles. Use the service from your browser or via a Web Service. The whole MEDLINE corpus containing more than 20 million articles is indexed with an efficient text search engine, and it allows you to navigate such associations and their textual evidence in a highly interactive manner - the system accepts arbitrary query terms and displays relevant concepts immediately. A broad range of important biomedical concepts are covered by the combination of a machine learning-based term recognizer and large-scale dictionaries for genes, proteins, diseases, and chemical compounds. There is also a FACTA+ visualization service that can be found here: http://www.nactem.ac.uk/facta-visualizer/
Proper citation: FACTA+. (RRID:SCR_001767) Copy
Professionally curated repository for genetics, genomics and related data resources for soybean that contains the most current genetic, physical and genomic sequence maps integrated with qualitative and quantitative traits. SoyBase includes annotated Williams 82 genomic sequence and associated data mining tools. The genetic and sequence views of the soybean chromosomes and the extensive data on traits and phenotypes are extensively interlinked. This allows entry to the database using almost any kind of available information, such as genetic map symbols, soybean gene names or phenotypic traits. The repository maintains controlled vocabularies for soybean growth, development, and traits that are linked to more general plant ontologies. Contributions to SoyBase or the Breeder''s Toolbox are welcome.
Proper citation: SoyBase (RRID:SCR_005096) Copy
A database for phenotyping human single nucleotide polymorphisms (SNPs)that primarily focuses on the molecular characterization and annotation of disease and polymorphism variants in the human proteome. They provide a detailed variant analysis using their tools such as: * TANGO to predict aggregation prone regions * WALTZ to predict amylogenic regions * LIMBO to predict hsp70 chaperone binding sites * FoldX to analyse the effect on structure stability Further, SNPeffect holds per-variant annotations on functional sites, structural features and post-translational modification. The meta-analysis tool enables scientists to carry out a large scale mining of SNPeffect data and visualize the results in a graph. It is now possible to submit custom single protein variants for a detailed phenotypic analysis., THIS RESOURCE IS NO LONGER IN SERVICE. Documented on September 16,2025.
Proper citation: SNPeffect (RRID:SCR_005091) Copy
http://amp.pharm.mssm.edu/lib/chea.jsp
Data analysis service for gene-list enrichment analysis against a manual database. It allows users to input lists of mammalian gene symbols for which the program computes over-representation of transcription factor targets from the ChIP-X database. The database integrates interaction data from ChIP-chip, ChIP-seq, ChIP-PET and DamID studies and contains 189,933 interactions, manually extracted from 87 publications, describing the binding of 92 transcription factors to 31,932 target genes.
Proper citation: ChEA (RRID:SCR_005403) Copy
Freely accessible phenotype-centered database with integrated analysis and visualization tools. It combines diverse data sets from multiple species and experiment types, and allows data sharing across collaborative groups or to public users. It was conceived of as a tool for the integration of biological functions based on the molecular processes that subserved them. From these data, an empirically derived ontology may one day be inferred. Users have found the system valuable for a wide range of applications in the arena of functional genomic data integration.
Proper citation: Gene Weaver (RRID:SCR_003009) Copy
THIS RESOURCE IS NO LONGER IN SERVICE, documented on July 15, 2013. Database covering a range of plant pathogenic oomycetes, fungi and bacteria primarily those under study at Virginia Bioinformatics Institute. The data comes from different sources and has genomes of 3 oomycetes pathogens: Phytophthora sojae, Phytophthora ramorum and Hyaloperonospora arabidopsidis. The genome sequences (95 MB for P.sojae and 65 MB for P.ramorum) were annotated with approximately 19,000 and approximately 16,000 gene models, respectively. Two different statistical methods were used to validate these gene models, Fickett''''s and a log-likelihood method. Functional annotation of the gene models is based on results from BlastX and InterProScan screens. From the InterProScan results, putative functions to 17,694 genes in P.sojae and 14,700 genes in P.ramorum could be assigned. An easy-to-use genome browser was created to view the genome sequence data, which opens to detailed annotation pages for each gene model. A community annotation interface is available for registered community members to add or edit annotations. There are approximately 1600 gene models for P.sojae and approximately 700 models for P.ramorum that have already been manually curated. A toolkit is provided as an additional resource for users to perform a variety of sequence analysis jobs.
Proper citation: VMD (RRID:SCR_004905) Copy
A web-based tool, knowledgebase and community for analysis and interpretation of human variant files. VCFs (Variant Call Formats) are preprocessed and annotated, you can filter them, access all databases and provide your expertise to the community by creating annotations.
Proper citation: GeneTalk (RRID:SCR_005231) Copy
http://crdd.osdd.net/servers/virsirnadb/
VIRsiRNAdb is a curated database of experimentally validated viral siRNA / shRNA targeting diverse genes of 42 important human viruses including influenza, SARS and Hepatitis viruses. Submissions are welcome. Currently, the database provides detailed experimental information of 1358 siRNA/shRNA which includes siRNA sequence, virus subtype, target gene, GenBank accession, design algorithm, cell type, test object, test method and efficacy (mostly quantitative efficacies). Further, wherever available, information regarding alternative efficacies of above 300 siRNAs derived from different assays has also been incorporated. The database has facilities like search, advance search (using Boolean operators AND, OR) browsing (with data sorting option), internal linking and external linking to other databases (Pubmed, Genbank, ICTV). Additionally useful siRNA analysis tools are also provided e.g. siTarAlign for aligning the siRNA sequence with reference viral genomes or user defined sequences. virsiRNAdb would prove useful for RNAi researchers especially in siRNA based antiviral therapeutics development.
Proper citation: VIRsiRNAdb (RRID:SCR_006108) Copy
http://go.princeton.edu/cgi-bin/GOTermMapper
The Generic GO Term Mapper finds the GO terms shared among a list of genes from your organism of choice within a slim ontology, allowing them to be binned into broader categories. The user may optionally provide a custom gene association file or slim ontology, or a custom list of slim terms. The implementation of this Generic GO Term Mapper uses map2slim.pl script written by Chris Mungall at Berkeley Drosophila Genome Project, and some of the modules included in the GO-TermFinder distribution written by Gavin Sherlock and Shuai Weng at Stanford University, made publicly available through the GMOD project. GO Term Mapper serves a different function than the GO Term Finder. GO Term Mapper simply bins the submitted gene list to a static set of ancestor GO terms. In contrast, GO Term Finder finds the GO terms significantly enriched in a submitted list of genes. Platform: Online tool, Windows compatible, Mac OS X compatible, Linux compatible, Unix compatible
Proper citation: Generic GO Term Mapper (RRID:SCR_005806) Copy
Can't find your Tool?
We recommend that you click next to the search bar to check some helpful tips on searches and refine your search firstly. Alternatively, please register your tool with the SciCrunch Registry by adding a little information to a web form, logging in will enable users to create a provisional RRID, but it not required to submit.
Welcome to the NIF Resources search. From here you can search through a compilation of resources used by NIF and see how data is organized within our community.
You are currently on the Community Resources tab looking through categories and sources that NIF has compiled. You can navigate through those categories from here or change to a different tab to execute your search through. Each tab gives a different perspective on data.
If you have an account on NIF then you can log in from here to get additional features in NIF such as Collections, Saved Searches, and managing Resources.
Here is the search term that is being executed, you can type in anything you want to search for. Some tips to help searching:
You can save any searches you perform for quick access to later from here.
We recognized your search term and included synonyms and inferred terms along side your term to help get the data you are looking for.
If you are logged into NIF you can add data records to your collections to create custom spreadsheets across multiple sources of data.
Here are the sources that were queried against in your search that you can investigate further.
Here are the categories present within NIF that you can filter your data on
Here are the subcategories present within this category that you can filter your data on
If you have any further questions please check out our FAQs Page to ask questions and see our tutorials. Click this button to view this tutorial again.