SciCrunch Registry is a curated repository of scientific resources, with a focus on biomedical resources, including tools, databases, and core facilities - visit SciCrunch to register your resource.
http://funcoup.sbc.su.se/search/
Database of genome wide functional coupling networks. Provides tools to explore predicted networks and to retrieve detailed information about data underlying each prediction. Web service for functional coupling search.
Proper citation: FunCoup (RRID:SCR_018711)
http://smithlabresearch.org/software/preseq/
Software package for predicting library complexity and genome coverage in high throughput sequencing. Aimed at predicting yield of distinct reads from genomic library from initial sequencing experiment. Predicting molecular complexity of sequencing libraries.
Proper citation: Preseq (RRID:SCR_018664)
http://enterobase.warwick.ac.uk/
Integrated software environment that supports identification of global population structures within several bacterial genera that include pathogens. Web service for analyzing and visualizing genomic variation within bacteria. Genome database that enables users to identify, analyse, quantify and visualise genomic variation within bacterial genera including Salmonella, Escherichia/Shigella, Clostridioides, Vibrio, Yersinia, Helicobacter, and Moraxella.
Proper citation: EnteroBase (RRID:SCR_019019)
https://www.otago.ac.nz/chatterjee-lab/tools/index.html
Software package for large scale genomic DNA methylation analysis. Filters and processes aligned bisulphite sequenced data to generate comprehensive reference methylomes in different units for any genome. Processes aligned SAM files of multiple samples to provide reliable and statistically significant differentially methylated regions, then relate them to proximal genes and CpG features with reasonable rapidity.
Proper citation: Differential Methylation Analysis Package (RRID:SCR_019148)
http://ww2.sanbi.ac.za/Dbases.html
THIS RESOURCE IS NO LONGER IN SERVICE, documented August 23, 2016. STACKdb is a knowledgebase generated by processing EST and mRNA sequences obtained from GenBank through a pipeline consisting of masking, clustering, alignment and variation analysis steps. The STACK project aims to generate a comprehensive representation of the sequence of each of the expressed genes in the human genome by extensive processing of gene fragments to make accurate alignments, highlight diversity and provide a carefully joined set of consensus sequences for each gene. The STACK project is comprised of the STACKdb human gene index, a database of virtual human transcripts, as well as stackPACK, the tools used to create the database. STACKdb is organized into 15 tissue-based categories and one disease category. STACK is a tool for detection and visualization of expressed transcript variation in the context of developmental and pathological states. The data system organizes and reconstructs human transcripts from available public data in the context of expression state. The expression state of a transcript can include developmental state, pathological association, site of expression and isoform of expressed transcript. STACK consensus transcripts are reconstructed from clusters that capture and reflect the growing evidence of transcript diversity. The comprehensive capture of transcript variants is achieved by the use of a novel clustering approach that is tolerant of sub-sequence diversity and does not rely on pairwise alignment. This is in contrast with other gene indexing projects. STACK is generated at least four times a year and represents the exhaustive processing of all publicly available human EST data extracted from GenBank. This processed information can be explored through 15 tissue-specific categories, a disease-related category and a whole-body index.
Proper citation: Sequence Tag Alignment and Consensus Knowledgebase Database (RRID:SCR_002156)
https://github.com/BackofenLab/HVSeeker/tree/main
Software tool for distinguishing between bacterial and phage sequences. Consists of two separate models: one analyzing DNA sequences and the other focusing on proteins.
Proper citation: HVSeeker (RRID:SCR_026120)
http://noble.gs.washington.edu/proj/genomedata/
A format for efficient storage of multiple tracks of numeric data anchored to a genome. The format allows fast random access to hundreds of gigabytes of data, while retaining a small disk space footprint. They have also developed utilities to load data into this format. Retrieving data from this format is more than 2900 times faster than a naive approach using wiggle files. A reference implementation in Python and C components is available here under the GNU General Public License. The software has only been tested on Linux and Mac systems.
Proper citation: Genomedata (RRID:SCR_004544)
Public database that stores areas of the genome that differ between individual genomes (variants) and, where available, associated disease and phenotype information. Different types of variants are covered for several species: single nucleotide polymorphisms (SNPs), short nucleotide insertions and/or deletions, and longer variants classified as structural variants (including CNVs). Effects of variants on the Ensembl transcripts and regulatory features for each species are predicted. You can run the same analysis on your own data using the Variant Effect Predictor. These data are integrated with other data sources in Ensembl, and can be accessed using the API or website. For several different species in Ensembl, they import variation data (SNPs, CNVs, allele frequencies, genotypes, etc.) from a variety of sources (e.g. dbSNP). Imported variants and alleles are subjected to a quality control process to flag suspect data. In human, they calculate linkage disequilibrium for each variant, by population.
Proper citation: Ensembl Variation (RRID:SCR_001630)
Web application to search nucleotide databases using a nucleotide query. Algorithms: blastn, megablast, discontiguous megablast.
Proper citation: BLASTN (RRID:SCR_001598)
Web resource that provides data and tools for exploring genomic organization of highly conserved noncoding elements (HCNEs) for multiple genomes. It includes a genome browser that shows HCNE locations and features novel HCNE density plots as a powerful tool to discover developmental regulatory genes and distinguish their regulatory elements and domains. They identify HCNEs as non-exonic regions of high similarity between genome sequences from distantly related organisms, such as human and fish, and provide tools for studying the distribution of HCNEs along chromosomes. Major peaks of HCNE density along chromosomes most often coincide with developmental regulatory genes. Their aim with this site is to aid discovery of developmental regulatory genes, their regulatory domains and their fundamental regulatory elements.
Proper citation: Ancora (RRID:SCR_001623)
Website for analyzing microarray data. Software toolbox for storing, analyzing and integrating microarray data and related genotype and phenotype data. The site is particularly suited for combining QTL and microarray data to search for candidate genes contributing to complex traits. In addition, the site allows, if desired by the investigators, sharing of the data. Investigators can conduct in-silico microarray experiments using their own and/or shared data. There are five major sections of the site: Genome/Transcriptome Data Browser, Microarray Analysis Tools, Gene List Analysis Tools, QTL Tools, and Downloads. The genome/transcriptome data browser combines a genome browser with all the microarray, RNA-Seq, and Genomic Sequencing data. This provides an effective platform to view all of this data side by side. Source code is available on GitHub.
Proper citation: PhenoGen Informatics (RRID:SCR_001613)
http://www.norcomm.org/index.htm
Large-scale research initiative focused on developing and distributing a library of mouse embryonic stem (ES) cell lines carrying single gene trapped or targeted mutations across the mouse genome. NorCOMM's large and growing archive of ES cells is publicly available on a cost-recovery basis from the Canadian Mouse Mutant Repository. As an international public resource, access to clones is unrestricted and nonexclusive. Through NorCOMM's affiliation with the Canadian Mouse Consortium (CMC), NorCOMM also provides clients with a single point of access to regional mouse derivation, phenotyping, genetic and archiving services across Canada. These value-added services can help your company harness NorCOMM's resources for drug discovery, target discovery and preclinical validation.
Proper citation: North American Conditional Mouse Mutagenesis Project (RRID:SCR_001614)
https://www.hgsc.bcm.edu/content/sea-urchin-genome-project
Provides information about the genome of the California purple sea urchin (Strongylocentrotus purpuratus), which has been sequenced and annotated by the Sea Urchin Genome Sequencing Consortium led by HGSC. Reports the sequence and analysis of the genome of the sea urchin Strongylocentrotus purpuratus, a model for developmental and systems biology.
Proper citation: Sea Urchin Genome Project (RRID:SCR_001735)
http://www.animalgenome.org/cgi-bin/QTLdb/index
Database of trait mapping data, i.e. QTL (phenotype / expression, eQTL), candidate gene and association data (GWAS) and copy number variations (CNV) mapped to livestock animal genomes, to facilitate locating and comparing discoveries within and between species. New data and database tools are continually developed to align various trait mapping data to map-based genome features, such as annotated genes. QTLdb is open to house QTL/association data from other animal species where feasible. Most scientific journals require that any original QTL/association data be deposited into public databases before a paper may be accepted for publication. User curator accounts are provided for direct data deposit. Users can download QTLdb data for each species or individual chromosome.
Proper citation: Animal QTLdb (RRID:SCR_001748)
http://www.sanbi.ac.za/resources/
THIS RESOURCE IS NO LONGER IN SERVICE. Documented on September 23, 2022. The South African National Bioinformatics Institute delivers biomedical discovery appropriate to both international and African contexts. Researchers at SANBI perform the highest level of research and provide excellence in education. Research at SANBI has set well recognized milestones in the field of computational biology. The tools and techniques used have not only been developed but also implemented across heterogeneous domains of advanced research. Local and international efforts have driven our discoveries. Until recently, the core of SANBI's research has focused upon gene expression biology. Methods developed and applied at SANBI revolve around a greater understanding of the underlying causes of diseases. SANBI approaches the problem by comparison of genes, genomes and transcriptomes. It uses computational gene expression biology to create novel biological insights and to provide biomarkers for experimental validation. It also performs analysis of human genome variation, transcriptional diversity on both the expression and splicing level and the unravelling of transcriptional regulatory networks. Resources: Hinv, STACKdb, Malaria resources and Trypanosome databases are available for online searching. SANBI offers WCD, STACKdb, stackPACK, eVOC and the eVOKE viewer as tools that can be downloaded. Sponsors: SANBI receives funding and support from a range of organisations in South Africa and internationally. Organisations currently supporting SANBI include: South Africa: South African Medical Research Council, South African AIDS Vaccine Initiative, National Bioinformatics Network, National Research Foundation, Claude Leon Foundation, International Business Machines Inc. Europe: European Union's 6th Framework Programme, World Health Organization. USA: US National Institutes of Health, Fogarty International Centre, Ludwig Institute for Cancer Research.
Proper citation: South African National Bioinformatics Institute: Resources (RRID:SCR_001867)
The UCLA-DOE Institute for Genomics and Proteomics carries out research in bioenergy, structural biology, genomics and proteomics, consistent with the research mission of the United States Department of Energy. Major interests of the 12 Principal Investigators and 9 Associate Members include systems approaches to organisms, structural biology, bioinformatics, and bioenergetic systems. The Institute sponsors 5 Core Technology Centers, for X-ray and NMR structural determination, bioinformatics and computation, protein expression and purification, and biochemical instrumentation. Services offered by this Institute: - Databases: * DIP (The Database of Interacting Proteins): The DIP database catalogs experimentally determined interactions between proteins. It combines information from a variety of sources to create a single, consistent set of protein-protein interactions. * ProLinks Database of Functional Linkages: The Prolinks database is a collection of inference methods used to predict functional linkages between proteins. These methods include the Phylogenetic Profile method, which uses the presence and absence of proteins across multiple genomes to detect functional linkages; the Gene Cluster method, which uses genome proximity to predict functional linkage; Rosetta Stone, which uses a gene fusion event in a second organism to infer functional relatedness; and the Gene Neighbor method, which uses both gene proximity and phylogenetic distribution to infer linkage. - Data-to-Structure Servers: * SAVEs Structure Verification Server * Merohedral Twinning Test Server * SER Surface Entropy Reduction Server * VERIFY3D Structure Verification Server * ERRAT Structure Verification Server - Structure-to-Function Servers: * ProKnow Protein Functionator * Hot Patch Functional Site Locator
Proper citation: University of California at Los Angeles - Department of Energy Institute for Genomics and Proteomics (RRID:SCR_001921)
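The Phylogenetic Profile method mentioned in the Prolinks description can be illustrated with a minimal sketch. The listing does not say how Prolinks scores profiles, so the Jaccard similarity, the 0.8 threshold, the function names, and the toy protein names below are all assumptions chosen for illustration, not the database's actual method.

```python
from itertools import combinations


def profile_similarity(a, b):
    """Jaccard similarity of two presence/absence profiles (assumed metric)."""
    if len(a) != len(b):
        raise ValueError("profiles must cover the same set of genomes")
    both = sum(1 for x, y in zip(a, b) if x and y)
    either = sum(1 for x, y in zip(a, b) if x or y)
    return both / either if either else 0.0


def predict_linkages(profiles, threshold=0.8):
    """Return (protein, protein, score) for pairs whose profiles agree closely."""
    links = []
    for (p, a), (q, b) in combinations(sorted(profiles.items()), 2):
        score = profile_similarity(a, b)
        if score >= threshold:
            links.append((p, q, score))
    return links


# Hypothetical data: 1 = protein present in that genome, 0 = absent.
profiles = {
    "protA": (1, 1, 0, 1, 0),
    "protB": (1, 1, 0, 1, 0),
    "protC": (0, 0, 1, 0, 1),
}
links = predict_linkages(profiles)
print(links)  # [('protA', 'protB', 1.0)]
```

Proteins that appear and disappear together across genomes score highly, which is the intuition behind inferring a functional link from co-occurrence.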
http://www-genome.stanford.edu/
This resource hyperlinks to systematic analysis projects, resources, laboratories, and departments at Stanford University.
Proper citation: Stanford Genomic Resourses (RRID:SCR_001874)
Suite of motif-based sequence analysis tools to discover motifs using MEME, DREME (DNA only) or GLAM2 on groups of related DNA or protein sequences; search sequence databases with motifs using MAST, FIMO, MCAST or GLAM2SCAN; compare a motif to all motifs in a database of motifs; associate motifs with Gene Ontology terms via their putative target genes, and analyze motif enrichment using SpaMo or CentriMo. Source code, binaries and a web server are freely available for noncommercial use.
Proper citation: MEME Suite - Motif-based sequence analysis tools (RRID:SCR_001783)
http://sammeth.net/confluence/display/ASTA/2+-+Download
Tool that extracts and displays alternative splicing (AS) events from a given genomic annotation of exon-intron gene coordinates. By comparing all given transcripts, it detects the variations in their splicing structure and identifies all AS events (like exon skipping, alternate donor, etc) by assigning to each of them an AS code. It provides a visual summary of the AS landscape in the analyzed dataset, the possibility to browse the results on the UCSC website or to download them in GTF or ASTA format. You can use AStalavista for any genome by providing your own annotation set, the identifier of your gene(s) of interest, or analyze the AS landscape of reference annotation datasets like Gencode, RefSeq, Ensembl, FlyBase, etc.
Proper citation: AStalavista (RRID:SCR_001815)
http://compbio.dfci.harvard.edu/tgi/
THIS RESOURCE IS NO LONGER IN SERVICE, documented May 10, 2017. A pilot effort that has developed a centralized, web-based biospecimen locator that presents biospecimens collected and stored at participating Arizona hospitals and biospecimen banks, which are available for acquisition and use by researchers. Researchers may use this site to browse, search and request biospecimens to use in qualified studies. The development of the ABL was guided by the Arizona Biospecimen Consortium (ABC), a consortium of hospitals and medical centers in the Phoenix area, and is now being piloted by this Consortium under the direction of ABRC. You may browse by type (cells, fluid, molecular, tissue) or disease. Common data elements decided by the ABC Standards Committee, based on data elements on the National Cancer Institute's (NCI's) Common Biorepository Model (CBM), are displayed. These describe the minimum set of data elements that the NCI determined were most important for a researcher to see about a biospecimen. The ABL currently does not display information on whether or not clinical data is available to accompany the biospecimens. However, a requester has the ability to solicit clinical data in the request. Once a request is approved, the biospecimen provider will contact the requester to discuss the request (and the requester's questions) before finalizing the invoice and shipment. The ABL is available to the public to browse. In order to request biospecimens from the ABL, the researcher will be required to submit the requested required information. Upon submission of the information, shipment of the requested biospecimen(s) will be dependent on the scientific and institutional review approval. Account required. Registration is open to everyone.
Documented on August 19, 2019. The goal of The Gene Index Project is to use the available Expressed Sequence Tag (EST) and gene sequences, along with the reference genomes wherever available, to provide an inventory of likely genes and their variants and to annotate these with information regarding the functional roles played by these genes and their products. The promise of genome projects has been a complete catalog of genes in a wide range of organisms. While genome projects have been successful in providing reference genome sequences, the problem of finding genes and their variants in genomic sequence remains an ongoing challenge. TGI has created an inventory that contains genes and their variants together with descriptions. In addition, this resource is attempting to use these catalogs to find links between genes and pathways in different species and to provide lists of features within completed genomes that can aid in the understanding of how gene expression is regulated. DATABASES *Eukaryotic Gene Orthologues (formerly known as TOGA - TIGR Orthologous Gene Alignment): Eukaryotic Gene Orthologues (EGO) at DFGI are generated by pair-wise comparison between the Tentative Consensus (TC) sequences that comprise the Dana Farber Gene Indices from individual organisms. The reciprocal pairs of the best match were clustered into individual groups and multiple sequence alignments were displayed for each group. *GeneChip Oncology Database (GCOD): This cancer gene expression database is a collection of publicly available microarray expression data on Affymetrix GeneChip Arrays related to human cancers. Currently only datasets with available raw data (Affymetrix .CEL files) are processed. All processed datasets were subjected to extensive manual curation, uniform processing and consistent quality control.
You can browse the experiments in our collection, perform statistical analysis, and download processed data, or search gene expression profiles using Entrez gene symbol, Unigene ID, or Affymetrix probeset ID. *Gene Indices: As of July 1, 2008, there are 111 publicly available gene indices. They are separated into 4 categories for better organization and easier access. Animal: 41, Plant: 45, Protist: 15, Fungal: 10. *Genomic Maps: Human, mouse, rat, chicken, Drosophila melanogaster, zebrafish, mosquito, Caenorhabditis elegans, Arabidopsis thaliana, rice, yeast, fission yeast. Dana-Farber Cancer Institute (DFCI) Gene Indices Software Tools: *TGI Clustering tools (TGICL): a software system for fast clustering of large EST datasets. *GICL: this package contains the scripts and all the necessary pre-compiled binaries for 32bit Linux systems. *clview: an assembly file viewer. *SeqClean: a script for automated trimming and validation of ESTs or other DNA sequences by screening for various contaminants, low quality and low-complexity sequences. *cdbfasta/cdbyank: fast indexing/retrieval of fasta records from flat file databases. *DAS/XML Genomic Viewer: The Genomic viewer borrows modules from http://www.biodas.org (lstein (at) cshl.org) & http://webreference.com.
Proper citation: Gene Index Project (RRID:SCR_002148)
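The EGO description says orthologue candidates come from reciprocal pairs of best matches between Tentative Consensus sequences. A minimal sketch of just that pairing step might look like the following. The score dictionaries, sequence identifiers, and function names are hypothetical, and the later clustering and multiple-alignment steps described in the listing are not shown.

```python
def best_hits(scores):
    """Map each query to its highest-scoring subject.

    `scores` maps (query, subject) -> similarity score (assumed input shape).
    """
    best = {}
    for (query, subject), score in scores.items():
        if query not in best or score > best[query][1]:
            best[query] = (subject, score)
    return {query: subject for query, (subject, _) in best.items()}


def reciprocal_best_pairs(a_vs_b, b_vs_a):
    """Keep (a, b) only when a's best hit is b and b's best hit is a."""
    forward = best_hits(a_vs_b)
    reverse = best_hits(b_vs_a)
    return sorted((a, b) for a, b in forward.items() if reverse.get(b) == a)


# Hypothetical pairwise comparison scores between two organisms' TC sequences.
human_vs_mouse = {
    ("hs_TC1", "mm_TC7"): 0.95,
    ("hs_TC1", "mm_TC9"): 0.60,
    ("hs_TC2", "mm_TC9"): 0.80,
}
mouse_vs_human = {
    ("mm_TC7", "hs_TC1"): 0.94,
    ("mm_TC9", "hs_TC1"): 0.70,
    ("mm_TC9", "hs_TC2"): 0.65,
}
pairs = reciprocal_best_pairs(human_vs_mouse, mouse_vs_human)
print(pairs)  # [('hs_TC1', 'mm_TC7')]
```

Here hs_TC2 is dropped because its best mouse match, mm_TC9, prefers hs_TC1 in the reverse direction, so the pair is not reciprocal.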