Are you sure you want to leave this community? Leaving the community will revoke any permissions you have been granted in this community.
SciCrunch Registry is a curated repository of scientific resources, with a focus on biomedical resources, including tools, databases, and core facilities - visit SciCrunch to register your resource.
http://bio-bigdata.hrbmu.edu.cn/diseasemeth/
Human disease methylation database. DiseaseMeth version 2.0 is focused on aberrant methylomes of human diseases. Used for understanding of DNA methylation driven human diseases.
Proper citation: DiseaseMeth (RRID:SCR_005942) Copy
The DistiLD database aims to increase the usage of existing genome-wide association studies (GWAS) results by making it easy to query and visualize disease-associated SNPs and genes in their chromosomal context. The database performs three important tasks: # published GWAS are collected from several sources and linked to standardized, international disease codes ICD10 codes) # data from the International HapMap Project are analyzed to define linkage disequilibrium (LD) blocks onto which SNPs and genes are mapped # the web interface makes it easy to query and visualize disease-associated SNPs and genes within LD blocks. Users can query the database by diseases, SNPs or genes. No matter which of the three query modes was used, an intermediate page will be shown listing all the studies that matched the search with a link to the corresponding publication. The user can select either all studies related to a certain disease or one specific study for which to view the related LD blocks. The DistiLD resource integrates information on: * Associations between Single Nucleotide Polymorphisms (SNPs) and diseases from genome-wide association studies (GWAS) * Links between SNPs and genes based on linkage disequilibrium (LD) data from HapMap For convenience, we provide the complete datasets as two (zipped) tab-delimited files. The first file contains GWAS results mapped to LD blocks. The second file contains all SNPs and genes assigned to each LD block.
Proper citation: DistiLD - Diseases and Traits in LD (RRID:SCR_005943) Copy
http://newt-omics.mpi-bn.mpg.de/index.php
Newt-omics is a database, which enables researchers to locate, retrieve and store data sets dedicated to the molecular characterization of newts. Newt-omics is a transcript-centered database, based on an Expressed Sequence Tag (EST) data set from the newt, covering ~50,000 Sanger sequenced transcripts and a set of high-density microarray data, generated from regenerating hearts. Newt-omics also contains a large set of peptides identified by mass spectrometry, which was used to validate 13,810 ESTs as true protein coding. Newt-omics is open to implement additional high-throughput data sets without changing the database structure. Via a user-friendly interface Newt-omics allows access to a huge set of molecular data without the need for prior bioinformatical expertise. The newt Notopthalmus viridescens is the master of regeneration. This organism is known for more than 200 years for its exceptional regenerative capabilities. Newts can completely replace lost appendages like limb and tail, lens and retina and parts of the central nervous system. Moreover, after cardiac injury newts can rebuild the functional myocardium with no scar formation. To date only very limited information from public databases is available. Newt-Omics aims to provide a comprehensive platform of expressed genes during tissue regeneration, including extensive annotations, expression data and experimentally verified peptide sequences with yet no homology to other publicly available gene sequences. The goal is to obtain a detailed understanding of the molecular processes underlying tissue regeneration in the newt, that may lead to the development of approaches, efficiently stimulating regenerative pathways in mammalians. * Number of contigs: 26594 * Number of est in contigs: 48537 * Number of transcripts with verified peptide: 5291 * Number of peptides: 15169
Proper citation: Newtomics (RRID:SCR_006073) Copy
http://www.nematodes.org/nembase4/
NEMBASE is a comprehensive Nematode Transcriptome Database including 63 nematode species, over 600,000 ESTs and over 250,000 proteins. Nematode parasites are of major importance in human health and agriculture, and free-living species deliver essential ecosystem services. The genomics revolution has resulted in the production of many datasets of expressed sequence tags (ESTs) from a phylogenetically wide range of nematode species, but these are not easily compared. NEMBASE4 presents a single portal into extensively functionally annotated, EST-derived transcriptomes from over 60 species of nematodes, including plant and animal parasites and free-living taxa. Using the PartiGene suite of tools, we have assembled the publicly available ESTs for each species into a high-quality set of putative transcripts. These transcripts have been translated to produce a protein sequence resource and each is annotated with functional information derived from comparison with well-studied nematode species such as Caenorhabditis elegans and other non-nematode resources. By cross-comparing the sequences within NEMBASE4, we have also generated a protein family assignment for each translation. The data are presented in an openly accessible, interactive database. An example of the utility of NEMBASE4 is that it can examine the uniqueness of the transcriptomes of major clades of parasitic nematodes, identifying lineage-restricted genes that may underpin particular parasitic phenotypes, possible viral pathogens of nematodes, and nematode-unique protein families that may be developed as drug targets.
Proper citation: NEMBASE (RRID:SCR_006070) Copy
http://hfv.lanl.gov/content/index
The Hemorrhagic Fever Viruses (HFV) sequence database collects and stores sequence data and provides a user-friendly search interface and a large number of sequence analysis tools, following the model of the highly regarded and widely used Los Alamos HIV database. The database uses an algorithm that aligns each sequence to a species-wide reference sequence. The NCBI RefSeq database is used for this; if a reference sequence is not available, a Blast search finds the best candidate. Using this method, sequences in each genus can be retrieved pre-aligned. Hemorrhagic fever viruses (HFVs) are a diverse set of over 80 viral species, found in 10 different genera comprising five different families: arena-, bunya-, flavi-, filo- and togaviridae. All these viruses are highly variable and evolve rapidly, making them elusive targets for the immune system and for vaccine and drug design. About 55,000 HFV sequences exist in the public domain today. A central website that provides annotated sequences and analysis tools will be helpful to HFV researchers worldwide.
Proper citation: HFV Database (RRID:SCR_006017) Copy
http://sourceforge.net/projects/bless-ec/
Software tool for Bloom-filter-based error correction for next-generation sequencing (NGS) reads. The algorithm produces accurate correction results with much less memory.
Proper citation: BLESS (RRID:SCR_005963) Copy
http://jjwanglab.org:8080/gwasdb/
Combines collections of genetic variants (GVs) from GWAS and their comprehensive functional annotations, as well as disease classifications. Used to maximize utilility of GWAS data to gain biological insights through integrative, multi-dimensional functional annotation portal. In addition to all GVs annotated in NHGRI GWAS Catalog, we manually curate GVs that are marginally significant (P value < 10-3) by looking into supplementary materials of each original publication and provide extensive functional annotations for these GVs. GVs are manually classified by diseases according to Disease Ontology Lite and HPO (Human Phenotype Ontology) for easy access. Database can also conduct gene based pathway enrichment and PPI network association analysis for those diseases with sufficient variants. SOAP services are available. You may Download GWASdb SNP. (This file contains all of the significant SNP in GWASdb. In the pvalue column, 0 means this P-value is not reported in the study but it is significant SNP. In the source column, GWAS:A represents the original data in GWAS catalog, while GWAS:B is our curation data which P-value < 10-3)
Proper citation: GWASdb (RRID:SCR_006015) Copy
http://equilibrator.weizmann.ac.il/
Web interface designed for thermodynamic analysis of biochemical systems. eQuilibrator enables free-text search for biochemical compounds and reactions and provides thermodynamic estimates for both in a variety of conditions. It can provide estimates for compounds in the KEGG database, and individual compounds and enzymes can be searched for by their common names (water, glucosamine, hexokinase). Reactions can be entered in a free-text format that eQuilibrator parses automatically. eQuilibrator also allows manipulation of the conditions of a reaction - pH, ionic strength, and reactant and product concentrations.
Proper citation: eQuilibrator (RRID:SCR_006011) Copy
http://aias.biol.uoa.gr/OMPdb/
A database of Beta-barrel outer membrane proteins from Gram-negative bacteria. The web interface of OMPdb offers the user the ability not only to view the available data, but also to submit advanced queries for text search within the database''s protein entries or run BLAST searches against the database. The most up-to-date version of the database (as well as all past versions) can be downloaded in various formats (flat text, XML format or raw FASTA sequences). For constructing OMPdb, multiple freely accessible resources were combined and a detailed literature search was performed. The classification of OMPdb''s protein entries into families is based mainly on structural and functional criteria. Information included in the database consists of sequence data, as well as annotation for structural characteristics (such as the transmembrane segments), literature references and links to other public databases, features that are unique worldwide. Along with the database, a collection of profile Hidden Markov Models that were shown to be characteristic for Beta-barrel outer membrane proteins was also compiled. This set, when used in combination with our previously developed algorithms (PRED-TMBB, MCMBB and ConBBPRED) will serve as a powerful tool in matters of discrimination and classification of novel Beta-barrel proteins and whole-genome analyses., THIS RESOURCE IS NO LONGER IN SERVICE. Documented on September 16,2025.
Proper citation: OMPdb (RRID:SCR_006221) Copy
http://www.bioconductor.org/packages/devel/bioc/html/deepSNV.html
Software package that provides quantitative variant callers for detecting subclonal mutations in ultra-deep (>=100x coverage) sequencing experiments. The algorithm is used for a comparative setup with a control experiment of the same loci and uses a beta-binomial model and a likelihood ratio test to discriminate sequencing errors and subclonal SNVs (single nucleotide variants).
Proper citation: deepSNV (RRID:SCR_006214) Copy
http://evolution.genetics.washington.edu/phylip.html
A free package of software programs for inferring phylogenies (evolutionary trees). The source code is distributed (in C), and executables are also distributed. In particular, already-compiled executables are available for Windows (95/98/NT/2000/me/xp/Vista), Mac OS X, and Linux systems. Older executables are also available for Mac OS 8 or 9 systems.
Proper citation: PHYLIP (RRID:SCR_006244) Copy
http://stormo.wustl.edu/ScerTF
Catalog of over 1,200 position weight matrices (PWMs) for 196 different yeast transcription factors (TFs). They've curated 11 literature sources, benchmarked the published position-specific scoring matrices against in-vivo TF occupancy data and TF deletion experiments, and combined the most accurate models to produce a single collection of the best performing weight matrices for Saccharomyces cerevisiae. ScerTF is useful for a wide range of problems, such as linking regulatory sites with transcription factors, identifying a transcription factor based on a user-input matrix, finding the genes bound/regulated by a particular TF, and finding regulatory interactions between transcription factors. Enter a TF name to find the recommended matrix for a particular TF, or enter a nucleotide sequence to identify all TFs that could bind a particular region.
Proper citation: ScerTF (RRID:SCR_006121) Copy
http://www-bionet.sscc.ru/sitex/
THIS RESOURCE IS NO LONGER IN SERVICE. Documented on August 19,2019. Analyzing protein structure projection on exon-intron structure of corresponding gene through years led to several fundamental conclusions about structural and functional organization of the protein. According to these results we decided to map the protein functional sites. So we created the database SitEx that keep the information about this mapping and included the BLAST search and 3D similar structure search using PDB3DScan for the polypeptide encoded by one exon, participating in organizing the functional site. This will help: # to study the positions of the functional sites in exon structure; # to make the complex analysis of the protein function; # to exposure the exons that took part in exon shuffling and came from bacterial genomes; # to study the peculiarities of coding the polypeptide structures. Currently, SitEx contains information about 9994 functional sites presented in 2021 proteins described in proteomes of 17 organisms.
Proper citation: SitEx (RRID:SCR_006122) Copy
https://compbio.dfci.harvard.edu/predictivenetworks//
A flexible, open-source, web-based application and data services framework that enables the integration, navigation, visualization and analysis of gene interaction networks. The primary goal of PN is to allow biomedical researchers to evaluate experimentally derived gene lists in the context of large-scale gene interaction networks. The PN analytical pipeline involves two key steps. The first is the collection of a comprehensive set of known gene interactions derived from a variety of publicly available sources. The second is to use these ''known'' interactions together with gene expression data to infer robust gene networks. The regression-based network inference algorithm creates a graph of gene interactions in which cycles may be present (but no self-loops). Based on information-theoretic techniques, a causal gene interaction network is inferred from both prior knowledge (interactions extracted from biomedical literature and structured biological databases) and gene expression data. A prediction model is fitted for each gene, given its parents, enabling assessment of the predictive ability of the network model.
Proper citation: Predictive Networks (RRID:SCR_006110) Copy
http://202.38.126.151:8080/SDisease/
Curated database of experimentally supported data of RNA Splicing mutation and disease. The RNA Splicing mutations include cis-acting mutations that disrupt splicing and trans-acting mutations that affecting RNA-dependent functions that cause disease. Information such as EntrezGeneID, gene genomic sequence, mutation (nucleotide substitutions, deletions and insertions), mutation location within the gene, organism, detailed description of the splicing mutation and references are also given. Users are able to submit new entries to the database. This database integrating RNA splicing and disease associations would be helpful for understanding not only the RNA splicing but also its contribution to disease. In SpliceDisease database, they manually curated 2337 splicing mutation disease entries involving 303 genes and 370 diseases, which have been supported experimentally in 898 publications. The SpliceDisease database provides information including the change of the nucleotide in the sequence, the location of the mutation on the gene, the reference PubMed ID and detailed description for the relationship among gene mutations, splicing defects and diseases. They standardized the names of the diseases and genes and provided links for these genes to NCBI and UCSC genome browser for further annotation and genomic sequences. For the location of the mutation, they give direct links of the entry to the respective position/region in the genome browser.
Proper citation: SpliceDisease (RRID:SCR_006130) Copy
Collection of chemical structures. Provides access to structures, properties and associated information from hundreds of data sources to find compounds of interest and provides services to improve this data by curation and annotation and to integrate it with users applications.
Proper citation: ChemSpider (RRID:SCR_006360) Copy
http://cran.r-project.org/web/packages/QCGWAS/
Software tools for (automated and manual) quality control of the results of Genome Wide Association Studies.
Proper citation: QCGWAS (RRID:SCR_006408) Copy
http://floresta.eead.csic.es/primers4clades
Web application for the design of PCR primers for cross-species amplification of novel sequences from metagenomic DNA or from uncharacterized organisms belonging to user-specified phylogenetic lineages. It implements an extended CODEHOP strategy and evaluates thermodynamic properties of the oligonucleotide pairs.
Proper citation: primers4clades (RRID:SCR_015714) Copy
http://amp.pharm.mssm.edu/clustergrammer/
Clustergrammer is a web-based tool for visualizing and analyzing high-dimensional data as interactive and shareable hierarchically clustered heatmaps. Clustergrammer enables intuitive exploration of high-dimensional data and has several optional biology-specific features.
Proper citation: clustergrammer (RRID:SCR_015681) Copy
https://bioconductor.org/packages/release/bioc/html/oligo.html
Software package to analyze oligonucleotide arrays (expression/SNP/tiling/exon) at probe-level. It currently supports Affymetrix (CEL files) and NimbleGen arrays (XYS files).
Proper citation: oligo (RRID:SCR_015729) Copy
Can't find your Tool?
We recommend that you click next to the search bar to check some helpful tips on searches and refine your search firstly. Alternatively, please register your tool with the SciCrunch Registry by adding a little information to a web form, logging in will enable users to create a provisional RRID, but it not required to submit.
Welcome to the NIF Resources search. From here you can search through a compilation of resources used by NIF and see how data is organized within our community.
You are currently on the Community Resources tab looking through categories and sources that NIF has compiled. You can navigate through those categories from here or change to a different tab to execute your search through. Each tab gives a different perspective on data.
If you have an account on NIF then you can log in from here to get additional features in NIF such as Collections, Saved Searches, and managing Resources.
Here is the search term that is being executed, you can type in anything you want to search for. Some tips to help searching:
You can save any searches you perform for quick access to later from here.
We recognized your search term and included synonyms and inferred terms along side your term to help get the data you are looking for.
If you are logged into NIF you can add data records to your collections to create custom spreadsheets across multiple sources of data.
Here are the sources that were queried against in your search that you can investigate further.
Here are the categories present within NIF that you can filter your data on
Here are the subcategories present within this category that you can filter your data on
If you have any further questions please check out our FAQs Page to ask questions and see our tutorials. Click this button to view this tutorial again.