Are you sure you want to leave this community? Leaving the community will revoke any permissions you have been granted in this community.
SciCrunch Registry is a curated repository of scientific resources, with a focus on biomedical resources, including tools, databases, and core facilities - visit SciCrunch to register your resource.
http://sourceforge.net/projects/dmetanalyzer/
Software tool for the automatic association analysis among the variation of the patient genomes and the clinical conditions of patients, i.e. the different response to drugs. The system allows: (i) to automatize the workflow of analysis of DMET (drug metabolism enzymes and transporters)-SNP (Single Nucleotide Polymorphism) data avoiding the use of multiple tools; (ii) the automatic annotation of DMET-SNP data and the search in existing databases of SNPs (e.g. dbSNP), (iii) the association of SNP with pathway through the search in PharmaKGB, a major knowledge base for pharmacogenomic studies. It has a simple graphical user interface that allows users (doctors/biologists) to upload and analyze DMET files produced by Affymetrix DMET-Console in an interactive way.
Proper citation: DMET-Analyzer (RRID:SCR_002030) Copy
https://gemini.readthedocs.io/en/latest/
Framework for exploring genetic variation in the context of the genome annotations available for the human genome. Users can load a VCF file into a database and each variant is automatically annotated by comparing it to several genome annotations from source such as ENCODE tracks, UCSC tracks, OMIM, dbSNP, KEGG, and HPRD.
Proper citation: GEMINI (RRID:SCR_014819) Copy
https://bitbucket.org/nsegata/phylophlan/wiki/Home
Software pipeline for reconstructing highly accurate and resolved phylogenetic trees based on whole-genome sequence information. Pipeline is scalable to thousands of genomes and uses the most conserved 400 proteins for extracting the phylogenetic signal. PhyloPhlAn also implements taxonomic curation, estimation, and insertion operations., THIS RESOURCE IS NO LONGER IN SERVICE. Documented on September 16,2025.
Proper citation: PhyloPhlAn (RRID:SCR_013082) Copy
http://ctb.pku.edu.cn/main/SheGroup/Software/MED2.htm
A non-supervised gene prediction algorithm for prokaryotic genomes with multivariate entropy distance method.
Proper citation: MED (RRID:SCR_013403) Copy
http://bioinfo.unl.edu/raiphy.php
A semi-supervised metagenomic fragment classification software program that utilizes the genome signatures to characterize the DNA sequences and taxonomic classification is based on an information theoretic measure referred as Relative Abundance Index (RAI). A DNA sequence of unknown source is classified and taxonomically labeled based on the phylogenetic profiles of the previously sequenced genomes. The profiles are iteratively updated using the unknown DNA sequences and the classification results. After a few cycles, the metagenome is classified into operational taxonomic units.
Proper citation: RAIphy (RRID:SCR_004720) Copy
Database of genetic and molecular biology data for the model higher plant Arabidopsis thaliana. Data available includes the complete genome sequence along with gene structure, gene product information, metabolism, gene expression, DNA and seed stocks, genome maps, genetic and physical markers, publications, and information about the Arabidopsis research community. Gene product function data is updated every two weeks from the latest published research literature and community data submissions. Gene structures are updated 1-2 times per year using computational and manual methods as well as community submissions of new and updated genes. TAIR also provides extensive linkouts from data pages to other Arabidopsis resources. The data can be searched, viewed and analyzed. Datasets can also be downloaded. Pages on news, job postings, conference announcements, Arabidopsis lab protocols, and useful links are provided.
Proper citation: TAIR (RRID:SCR_004618) Copy
https://computation-rnd.llnl.gov/lmat/
Open-source software tool to assign taxonomic labels to as many reads as possible in very large metagenomic datasets and report the taxonomic profile of the input sample. The quick "single pass" analysis of every read allows read binning to support additional more computationally expensive analysis such as metagenomic assembly or sensitive database searches on targeted subsets of reads.
Proper citation: LMAT (RRID:SCR_004646) Copy
http://metaphyler.cbcb.umd.edu/
A taxonomic classifier for metagenomic shotgun reads, which uses phylogenetic marker genes as a taxonomic reference. The classifier, based on BLAST, uses different thresholds (automatically learned from the reference database) for each combination of taxonomic rank, reference gene, and sequence length. The reference database includes marker genes from all complete genomes, several draft genomes and the NCBI nr protein database.
Proper citation: MetaPhyler (RRID:SCR_004848) Copy
http://compbio.cs.toronto.edu/prism/
Software for split read (reads which span across a structrual variant -- SV ) mapping and SV calling from the mapping result. It is able to detect small insertions and abitrary size deletions, inversions and tandom duplications with the direction of discordant read pairs. PRISM_CTX is a tool for detecting inter-chromosome trans-location events.
Proper citation: PRISM - Pair Read Informed Split Mapper (RRID:SCR_004812) Copy
http://anntools.sourceforge.net/
Software tool for annotating single nucleotide substitutions (SNP/SNV), small insertions/deletions (indels), and copy number variations (CNV) calls generated from sequencing and microarray data. Only human genome build 37/hg19 can be annotated at this time.
Proper citation: AnnTools (RRID:SCR_005170) Copy
http://cbrc.kaust.edu.sa/readscan/
A highly scalable parallel software program to identify non-host sequences (of potential pathogen origin) and estimate their genome relative abundance in high-throughput sequence datasets.
Proper citation: READSCAN (RRID:SCR_005204) Copy
http://odin.mdacc.tmc.edu/~xsu1/VirusSeq.html
An algorithmic software tool for detecting known viruses and their integration sites using next-generation sequencing of human cancer tissue. VirusSeq takes FASTQ files (paired-end reads) as input.
Proper citation: VirusSeq (RRID:SCR_005206) Copy
http://snpeff.sourceforge.net/
Genetic variant annotation and effect prediction software toolbox that annotates and predicts effects of variants on genes (such as amino acid changes). By using standards, such as VCF, SnpEff makes it easy to integrate with other programs.
Proper citation: SnpEff (RRID:SCR_005191) Copy
https://code.google.com/p/clever-sv/
A collection of tools to discover and genotype structural variations in genomes from paired-end sequencing reads. The main software is written in C++ with some auxiliary scripts in Python.
Proper citation: CLEVER Toolkit (RRID:SCR_005255) Copy
https://code.google.com/p/clippers/
A software program designed to identify long deletions of a genome as well as the RNA splicings using long Illumina reads. Currently, Clippers is implemented for long reads Illumina, ex: 75bp or 100bp, allowing mismatches and a single deletion/splicing. Clippers is a sister tool of PerM, our short reads aligner. Users are strongly suggested to use PerM to initially mapped reads and identify the deletion/splicing with the initially unmapped reads. We plan to extend it to ABI SOLiD reads in the near future. Clippers outputs gap-alignments in SAM format. You can use SAMtools or other program to interpret the deletion/splicing. The input files are a reference in fasta format and the reads is in fasta or fastq format.
Proper citation: Clippers (RRID:SCR_005256) Copy
http://sv.gersteinlab.org/age/
A tool that implements an algorithm for optimal alignment of sequences with Structural Variations (SVs).
Proper citation: AGE (RRID:SCR_005253) Copy
Software mining pipeline guided by a Bayesian principle to detect single nucleotide polymorphisms, insertion and deletions by comparing high-throughput pyrosequencing reads with a reference genome of related organisms. This pipeline is extended to identify and visualize large-size structural variations, including insertions, deletions, inversions and translocations.
Proper citation: inGAP (RRID:SCR_005261) Copy
http://sv.gersteinlab.org/pemer/
Software package as computational framework with simulation-based error models for inferring genomic structural variants from massive paired-end sequencing data. Package is composed of three modules, PEMer workflow, SV-Simulation and BreakDB. PEMer workflow is a sensitive software for detecting SVs from paired-end sequence reads. SV-Simulation randomly introduces SVs into a given genome and generates simulated paired-end reads from novel genome.
Proper citation: PEMer (RRID:SCR_005263) Copy
Curated, open-source, integrated data resource for comparative functional genomics in crops and model plant species to facilitate the study of cross-species comparisons using information generated from projects supported by public funds. It currently hosts annotated whole genomes in over two dozen plant species and partial assemblies for almost a dozen wild rice species in the Ensembl browser, genetic and physical maps with genes, ESTs and QTLs locations, genetic diversity data sets, structure-function analysis of proteins, plant pathways databases (BioCyc and Plant Reactome platforms), and descriptions of phenotypic traits and mutations. The web-based displays for phenotypes include the Genes and Quantitative Trait Loci (QTL) modules. Sequence based relationships are displayed in the Genomes module using the genome browser adapted from Ensembl, in the Maps module using the comparative map viewer (CMap) from GMOD, and in the Proteins module displays. BLAST is used to search for similar sequences. Literature supporting all the above data is organized in the Literature database. In addition, Gramene now hosts a variety of web services including a Distributed Annotation Server (DAS), BLAST and a public MySQL database. Twice a year, Gramene releases a major build of the database and makes interim releases to correct errors or to make important updates to software and/or data. Additionally you can access Gramene through an FTP site.
Proper citation: Gramene (RRID:SCR_002829) Copy
Database designed for web-based examination of the human erythroid transcriptome. The database is organized to provide a cytogenetic band position, a unique name as well as a concise annotation for each entry. Search queries may be performed by name, keyword or cytogenetic location. Search results are linked to primary sequence data and three major human genome browsers for access to information considered current at the time of each search. Hembase provides interested scientists and clinical hematologists with a genome-based approach toward the study of erythroid biology. Red blood cells in the circulation arise from hematopoietic stem cells that proliferate as erythroid progenitors and differentiate into erythroid precursor cells in response to the hormone erythropoietin. Messenger RNA was isolated from those cells and used to generate gene libraries. Sequencing several thousand expressed sequence tags (EST) from those libraries was then performed. Those EST and sequences encoding several hundred additional genes with known expression in erythroid cells are compiled here as a database of human erythroid gene activity. The database is organized and linked according to the location of these sequences within the human genome., THIS RESOURCE IS NO LONGER IN SERVICE. Documented on January 15,2026.
Proper citation: HemBase (RRID:SCR_002880) Copy
Can't find your Tool?
We recommend that you click next to the search bar to check some helpful tips on searches and refine your search firstly. Alternatively, please register your tool with the SciCrunch Registry by adding a little information to a web form, logging in will enable users to create a provisional RRID, but it not required to submit.
Welcome to the NIF Resources search. From here you can search through a compilation of resources used by NIF and see how data is organized within our community.
You are currently on the Community Resources tab looking through categories and sources that NIF has compiled. You can navigate through those categories from here or change to a different tab to execute your search through. Each tab gives a different perspective on data.
If you have an account on NIF then you can log in from here to get additional features in NIF such as Collections, Saved Searches, and managing Resources.
Here is the search term that is being executed, you can type in anything you want to search for. Some tips to help searching:
You can save any searches you perform for quick access to later from here.
We recognized your search term and included synonyms and inferred terms along side your term to help get the data you are looking for.
If you are logged into NIF you can add data records to your collections to create custom spreadsheets across multiple sources of data.
Here are the sources that were queried against in your search that you can investigate further.
Here are the categories present within NIF that you can filter your data on
Here are the subcategories present within this category that you can filter your data on
If you have any further questions please check out our FAQs Page to ask questions and see our tutorials. Click this button to view this tutorial again.