Are you sure you want to leave this community? Leaving the community will revoke any permissions you have been granted in this community.
SciCrunch Registry is a curated repository of scientific resources, with a focus on biomedical resources, including tools, databases, and core facilities - visit SciCrunch to register your resource.
https://github.com/tk2/RetroSeq
A tool for discovery and genotyping of transposable element variants (TEVs) (also known as mobile element insertions) from next-gen sequencing reads aligned to a reference genome in BAM format. The goal is to call TEVs that are not present in the reference genome but present in the sample that has been sequenced. It should be noted that RetroSeq can be used to locate any class of viral insertion in any species where whole-genome sequencing data with a suitable reference genome is available. RetroSeq is a two phase process, the first being the read pair discovery phase where discorandant mate pairs are detected and assigned to a TE class (Alu, SINE, LINE, etc.) by using either the annotated TE elements in the reference and/or aligned with Exonerate to the supplied library of viral sequences.
Proper citation: RetroSeq (RRID:SCR_005133) Copy
https://github.com/cwhelan/cloudbreak
Software providing a Hadoop-based genomic structural variation (SV) caller for Illumina paired-end DNA sequencing data. It contains a full pipeline for aligning data in the form of FASTQ files using alignment pipelines that generate many possible mappings for every read, in the Hadoop framework. It then contains Hadoop jobs for computing genomic features from the alignments, and for calling insertion and deletion variants from those features.
Proper citation: Cloudbreak (RRID:SCR_005097) Copy
http://yost.genetics.utah.edu/software.php
A software analysis pipeline for mapping mutations using RNA-seq that works without parental strain information, without the requirement of a pre-existing snp map of the organism, and without erroneous assumptions that recombination occurs at the same frequency across the genome. In addition, it compensates for the considerable amount of noise in RNA-seq datasets and simultaneously identifies the region where the mutation lies and generates a list of putative coding region mutations in the linked genomic segment. MMAPPR can utilize RNA-seq datasets from isolated tissues or whole organisms that are often generated for phenotypic analysis and gene network analysis in novel mutants.
Proper citation: MMAPPR (RRID:SCR_005092) Copy
http://www.omicsoft.com/fusionmap/
An efficient fusion aligner which aligns reads spanning fusion junctions directly to the genome without prior knowledge of potential fusion regions. It detects and characterizes fusion junctions at base-pair resolution. FusionMap can be applied to detect fusion junctions in both single- and paired-end dataset from either gDNA-Seq or RNA-Seq studies. FusionMap runs under both Windows and Linux (requiring MONO) environments. Although it can run on 32 bit machine, it is recommended to run on 64-bit machine with 8GB RAM or more. If you have an ArrayStudio License, you can run the fusion detection easily through its GUI.
Proper citation: FusionMap (RRID:SCR_005242) Copy
http://www.cs.helsinki.fi/en/gsa/traph/
A software tool for transcript identification and quantification with RNA-Seq. The method has a two-fold advantage: on the one hand, it translates the problem as an established one in the field of network flows, which can be solved in polynomial time, with different existing solvers; on the other hand, it is general enough to encompass many of the previous proposals under the least sum of squares model.
Proper citation: Traph (RRID:SCR_005119) Copy
http://www.raetschlab.org/suppl/rquant
Software for quantitative detection of alternative transcripts with RNA-Seq data. The method, based on quadratic programming, estimates biases introduced by experimental settings and is thus a powerful tool to reveal and quantify novel (alternative) transcripts.
Proper citation: rQuant (RRID:SCR_005150) Copy
http://www.bsse.ethz.ch/cbg/software/shorah
A software package that allows for inference about the structure of a population from a set of short sequence reads as obtained from ultra-deep sequencing of a mixed sample. The package contains programs that support mapping of reads to a reference genome, correcting sequencing errors by locally clustering reads in small windows of the alignment, reconstructing a minimal set of global haplotypes that explain the reads, and estimating the frequencies of the inferred haplotypes.
Proper citation: ShoRAH (RRID:SCR_005211) Copy
http://www.broadinstitute.org/scientific-community/science/projects/viral-genomics/v-phaser-2
A software tool to call variants in genetically heterogeneous populations from ultra-deep sequence data. It combines information regarding the covariation (i.e. phasing) between observed variants to increase sensitivity and an expectation maximization algorithm that iteratively recalibrates base quality scores to increase specificity. V-Phaser can reliably detect rare variants in diverse populations that occur at frequencies of <1%. V-Phaser 2 is a complete rewrite of the original V-Phaser. It contains a new model for length polymorphisms (indels) and incorporates paired end read information in its phasing model. The data access and probability computation sections of the code have also been highly optimized, resulting in substantial improvements in running time and memory usage.
Proper citation: V-Phaser 2 (RRID:SCR_005212) Copy
https://sites.google.com/site/nsmapforrnaseq/
Software designed to identify and quantify isoforms from RNA-seq by incorporating a sparsity term into expression level estimation to enable isoform structure prediction and expression estimation simultaneously.
Proper citation: NSMAP (RRID:SCR_005213) Copy
http://sourceforge.net/projects/cova/
A variant annotation and comparison tool for next-generation sequencing. It annotates the effects of variants on genes and compares those among multiple samples, which helps to pinpoint causal variation(s) relating to phenotype.
Proper citation: COVA (RRID:SCR_005175) Copy
http://bmda.cs.unibas.ch/HivHaploTyper/
Software for reconstructing haplotypes from next-generation sequencing data., THIS RESOURCE IS NO LONGER IN SERVICE. Documented on September 16,2025.
Proper citation: PredictHaplo (RRID:SCR_005207) Copy
http://sourceforge.net/projects/qure/
A software program for viral quasispecies reconstruction, specifically developed to analyze long read (>100 bp) next-generation sequencing (NGS) data. The software performs alignments of sequence fragments against a reference genome, finds an optimal division of the genome into sliding windows based on coverage and diversity and attempts to reconstruct all the individual sequences of the viral quasispecies--along with their prevalence--using a heuristic algorithm, which matches multinomial distributions of distinct viral variants overlapping across the genome division. QuRe comes with a built-in Poisson error correction method and a post-reconstruction probabilistic clustering, both parameterized on given error rates in homopolymeric and non-homopolymeric regions.
Proper citation: QuRe (RRID:SCR_005209) Copy
http://bioinfo.mc.vanderbilt.edu/VirusFinder/
Software tool for efficient and accurate detection of viruses and their integration sites in host genomes through next generation sequencing data. Specifically, it detects virus infection, co-infection with multiple viruses, virus integration sites in host genomes, as well as mutations in the virus genomes. It also facilitates virus discovery by reporting novel contigs, long sequences assembled from short reads that map neither to the host genome nor to the genomes of known viruses. VirusFinder 2 works with both paired-end and single-end data, unlike the previous 1.x versions that accepted only paired-end reads. The types of NGS data that VirusFinder 2 can deal with include whole genome sequencing (WGS), whole transcriptome sequencing (RNA-Seq), targeted sequencing data such as whole exome sequencing (WES) and ultra-deep amplicon sequencing.
Proper citation: VirusFinder (RRID:SCR_005205) Copy
Software designed as a user friendly solution to extract and annotate biologically important transcripts from next generation RNA sequencing data.
Proper citation: RNA-eXpress (RRID:SCR_005167) Copy
http://unoseq.sourceforge.net/
A Java library to analyze next generation sequencing data and especially perform expression profiling in organisms where no well-annotated reference genome exists.
Proper citation: UnoSeq (RRID:SCR_005116) Copy
NIH established expectations for sharing data obtained through NIH-funded genome-wide association studies (GWAS) with the implementation of the GWAS Policy. Information and resources related to the GWAS Policy can be found on this website.
Proper citation: Genomic Datasharing (RRID:SCR_005233) Copy
http://bioinformatics.wistar.upenn.edu/isoformex
Software that estimates transcript expression levels and gene expression levels from mRNA-Seq data. Technically speaking, IsoformEx parses bowtie alignment files in a project directory (e.g. ~yourid/isoformex/xxx, where xxx is the project name) and generates two files: (1) xxx/xxx_transcript_1.txt: expression levels of all transcripts, (2) xxx/xxx_gene_1.txt: expression levels of all genes.
Proper citation: IsoformEx (RRID:SCR_005235) Copy
http://code.google.com/p/aldex/
RNA-seq tool that uses the Dirichlet distribution and a transformation to identify genes that exhibit small within-condition and large between-condition variance.
Proper citation: aldex (RRID:SCR_005110) Copy
http://www.broadinstitute.org/cancer/cga/absolute
Software to estimate purity / ploidy, and from that compute absolute copy-number and mutation multiplicities. When DNA is extracted from an admixed population of cancer and normal cells, the information on absolute copy number per cancer cell is lost in the mixing. The purpose of ABSOLUTE is to re-extract these data from the mixed DNA population. This process begins by generation of segmented copy number data, which is input to the ABSOLUTE algorithm together with pre-computed models of recurrent cancer karyotypes and, optionally, allelic fraction values for somatic point mutations. The output of ABSOLUTE then provides re-extracted information on the absolute cellular copy number of local DNA segments and, for point mutations, the number of mutated alleles.
Proper citation: ABSOLUTE (RRID:SCR_005198) Copy
http://variant.bioinfo.cipf.es/
Analysis tool that can report the functional properties of any variant in all the human, mouse or rat genes (and soon new model organisms will be added) and the corresponding neighborhoods. Also other non-coding extra-genic regions, such as miRNAs are included in the analysis. It not only reports the obvious functional effects in the coding regions but also analyzes noncoding SNVs situated both within the gene and in the neighborhood that could affect different regulatory motifs, splicing signals, and other structural elements. These include: Jaspar regulatory motifs, miRNA targets, splice sites, exonic splicing silencers, calculations of selective pressures on the particular polymorphic positions, etc. Software analysis pipelines used in the analysis of NGS data are highly modular, heterogeneous, and rapidly evolving. VARIANT can easily be incorporated into a NGS resequencing pipeline either as a CLI or invoked a webservice. It inputs data directly from the most widely used programs for SNV detection., THIS RESOURCE IS NO LONGER IN SERVICE. Documented on September 16,2025.
Proper citation: VARIANT (RRID:SCR_005194) Copy
Can't find your Tool?
We recommend that you click next to the search bar to check some helpful tips on searches and refine your search firstly. Alternatively, please register your tool with the SciCrunch Registry by adding a little information to a web form, logging in will enable users to create a provisional RRID, but it not required to submit.
Welcome to the NIF Resources search. From here you can search through a compilation of resources used by NIF and see how data is organized within our community.
You are currently on the Community Resources tab looking through categories and sources that NIF has compiled. You can navigate through those categories from here or change to a different tab to execute your search through. Each tab gives a different perspective on data.
If you have an account on NIF then you can log in from here to get additional features in NIF such as Collections, Saved Searches, and managing Resources.
Here is the search term that is being executed, you can type in anything you want to search for. Some tips to help searching:
You can save any searches you perform for quick access to later from here.
We recognized your search term and included synonyms and inferred terms along side your term to help get the data you are looking for.
If you are logged into NIF you can add data records to your collections to create custom spreadsheets across multiple sources of data.
Here are the sources that were queried against in your search that you can investigate further.
Here are the categories present within NIF that you can filter your data on
Here are the subcategories present within this category that you can filter your data on
If you have any further questions please check out our FAQs Page to ask questions and see our tutorials. Click this button to view this tutorial again.