Are you sure you want to leave this community? Leaving the community will revoke any permissions you have been granted in this community.
SciCrunch Registry is a curated repository of scientific resources, with a focus on biomedical resources, including tools, databases, and core facilities - visit SciCrunch to register your resource.
https://github.com/adrlar/CanSNPer
Software that is a hierarchical genotype classifier of clonal pathogens.
Proper citation: CanSNPer (RRID:SCR_001980) Copy
http://sourceforge.net/projects/denovoassembler/files/
Software that assembles reads obtained with new sequencing technologies (Illumina, 454, SOLiD) using MPI 2.2.
Proper citation: Ray (RRID:SCR_001916) Copy
http://cakesomatic.sourceforge.net/
A bioinformatics software pipeline that integrates four publicly available somatic variant-calling algorithms to identify single nucleotide variants with higher sensitivity and accuracy than any one algorithm alone.
Proper citation: Cake (RRID:SCR_002133) Copy
http://petrov.stanford.edu/cgi-bin/Tlex.html
Software package for fast and accurate discovery, annotation, re-annotation and population analysis of Transposable Elements using Next-Generation Sequencing data.
Proper citation: T-lex (RRID:SCR_005134) Copy
http://woldlab.caltech.edu/rnaseq
Software for Mapping and Quantifying Mammalian Transcriptomes by RNA-Seq. Its functions are to (i) assign reads that map uniquely in the genome to their site of origin and, for reads that match equally well to several sites (''multireads''), assign them to their most likely site(s) of origin; (ii) detect splice-crossing reads and assign them to their gene of origin; (iii) organize reads that cluster together, but do not map to an already known exon, into candidate exons or parts of exons; and (iv) calculate the prevalence of transcripts from each known or newly proposed RNA, based on normalized counts of unique reads, spliced reads and multireads. The new candidate RNA regions produced can be thought of as ESTs, and, like ESTs, some are provisionally appended to existing gene models if they meet several additional criteria. Remaining unassigned candidate transcribed regions (labeled RNAFAR features) can then be used in conjunction with other confirming data to develop new or revised gene models.
Proper citation: ERANGE (RRID:SCR_005240) Copy
http://www.broadinstitute.org/cancer/cga/rna-seqc
Java software which computes a series of quality control metrics for RNA-seq data and can compare sequencing quality across different samples or experiments to evaluate different experimental parameters. The input can be one or more BAM files, and the output consists of HTML reports and tab delimited files of metrics data.
Proper citation: RNA-SeQC (RRID:SCR_005120) Copy
http://cbrc.kaust.edu.sa/readscan/
A highly scalable parallel software program to identify non-host sequences (of potential pathogen origin) and estimate their genome relative abundance in high-throughput sequence datasets.
Proper citation: READSCAN (RRID:SCR_005204) Copy
http://odin.mdacc.tmc.edu/~xsu1/VirusSeq.html
An algorithmic software tool for detecting known viruses and their integration sites using next-generation sequencing of human cancer tissue. VirusSeq takes FASTQ files (paired-end reads) as input.
Proper citation: VirusSeq (RRID:SCR_005206) Copy
http://smithlab.usc.edu/methpipe/
A computational pipeline for analyzing bisulfite sequencing data.
Proper citation: MethPipe (RRID:SCR_005168) Copy
http://snpeff.sourceforge.net/
Genetic variant annotation and effect prediction software toolbox that annotates and predicts effects of variants on genes (such as amino acid changes). By using standards, such as VCF, SnpEff makes it easy to integrate with other programs.
Proper citation: SnpEff (RRID:SCR_005191) Copy
https://code.google.com/p/simrare/
A stand-alone executable software with user-friendly graphical interface implemented in Python/C++ for rare variant association studies. It is designed as a unified simulation framework to provide an unbiased and easy manner to evaluate association methods, including novel methods, under a broad range of choice of biological contexts. It consists of three modules, variant data simulator, genotype/phenotype generator and association method evaluator. SimRare generates variant data for gene regions using forward-time simulation which incorporates realistic population demographic and evolutionary scenarios. For phenotype data it is capable of generating both case-control and quantitative traits. The phenotypic effects of variants can be detrimental, protective or non-causal. SimRare has a graphical user interface which allows for easy entry of genetic and phenotypic parameters. Simulated data can be written into external files in a standard format. For novel association method implemented in R it can be imported into SimRare, which has been equipped built in functions to evaluate performance of new method and visually compare it with currently available ones in an unbiased manner.
Proper citation: SimRare (RRID:SCR_005226) Copy
https://github.com/ruping/Breakpointer
A fast tool for locating sequence breakpoints from the alignment of single end reads (SE) produced by next generation sequencing (NGS). It adopts a heuristic method in searching for local mapping signatures created by insertion/deletions (indels) or more complex structural variants(SVs). With current NGS single-end sequencing data, the output regions by Breakpoint mainly contain the approximate breakpoints of indels and a limited number of large SVs. Notably, Breakpointer can uncover breakpoints of insertions which are longer than the read length. Breakpointer also can find breakpoints of many variants located in repetitive regions. The regions can be used not only as a extra support for SV predictions by other tools (such as by split-read method), but also can serve as a database for searching variants which might be missed by other tools. Breakpointer is a command line tool that runs under linux system. Breakpointer takes advanage of two local mapping features of single-end reads as a consequence of indel/SVs: 1) non-uniform read distribution (depth skewness) and 2) misalignments at the boundaries of indel/SVs. These features are summarized as breakpoint signature. Breakpointer proceeds in three stages in capturing this signature. It is implemented in C++ and perl. Input is the file or files containing alignments of single-end reads against a reference genome (in .BAM format). Output is the predicted regions containing potential breakpoints of SVs (in .GFF format). To be able to read in .BAM files, Breakpointer requires bamtools API, which users should install beforehand.
Proper citation: Breakpointer (RRID:SCR_005254) Copy
http://www.genoscope.cns.fr/externe/gmorse/
Software aimed at using RNA-Seq short reads to build de novo gene models. First, candidate exons are built directly from the positions of the reads mapped on the genome (without any ab initio assembly of the reads), and all the possible splice junctions between those exons are tested against unmapped reads : the testing of junctions is directed by the information available in the RNA-Seq dataset rather than a priori knowledge about the genome. Exons can thus be chained into stranded gene models.
Proper citation: G-Mo.R-Se (RRID:SCR_005273) Copy
Software mining pipeline guided by a Bayesian principle to detect single nucleotide polymorphisms, insertion and deletions by comparing high-throughput pyrosequencing reads with a reference genome of related organisms. This pipeline is extended to identify and visualize large-size structural variations, including insertions, deletions, inversions and translocations.
Proper citation: inGAP (RRID:SCR_005261) Copy
http://sv.gersteinlab.org/pemer/
Software package as computational framework with simulation-based error models for inferring genomic structural variants from massive paired-end sequencing data. Package is composed of three modules, PEMer workflow, SV-Simulation and BreakDB. PEMer workflow is a sensitive software for detecting SVs from paired-end sequence reads. SV-Simulation randomly introduces SVs into a given genome and generates simulated paired-end reads from novel genome.
Proper citation: PEMer (RRID:SCR_005263) Copy
https://code.google.com/p/phantompeakqualtools/
Software package that computes quick but highly informative enrichment and quality measures for ChIP-seq/DNase-seq/FAIRE-seq/MNase-seq data. It can also be used to obtain robust estimates of the predominant fragment length or characteristic tag shift values in these assays.
Proper citation: phantompeakqualtools (RRID:SCR_005331) Copy
http://research.cs.wisc.edu/wham/
THIS RESOURCE IS NO LONGER IN SERVICE. Documented on February 28,2023. High-throughput sequence alignment tool that aligns short DNA sequences (reads) to the whole human genome at a rate of over 1500 million 60bps reads per hour, which is one to two orders of magnitudes faster than the leading state-of-the-art techniques. Feature list for the current version (v 0.1.5) of WHAM: * Supports paired-end reads * Supports up to 5 errores * Supports alignments with gaps * Supports quality scores for filtering invalid alignments, and sorting valid alignments * finds ALL valid alignments * Supports multi-threading * Supports rich reporting modes * Supports SAM format output
Proper citation: WHAM (RRID:SCR_005497) Copy
http://www-personal.umich.edu/~jianghui/seqmap/
A software tool for mapping large amount of oligonucleotide to the genome. It is designed for finding all the places in a genome where an oligonucleotide could potentially come from. SeqMap can efficiently map as many as dozens of millions of short sequences to a genome of several billions of nucleotides. While doing the mapping, several mutations as well as insertions / deletions of the nucleotide bases in the sequences can be tolerated and furthermore detected. Various input and output formats are supported, as well as many command line options for tuning almost every steps in the mapping process. A typical mapping can be done in a few hours on an ordinary PC.
Proper citation: SeqMap (RRID:SCR_005495) Copy
http://www.genome.umd.edu/jellyfish.html
A software tool for fast, memory-efficient counting of k-mers in DNA. A k-mer is a substring of length k, and counting the occurrences of all such substrings is a central step in many analyses of DNA sequence. JELLYFISH can count k-mers quickly by using an efficient encoding of a hash table and by exploiting the compare-and-swap CPU instruction to increase parallelism. Jellyfish is a command-line program that reads FASTA and multi-FASTA files containing DNA sequences. It outputs its k-mer counts in an binary format, which can be translated into a human-readable text format using the jellyfish dump command., THIS RESOURCE IS NO LONGER IN SERVICE. Documented on September 16,2025.
Proper citation: Jellyfish (RRID:SCR_005491) Copy
http://mrfast.sourceforge.net/
Software designed to map short reads generated with the Illumina platform to reference genome assemblies; in a fast and memory-efficient mannerl. Currently Supported Features: * Output in SAM format * Indels up to 8 bp (4 bp deletions and 4 bp insertions) * Paired-end mapping ** Discordant option to generate mapping file ready for VariationHunter to detect structural variants. * One end anchored (OEA) map locations for novel sequence insertion detection with NovelSeq * Matepair library mapping (long inserts with RF orientation). Planned Features: * Multithreading
Proper citation: mrFAST (RRID:SCR_005487) Copy
Can't find your Tool?
We recommend that you click next to the search bar to check some helpful tips on searches and refine your search firstly. Alternatively, please register your tool with the SciCrunch Registry by adding a little information to a web form, logging in will enable users to create a provisional RRID, but it not required to submit.
Welcome to the NIF Resources search. From here you can search through a compilation of resources used by NIF and see how data is organized within our community.
You are currently on the Community Resources tab looking through categories and sources that NIF has compiled. You can navigate through those categories from here or change to a different tab to execute your search through. Each tab gives a different perspective on data.
If you have an account on NIF then you can log in from here to get additional features in NIF such as Collections, Saved Searches, and managing Resources.
Here is the search term that is being executed, you can type in anything you want to search for. Some tips to help searching:
You can save any searches you perform for quick access to later from here.
We recognized your search term and included synonyms and inferred terms along side your term to help get the data you are looking for.
If you are logged into NIF you can add data records to your collections to create custom spreadsheets across multiple sources of data.
Here are the sources that were queried against in your search that you can investigate further.
Here are the categories present within NIF that you can filter your data on
Here are the subcategories present within this category that you can filter your data on
If you have any further questions please check out our FAQs Page to ask questions and see our tutorials. Click this button to view this tutorial again.