Are you sure you want to leave this community? Leaving the community will revoke any permissions you have been granted in this community.
SciCrunch Registry is a curated repository of scientific resources, with a focus on biomedical resources, including tools, databases, and core facilities - visit SciCrunch to register your resource.
http://bejerano.stanford.edu/prism/public/html/
THIS RESOURCE IS NO LONGER IN SERVICE. Documented on May 5,2022.Tool that predicts interactions between transcription factors and their regulated genes from binding motifs. Understanding vertebrate development requires unraveling the cis-regulatory architecture of gene regulation. PRISM provides accurate genome-wide computational predictions of transcription factor binding sites for the human and mouse genomes, and integrates the predictions with GREAT to provide functional biological context. Together, accurate computational binding site prediction and GREAT produce for each transcription factor: 1. putative binding sites, 2. putative target genes, 3. putative biological roles of the transcription factor, and 4. putative cis-regulatory elements through which the factor regulates each target in each functional role., THIS RESOURCE IS NO LONGER IN SERVICE. Documented on September 16,2025.
Proper citation: PRISM (Stanford database) (RRID:SCR_005375) Copy
https://code.google.com/p/knime4bio/
A set of custom nodes for the KNIME (The Konstanz Information Miner) graphical workbench, for analysing next-generation sequencing (NGS) data without the requirement of programming skills.
Proper citation: Knime4Bio (RRID:SCR_005376) Copy
http://ergatis.sourceforge.net/
A web interface and scalable software system for bioinformatics workflows that is used to create, run, and monitor reusable computational analysis pipelines. It contains pre-built components for common bioinformatics analysis tasks. These components can be arranged graphically to form highly-configurable pipelines. Each analysis component supports multiple output formats, including the Bioinformatic Sequence Markup Language (BSML). The current implementation includes support for data loading into project databases following the CHADO schema, a highly normalized, community-supported schema for storage of biological annotation data. Ergatis uses the Workflow engine to process its work on a compute grid. Workflow provides an XML language and processing engine for specifying the steps of a computational pipeline. It provides detailed execution status and logging for process auditing, facilitates error recovery from point of failure, and is highly scalable with support for distributed computing environments. The XML format employed enables commands to be run serially, in parallel, and in any combination or nesting level.
Proper citation: Ergatis (RRID:SCR_005377) Copy
An algorithm for detecting genomic structural variations at base-pair resolution using next-generation sequencing data. CREST uses pieces of DNA called soft clips to find structural variations. Soft clips are the DNA segments produced during sequencing that fail to properly align to the reference genome as the sample genome is reassembled. CREST uses the soft clips to precisely identify sites of chromosomal rearrangement or where pieces of DNA are inserted or deleted.
Proper citation: CREST (RRID:SCR_005257) Copy
http://sourceforge.net/projects/molbiolib/
A compact, portable, and extensively tested C++11 software framework and set of applications tailored to the demands of next-generation sequencing data and applicable to many other applications. It is designed to work with common file formats and data types used both in genomic analysis and general data analysis. A central relational-database-like Table class is a flexible and powerful object to intuitively represent and work with a wide variety of tabular datasets, ranging from alignment data to annotations. MolBioLib includes programs to perform a wide variety of analysis tasks such as computing read coverage, annotating genomic intervals, and novel peak calling with a wavelet algorithm. This package assumes fluency in both UNIX and C++.
Proper citation: MolBioLib (RRID:SCR_005372) Copy
https://code.google.com/p/phenoman/
An interactive software program that integrates phenotypic data exploration, selection, management and quality control using a unified platform for association studies of rare and common variants.
Proper citation: PhenoMan (RRID:SCR_005249) Copy
http://www.bioextract.org/GuestLogin
An open, web-based system designed to aid researchers in the analysis of genomic data by providing a platform for the creation of bioinformatic workflows. Scientific workflows are created within the system by recording tasks performed by the user. These tasks may include querying multiple, distributed data sources, saving query results as searchable data extracts, and executing local and web-accessible analytic tools. The series of recorded tasks can then be saved as a reproducible, sharable workflow available for subsequent execution with the original or modified inputs and parameter settings. Integrated data resources include interfaces to the National Center for Biotechnology Information (NCBI) nucleotide and protein databases, the European Molecular Biology Laboratory (EMBL-Bank) non-redundant nucleotide database, the Universal Protein Resource (UniProt), and the UniProt Reference Clusters (UniRef) database. The system offers access to numerous preinstalled, curated analytic tools and also provides researchers with the option of selecting computational tools from a large list of web services including the European Molecular Biology Open Software Suite (EMBOSS), BioMoby, and the Kyoto Encyclopedia of Genes and Genomes (KEGG). The system further allows users to integrate local command line tools residing on their own computers through a client-side Java applet.
Proper citation: BioExtract (RRID:SCR_005397) Copy
http://statgenpro.psychiatry.hku.hk/limx/kggseq/
A biological Knowledge-based mining platform for Genomic and Genetic studies using Sequence data. The software platform, constituted of bioinformatics and statistical genetics functions, makes use of valuable biologic resources and knowledge for sequencing-based genetic mapping of variants / genes responsible for human diseases / traits. It facilitates geneticists to fish for the genetic determinants of human diseases / traits in the big sea of DNA sequences. KGGSeq has paid attention to downstream analysis of genetic mapping. The framework was implemented to filter and prioritize genetic variants from whole exome sequencing data.
Proper citation: KGGSeq (RRID:SCR_005311) Copy
http://soap.genomics.org.cn/soapindel.html
Software focusing on calling indels from the next-generation paired-end sequencing data.
Proper citation: SOAPindel (RRID:SCR_005272) Copy
http://www.bioconductor.org/packages/2.12/bioc/html/PING.html
Software program for probabilistic inference of ChIP-Seq using an empirical Bayes mixture model approach.
Proper citation: PING (RRID:SCR_005394) Copy
http://code.google.com/p/rseqc/
Software package to comprehensively evaluate different aspects of RNA-seq experiments, such as sequence quality, GC bias, polymerase chain reaction bias, nucleotide composition bias, sequencing depth, strand specificity, coverage uniformity and read distribution over the genome structure. RSeQC takes both SAM and BAM files as input, which can be produced by most RNA-seq mapping tools as well as BED files, which are widely used for gene models.
Proper citation: RSeQC (RRID:SCR_005275) Copy
http://bioapps.sabanciuniv.edu/mugex/v02/
Service that automatically extracts mutation-gene pairs from MEDLINE abstracts for a given disease.
Proper citation: MuGeX (RRID:SCR_005306) Copy
http://www.broadinstitute.org/software/scripture/
Software for transcriptome reconstruction that relies solely on RNA-Seq reads and an assembled genome to build a transcriptome ab initio. The statistical methods to estimate read coverage significance are also applicable to other sequencing data. Scripture also has modules for ChIP-Seq peak calling.
Proper citation: Scripture (RRID:SCR_005269) Copy
Tool for searching sequence databases for homologs of protein sequences, and for making protein sequence alignments. It implements methods using probabilistic models called profile hidden Markov models (profile HMMs). Compared to BLAST, FASTA, and other sequence alignment and database search tools based on older scoring methodology, HMMER aims to be significantly more accurate and more able to detect remote homologs because of the strength of its underlying mathematical models. In the past, this strength came at significant computational expense, but in the new HMMER3 project, HMMER is now essentially as fast as BLAST.
Proper citation: Hmmer (RRID:SCR_005305) Copy
http://bioinformatics.mdanderson.org/main/SpliceSeq:Overview
A Java application to investigate alternative mRNA splicing patterns in data from high-throughput mRNA sequencing studies. Sequence reads are mapped to splice graphs that unambiguously quantify the inclusion level of each exon and splice junction. The graphs are then traversed to predict the protein isoforms that are likely to result from the observed exon and splice junction reads. UniProt annotations are mapped to each protein isoform to identify potential functional impacts of alternative splicing. This tool may be used on a single RNASeq sample to identify genes with multiple spliceforms, on a pair of samples to identify differential splicing between the two, or on groups of samples to identify statistically significant group level differences in splicing patterns. SpliceSeq can be run from the install page as a java web start application to explore the sequencing data on their server or can be installed locally to analyze your own mRNA-Seq data.
Proper citation: SpliceSeq (RRID:SCR_005267) Copy
http://splitread.sourceforge.net/
Software for detecting INDELs (small insertions and deletion with size less than 50bp) as well as large deletions that are within the coding regions from the exome sequencing data. It also can be applied to the whole genome sequencing data.
Proper citation: SPLITREAD (RRID:SCR_005264) Copy
A Comprehensive Bioinformatics Scientific Workflow Module for Distributed Analysis of Large-Scale Biological Data that is distributed on top of the core Kepler scientific workflow system.
Proper citation: bioKepler (RRID:SCR_005385) Copy
http://code.google.com/p/hydra-sv/
Software that detects structural variation (SV) breakpoints by clustering discordant paired-end alignments whose signatures corroborate the same putative breakpoint. Hydra can detect breakpoints caused by all classes of structural variation. Moreover, it was designed to detect variation in both unique and duplicated genomic regions; therefore, it will examine paired-end reads having multiple discordant alignments. Hydra does not attempt to classify SV breakpoints based on the mapping distances and orientations of each breakpoint cluster, it merely detects and reports breakpoints. This is an intentional decision, as it was observed that in loci affected by complex rearrangements, the type of variant suggested by the breakpoint signature is not always correct. Hydra does report the orientations, distances, number of supporting read-pairs, etc., for each breakpoint. It is suggested that downstream methods be used to classify variants based on the genomic features that they overlap and the co-occurrence of other breakpoints. For example, they developed BEDTools for exactly this purpose and the breakpoints reported by Hydra are in the BEDPE format used by BEDTools. Future releases of Hydra will include scripts that assist in the classification process.
Proper citation: Hydra (RRID:SCR_005260) Copy
http://bioinformatics.ua.pt/becas/
Web application, API and widget able to recognize and annotate biomedical concepts in text.Provides annotations for isolated, nested and intersected entities.Identifies concepts from multiple semantic groups, providing preferred names and enriching them with references to public knowledge resources.
Proper citation: becas (RRID:SCR_005337) Copy
https://code.google.com/p/mirpara/
A SVM (support vector machine-based software tool for prediction of most probable microRNA coding regions in genome scale sequences.
Proper citation: MiRPara (RRID:SCR_005294) Copy
Can't find your Tool?
We recommend that you click next to the search bar to check some helpful tips on searches and refine your search firstly. Alternatively, please register your tool with the SciCrunch Registry by adding a little information to a web form, logging in will enable users to create a provisional RRID, but it not required to submit.
Welcome to the NIF Resources search. From here you can search through a compilation of resources used by NIF and see how data is organized within our community.
You are currently on the Community Resources tab looking through categories and sources that NIF has compiled. You can navigate through those categories from here or change to a different tab to execute your search through. Each tab gives a different perspective on data.
If you have an account on NIF then you can log in from here to get additional features in NIF such as Collections, Saved Searches, and managing Resources.
Here is the search term that is being executed, you can type in anything you want to search for. Some tips to help searching:
You can save any searches you perform for quick access to later from here.
We recognized your search term and included synonyms and inferred terms along side your term to help get the data you are looking for.
If you are logged into NIF you can add data records to your collections to create custom spreadsheets across multiple sources of data.
Here are the sources that were queried against in your search that you can investigate further.
Here are the categories present within NIF that you can filter your data on
Here are the subcategories present within this category that you can filter your data on
If you have any further questions please check out our FAQs Page to ask questions and see our tutorials. Click this button to view this tutorial again.