Are you sure you want to leave this community? Leaving the community will revoke any permissions you have been granted in this community.
SciCrunch Registry is a curated repository of scientific resources, with a focus on biomedical resources, including tools, databases, and core facilities - visit SciCrunch to register your resource.
http://sourceforge.net/projects/autoassemblyd/
Software which performs local and remote genome assembly by several assemblers based on an XML Template which can replace the large command lines required by most assemblers.
Proper citation: AutoAssemblyD (RRID:SCR_001087) Copy
http://www.genome.jp/kegg/expression/
Database for mapping gene expression profiles to pathways and genomes. Repository of microarray gene expression profile data for Synechocystis PCC6803 (syn), Bacillus subtilis (bsu), Escherichia coli W3110 (ecj), Anabaena PCC7120 (ana), and other species contributed by the Japanese research community.
Proper citation: Kyoto Encyclopedia of Genes and Genomes Expression Database (RRID:SCR_001120) Copy
https://github.com/Ensembl/WiggleTools
A multithreaded software library that computes statistics on large numbers of datasets, generating statistical summaries within minutes with limited memory requirements, whether on the whole genome or on selected regions.
Proper citation: WiggleTools (RRID:SCR_001170) Copy
http://www.biobase-international.com/product/genome-trax
Service that provides a comprehensive compilation of variant knowledge that allows you to identify pathogenic variants in human whole genome or exome sequences. It makes it easy to upload a complete genome?s worth of variations and identify the biologically relevant subset of known mutations, mutations that are novel and appear in a candidate disease genes, or mutations that are predicted to have a deleterious effect. The database includes a comprehensive collection of disease causing mutations from HGMD Professional, regulatory sites from TRANSFAC , and disease genes, drug targets and pathways from PROTEOME, as well as pharmacogenomic variants. It integrates the best public data-sets on somatic mutations, allele frequencies and clinical variants, in their most up-to-date version, for a total of more than 165 million annotations. It is possible to identify known pathogenic variants, remove harmless common variants, and obtain deleterious predictions for novel variants. With family data, it is possible to identify variants that are de novo, compound heterozygous only in the offspring. All of the results can be downloaded to Excel for further review. For core facilities and bioinformaticians, the complete underlying data is made available for download and easy integration into custom analysis pipelines. Genome Trax data is optimized to work with many other software packages, such as ANNOVARTM, CLC bio, Alamut, SimulConsult, and Cartagenia.
Proper citation: Genome Trax (RRID:SCR_001234) Copy
http://www.zbh.uni-hamburg.de/?id=211
A collection of flexible and memory-efficient software programs for k-mer counting and indexing of large sequence sets. It is based on enhanced suffix arrays which gives a much larger flexibility concerning the choice of the k-mer size. It can process large data sizes of several billion bases.
Proper citation: TALLYMER (RRID:SCR_001244) Copy
https://dalexander.github.io/admixture/download.html
A software tool for maximum likelihood estimation of individual ancestries from multilocus SNP genotype datasets. It uses the same statistical model as STRUCTURE but calculates estimates much more rapidly using a fast numerical optimization algorithm. It uses a block relaxation approach to alternately update allele frequency and ancestry fraction parameters. Each block update is handled by solving a large number of independent convex optimization problems, which are tackled using a fast sequential quadratic programming algorithm. Convergence of the algorithm is accelerated using a novel quasi-Newton acceleration method., THIS RESOURCE IS NO LONGER IN SERVICE. Documented on September 16,2025.
Proper citation: ADMIXTURE (RRID:SCR_001263) Copy
http://med.stanford.edu/tanglab/software/frappe.html
Software using a f frequentist approach for estimating individual ancestry proportion.
Proper citation: frappe (RRID:SCR_001264) Copy
http://www.morpholinodatabase.org/
Central database to house data on morpholino screens currently containing over 700 morpholinos including control and multiple morpholinos against the same target. A publicly accessible sequence-based search opens this database for morpholinos against a particular target for the zebrafish community. Morpholino Screens: They set out to identify all cotranslationally translocated genes in the zebrafish genome (Secretome/CTT-ome). Morpholinos were designed against putative secreted/CTT targets and injected into 1-4 cell stage zebrafish embryos. The embryos were observed over a 5 day period for defects in several different systems. The first screen examined 184 gene targets of which 26 demonstrated defects of interest (Pickart et al. 2006). A collaboration with the Verfaillie laboratory examined the knockdown of targets identified in a comparative microarray analysis of hematopoietic stem cells demonstrating how microarray and morpholino technologies can be used in conjunction to enrich for defects in specific developmental processes. Currently, many collaborations are underway to identify genes involved in morphological, kidney, skin, eye, pigment, vascular and hematopoietic development, lipid metabolism and more. The screen types referred to in the search functions are the specific areas of development that were examined during the various screens, which include behavior, general morphology, pigmentation, toxicity, Pax2 expression, and development of the craniofacial structures, eyes, kidneys, pituitary, and skin. Only data pertaining to specific tests performed are presented. Due to the complexity of this international collaboration and time constraints, not all morpholinos were subjected to all screen types. They are currently expanding public access to the database. In the future we will provide: * Mortality curves and dose range for each morpholino * Preliminary data regarding the effectiveness of each morpholino * Expanded annotation for each morpholino * External linkage of our morpholino sequences to ZFIN and Ensembl. To submit morpholino-knockdown results to MODB please contact the administrator for a user name and password.
Proper citation: Morpholino Database (RRID:SCR_001378) Copy
Software resource for multiple whole genome alignment. It uses Nucmer, a custom graph-based segmentation procedure, for pairwise alignment, and the Seqan:TCoffee's multiple alignment strategy.
Proper citation: Mugsy (RRID:SCR_001414) Copy
Catalog of published genome-wide association studies. Genome-wide set of genetic variants in different individuals to see if any variant is associated with trait and disease. Database of genome-wide association study (GWAS) publications including only those attempting to assay single nucleotide polymorphisms (SNPs). Publications are organized from most to least recent date of publication. Studies are identified through weekly PubMed literature searches, daily NIH-distributed compilations of news and media reports, and occasional comparisons with an existing database of GWAS literature (HuGE Navigator). Works with HANCESTRO ancestry representation.
Proper citation: GWAS: Catalog of Published Genome-Wide Association Studies (RRID:SCR_012745) Copy
Integrated database resource consisting of 16 main databases, broadly categorized into systems information, genomic information, and chemical information. In particular, gene catalogs in completely sequenced genomes are linked to higher-level systemic functions of cell, organism, and ecosystem. Analysis tools are also available. KEGG may be used as reference knowledge base for biological interpretation of large-scale datasets generated by sequencing and other high-throughput experimental technologies.
Proper citation: KEGG (RRID:SCR_012773) Copy
A high-quality integrated knowledge resource specialized in the immunoglobulins (IG) or antibodies, T cell receptors (TR), major histocompatibility complex (MHC) of human and other vertebrate species, and in the immunoglobulin superfamily (IgSF), MHC superfamily (MhcSF) and related proteins of the immune system (RPI) of vertebrates and invertebrates, serving as the global reference in immunogenetics and immunoinformatics. IMGT provides a common access to sequence, genome and structure Immunogenetics data, based on the concepts of IMGT-ONTOLOGY and on the IMGT Scientific chart rules. IMGT works in close collaboration with EBI (Europe), DDBJ (Japan) and NCBI (USA). IMGT consists of sequence databases, genome database, structure database, and monoclonal antibodies database, Web resources and interactive tools.
Proper citation: IMGT - the international ImMunoGeneTics information system (RRID:SCR_012780) Copy
http://gmod.org/wiki/Flash_GViewer
Flash GViewer is a customizable Flash movie that can be easily inserted into a web page to display each chromosome in a genome along with the locations of individual features on the chromosomes. It is intended to provide an overview of the genomic locations of a specific set of features - eg. genes and QTLs associated with a specific phenotype, etc. rather than as a way to view all features on the genome. The features can hyperlink out to a detail page to enable to GViewer to be used as a navigation tool. In addition the bands on the chromosomes can link to defineable URL and new region selection sliders can be used to select a specific chromosome region and then link out to a genome browser for higher resolution information. Genome maps for Rat, Mouse, Human and C. elegans are provided but other genome maps can be easily created. Annotation data can be provided as static text files or produced as XML via server scripts. This tool is not GO-specific, but was built for the purpose of viewing GO annotation data. Platform: Online tool
Proper citation: Flash Gviewer (RRID:SCR_012870) Copy
http://www-sequence.stanford.edu/group/candida/
The Stanford Genome Technology Center began a whole genome shotgun sequencing of strain SC5314 of Candida albicans. After reaching its original goal of 1.5X mean coverage of the haploid genome (16Mb) in summer, 1998, Stanford was awarded a supplemental grant to continue sequencing up to a coverage of 10X, performing as much assembly of the sequence as possible, using recognizable genes as nucleation points. Candida albicans is one of the most commonly encountered human pathogens, causing a wide variety of infections ranging from mucosal infections in generally healthy persons to life-threatening systemic infections in individuals with impaired immunity. Oral and esophogeal Candida infections are frequently seen in AIDS patients. Few classes of drugs are effective against these fungal infections, and all of them have limitations with regard to efficacy and side-effects.
Proper citation: Sequencing of Candida Albicans (RRID:SCR_013437) Copy
Functional genomic database for malaria parasites. Database for Plasmodium spp. Provides resource for data analysis and visualization in gene-by-gene or genome-wide scale. PlasmoDB 5.5 contains annotated genomes, evidence of transcription, proteomics evidence, protein function evidence, population biology and evolution data. Data can be queried by selecting from query grid or drop down menus. Results can be combined with each other on query history page. Search results can be downloaded with associated functional data and registered users can store their query history for future retrieval or analysis.Key community database for malaria researchers, intersecting many types of laboratory and computational data, aggregated by gene.
Proper citation: PlasmoDB (RRID:SCR_013331) Copy
Database for ESTs (Expressed Sequence Tags), consensus sequences, bacterial artificial chromosome (BAC) clones, BES (BAC End Sequences). They have generated 69,545 ESTs from 6 full-length cDNA libraries (Porcine Abdominal Fat, Porcine Fat Cell, Porcine Loin Muscle, Liver and Pituitary gland). They have also identified a total of 182 BAC contigs from chromosome 6. It is very valuable resources to study porcine quantitative trait loci (QTL) mapping and genome study. Users can explore genomic alignment of various data types, including expressed sequence tags (ESTs), consensus sequences, singletons, QTL, Marker, UniGene and BAC clones by several options. To estimate the genomic location of sequence dataset, their data aligned BES (BAC End Sequences) instead of genomic sequence because Pig Genome has low-coverage sequencing data. Sus scrofa Genome Database mainly provide comparative map of four species (pig, cattle, dog and mouse) in chromosome 6.
Proper citation: PiGenome (RRID:SCR_013394) Copy
http://bioinformatics.psb.ugent.be/ENIGMA/
A software tool to extract gene expression modules from perturbational microarray data, based on the use of combinatorial statistics and graph-based clustering. The modules are further characterized by incorporating other data types, e.g. GO annotation, protein interactions and transcription factor binding information, and by suggesting regulators that might have an effect on the expression of (some of) the genes in the module. Version : ENIGMA 1.1 used GO annotation version : Aug 29th 2007
Proper citation: ENIGMA (RRID:SCR_013400) Copy
http://fnih.org/work/past-programs/genetic-association-information-network-gain
The Genetic Association Information Network (GAIN) supports a series of Genome-Wide Association Studies (GWAS) designed to identify specific points of DNA variation associated with the occurrence of a particular common disease. Initially focusing on six major common diseases, GAIN focused on combining the results with clinical data to create a significant new resource for genetic researchers.
Proper citation: Genetic Association Information Network (GAIN) (RRID:SCR_013703) Copy
A comparative platform for green plant genomics. Families of orthologous and paralogous genes that represent the modern descendents of ancestral gene sets are constructed at key phylogenetic nodes. These families allow easy access to clade specific orthology / paralogy relationships as well as clade specific genes and gene expansions. As of release v9.1, Phytozome provides access to forty-one sequenced and annotated green plant genomes which have been clustered into gene families at 20 evolutionarily significant nodes. Where possible, each gene has been annotated with PFAM, KOG, KEGG, and PANTHER assignments, and publicly available annotations from RefSeq, UniProt, TAIR, JGI are hyper-linked and searchable., THIS RESOURCE IS NO LONGER IN SERVICE. Documented on September 16,2025.
Proper citation: Phytozome (RRID:SCR_006507) Copy
We at NRSP-8 bioinformatics coordination program strive to serve the animal genomics research community to better use computer tools and methods, to best utilize available resources, and in working with researchers in the community, to effectively share, combine, manage, manipulate, and analyze information from genomics/genetics studies. This site is designed as an information center to serve the national animal genome research projects of cattle, chicken, pigs, sheep, horse, and aquaculture species. This is home to databases and web sites (being) built for structural, functional and application oriented studies of the animal genomics, to serve the purpose of research, education and related activities in the scientific, industrial and educational communities in the states and world wide. The challenges in bioinformatics support/research for animal genomics may involve * Effective data collection, organization and management * Rapid development of most needed bioinformatics tools and resources * Efficient use of these tools for innovative data analysis Projects: * Animal Trait Ontology (ATO) Project * Virtual Comparative Genomics * The Past, the Current, and the Potentials * Collaborative and Hosted Works
Proper citation: NAGRP Bioinformatics Coordination Program (RRID:SCR_006564) Copy
Can't find your Tool?
We recommend that you click next to the search bar to check some helpful tips on searches and refine your search firstly. Alternatively, please register your tool with the SciCrunch Registry by adding a little information to a web form, logging in will enable users to create a provisional RRID, but it not required to submit.
Welcome to the NIF Resources search. From here you can search through a compilation of resources used by NIF and see how data is organized within our community.
You are currently on the Community Resources tab looking through categories and sources that NIF has compiled. You can navigate through those categories from here or change to a different tab to execute your search through. Each tab gives a different perspective on data.
If you have an account on NIF then you can log in from here to get additional features in NIF such as Collections, Saved Searches, and managing Resources.
Here is the search term that is being executed, you can type in anything you want to search for. Some tips to help searching:
You can save any searches you perform for quick access to later from here.
We recognized your search term and included synonyms and inferred terms along side your term to help get the data you are looking for.
If you are logged into NIF you can add data records to your collections to create custom spreadsheets across multiple sources of data.
Here are the sources that were queried against in your search that you can investigate further.
Here are the categories present within NIF that you can filter your data on
Here are the subcategories present within this category that you can filter your data on
If you have any further questions please check out our FAQs Page to ask questions and see our tutorials. Click this button to view this tutorial again.