SciCrunch Registry is a curated repository of scientific resources, with a focus on biomedical resources, including tools, databases, and core facilities - visit SciCrunch to register your resource.
http://www.genome.jp/kegg/expression/
Database for mapping gene expression profiles to pathways and genomes. Repository of microarray gene expression profile data for Synechocystis PCC6803 (syn), Bacillus subtilis (bsu), Escherichia coli W3110 (ecj), Anabaena PCC7120 (ana), and other species contributed by the Japanese research community.
Proper citation: Kyoto Encyclopedia of Genes and Genomes Expression Database (RRID:SCR_001120)
http://www.well.ox.ac.uk/happy/
THIS RESOURCE IS NO LONGER IN SERVICE. Documented on February 28, 2023. Software package for multipoint QTL mapping in genetically heterogeneous animals (entry from Genetic Analysis Software). The method is implemented in a C program, and there is now an R version of HAPPY. You can run HAPPY remotely from their web server using your own data (or try it out on the data provided for download).
Proper citation: Happy (RRID:SCR_001395)
http://neuronalarchitects.com/ibiofind.html
THIS RESOURCE IS NO LONGER IN SERVICE, documented August 17, 2016. Multi-threaded, parallel desktop application (C#.NET 4.0 WPF / OWL / REST / JSON / SPARQL) that enables the construction of biomedical knowledge through PubMed, ScienceDirect, EndNote and NIH Grant repositories for tracking the work of medical researchers for ranking and recommendations. Users can crawl web sites, build latent semantic indices to generate literature searches for both Clinical Translation Science Award (CTSA) and non-CTSA institutions, examine publications, and build Bayesian networks for neural correlates, gene-to-gene interactions, protein-to-protein interactions, and drug treatment hypotheses. Users can also access potential researcher information, monitor and evolve their networks, and search for possible collaborators and software tools for creating biomedical informatics products. The application is designed to work with the ModelMaker, R, Neural Maestro, Lucene, EndNote and MindGenius applications to improve the quality and quantity of medical research. iBIOFind interfaces with both eNeoTutor and the ModelMaker 2013 Web Services Implementation in .NET for eNeoTutor to help instructors build courses on neuroscience as well as rare diseases. Added features: Rare Disease Explorer, an application module for the visualization of rare disease, gene and protein networks; cinematics for the Image Finder from Yale; the ability to automatically generate and update websites for rare diseases; Cytoscape integration for the construction and visualization of pathways for molecular targets of model organisms; and productivity metrics for medical researchers in rare diseases. The iBIOFind 2013 database now includes over 150 medical schools in the US along with Clinical Translational Science Award institutions for the generation of biomedical knowledge, biomedical informatics and researcher profiles.
Proper citation: iBIOFind (RRID:SCR_001587)
http://www.nitrc.org/projects/plink
Open source whole genome association analysis toolset, designed to perform a range of basic, large-scale analyses in a computationally efficient manner. Used for analysis of genotype/phenotype data. Through integration with gPLINK and Haploview, there is some support for subsequent visualization, annotation and storage of results. PLINK 1.9 is the improved second generation of the software.
Proper citation: PLINK (RRID:SCR_001757)
http://athina.biol.uoa.gr/bioinformatics/GENEVITO/
A Java-based computer application that serves as a workbench for genome-wide analysis through visual interaction. GeneViTo offers an inspectional view of genomic functional elements, drawing on data from both database annotation and analysis tools for an overall analysis of existing genomes. The application deals with various experimental information concerning both DNA and protein sequences (derived from public sequence databases or proprietary data sources) and meta-data obtained by various prediction algorithms, classification schemes or user-defined features. Interaction with a Graphical User Interface (GUI) allows easy extraction of genomic and proteomic data referring to the sequence itself, sequence features, or general structural and functional features. Emphasis is placed on the potential comparison between annotation and prediction data in order to supplement the provided information, especially in cases of poor annotation, or to evaluate available predictions. Moreover, desired information can be output as high quality JPEG image files for further elaboration and scientific use. GeneViTo has already been applied to visualize the genomes of two microbial organisms: the bacterium Chlamydia trachomatis and the archaeon Methanococcus jannaschii. The application is compatible with Linux or Windows ME-2000-XP operating systems, provided that the appropriate Java Runtime Environment (Java 1.4.1) is already installed on the system.
Proper citation: GeneVito (RRID:SCR_006211)
The GMD facilitates the search for and dissemination of mass spectra from biologically active metabolites quantified using gas chromatography (GC) coupled to mass spectrometry (MS). Use the Search Page to search for a compound of interest, using the name, mass, formula, InChI etc. as query input. Additionally, a Library Search service enables the search of user-submitted mass spectra within the GMD. In parallel to the library search, a prediction of chemical sub-groups is performed. This approach has reached beta level and a publication is currently under review. Using several sub-group specific Decision Trees (DTs), mass spectra are classified with respect to the presence of the chemical moieties within the linked (unknown) compound. Prediction of functional groups (MS analysis) facilitates the search of metabolites within the GMD by means of user-submitted GC-MS spectra consisting of retention index (n-alkanes, if available) and mass intensity ratios. In addition, functional group prediction helps to characterize those metabolites that have no reference mass spectra in the GMD so far; instead, the unknown metabolite is characterized by the predicted presence or absence of functional groups. For power users, this functionality is exposed as SOAP-based web services. Functional group prediction of compounds by means of GC-EI-MS spectra uses Microsoft Analysis Services decision trees. The GMD interface provides a table of all currently available trained decision trees and sub-structure predictions. The table describes the functional group, optional use of an RI system, record date of the trained decision tree, and number of MSTs with the proportion of MSTs linked to metabolites with the functional group present for each tree. The average and standard deviation of the 50-fold CV error, namely the ratio of falsely to correctly sorted MSTs in the trained DT, are listed.
The GMD website offers a range of mass spectral reference libraries to academic users which can be downloaded free of charge in various electronic formats. The libraries are constituted by base peak normalized consensus spectra of single analytes and contain masses in the range 70 to 600 amu, while the ubiquitous mass fragments typically generated from compounds carrying a trimethylsilyl-moiety, namely the fragments at m/z 73, 74, 75, 147, 148, and 149, were excluded.
Proper citation: GMD (RRID:SCR_006625)
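The library construction described above (base peak normalization, a 70 to 600 amu mass range, and exclusion of the trimethylsilyl fragments at m/z 73, 74, 75, 147, 148 and 149) can be sketched as follows. This is a minimal illustration, not GMD code: the function name `normalize_spectrum`, the dictionary input format, the scaling of the base peak to 100, and the choice to filter before picking the base peak are all assumptions.

```python
# Hypothetical helper for illustration; not part of any GMD tooling.
# m/z values excluded from the GMD libraries (trimethylsilyl fragments).
TMS_FRAGMENTS = {73, 74, 75, 147, 148, 149}


def normalize_spectrum(peaks, low=70, high=600, excluded=TMS_FRAGMENTS):
    """Filter a spectrum and scale it so the base peak equals 100.

    peaks: dict mapping integer m/z to raw intensity (assumed format).
    Assumption: masses are filtered first, then the base peak is taken
    from the remaining peaks.
    """
    kept = {
        mz: intensity
        for mz, intensity in peaks.items()
        if low <= mz <= high and mz not in excluded
    }
    if not kept:
        return {}
    base = max(kept.values())
    if base <= 0:
        return {}
    return {mz: 100.0 * intensity / base for mz, intensity in sorted(kept.items())}


raw = {73: 1000, 100: 50, 147: 400, 200: 200, 650: 10}
print(normalize_spectrum(raw))
```

In this example the peaks at m/z 73 and 147 are dropped as trimethylsilyl fragments and m/z 650 falls outside the mass range, so the peak at m/z 200 becomes the base peak.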
http://www.cdc.gov/genomics/default.htm
The Office of Public Health Genomics (OPHG) aims to integrate genomics into public health research, policy, and programs. Doing so could improve interventions designed to prevent and control the country's leading chronic, infectious, environmental, and occupational diseases. OPHG's efforts focus on conducting population-based genomic research, assessing the role of family health history in disease risk and prevention, supporting a systematic process for evaluating genetic tests, translating genomics into public health research and programs, and strengthening capacity for public health genomics in disease prevention programs. Goals: To improve public health interventions of diseases of major public health importance, including chronic, infectious, environmental, and occupational diseases, through six major initiatives: * Evaluation of Genomic Applications in Practice and Prevention (EGAPP), * Human Genome Epidemiology Network (HuGENet), * NHANES Collaborative Genomics Project, * Family History Public Health Initiative, * Genomics Translation Research and Programs, and * Genomic Applications in Practice and Prevention Network (GAPPNet).
Proper citation: Public Health Genomics (RRID:SCR_006462)
https://www.fludb.org/brc/home.spg?decorator=influenza
The Influenza Research Database (IRD) serves as a public repository and analysis platform for flu sequence, experiment, surveillance and related data.
Proper citation: Influenza Research Database (IRD) (RRID:SCR_006641)
DPVweb provides a central source of information about viruses, viroids and satellites of plants, fungi and protozoa. Comprehensive taxonomic information, including brief descriptions of each family and genus, and classified lists of virus sequences are provided. The database also holds detailed, curated information for all sequences of viruses, viroids and satellites of plants, fungi and protozoa that are complete or that contain at least one complete gene. For comparative purposes, it also contains a single representative sequence of all other fully sequenced virus species with an RNA or single-stranded DNA genome. The start and end positions of each feature (gene, non-translated region and the like) have been recorded and checked for accuracy. As far as possible, nomenclature for genes and proteins is standardized within genera and families. Sequences of features (either as DNA or amino acid sequences) can be directly downloaded from the website in FASTA format. The sequence information can also be accessed via client software for PCs (freely downloadable from the website) that enables users to easily select sequences and features of a chosen virus for further analyses. The public sequence databases contain vast amounts of data on virus genomes, but accessing and comparing the data, except for relatively small sets of related viruses, can be very time consuming. The procedure is made difficult because some of the sequences in these databases are incorrectly named, poorly annotated or redundant. The NCBI Reference Sequence project (1) provides a comprehensive, integrated, non-redundant set of sequences, including genomic DNA, transcript (RNA) and protein products, for major research organisms. This now includes curated information for a single sequence of each fully sequenced virus species. While this is a welcome development, it can only deal with complete sequences.
An important feature of DPV is the opportunity to access genes (and other features) of multiple sequences quickly and accurately. Thus, for example, it is easy to obtain the nucleotide or amino acid sequences of all the available accessions of the coat protein gene of a given virus species or for a group of viruses. To increase its usefulness further, DPVweb also contains a single representative sequence of all other fully sequenced virus species with an RNA or single-stranded DNA (ssDNA) genome. Sponsors: This site is supported by the Association of Applied Biologists and the Zhejiang Academy of Agricultural Sciences, Hangzhou, People's Republic of China.
Proper citation: Descriptions of Plant Viruses (RRID:SCR_006656)
http://bioconductor.org/packages/bioc/html/GeneAnswers.html
GeneAnswers provides an integrated tool for the biological or medical interpretation of a given set of genes. It includes statistical tests of the given genes against specified categories. Microarray techniques have been widely employed in genomic-scale studies for more than a decade. The standard analysis of microarray data filters a group of genes out of thousands of probes by certain statistical criteria; these genes are usually called significantly differentially expressed genes. More recently, next generation sequencing (NGS) has gradually been adopted to explore gene transcription, methylation, etc., and a gene list can also be obtained from preliminary NGS data analysis. However, this type of information is not enough to understand the potential linkage between the identified genes and the functions of interest. Integrated functional and pathway analysis with gene expression data helps researchers interpret the relationship between the identified genes and proposed biological or medical functions and pathways. The GeneAnswers package provides an integrated solution for a group of genes and specified categories (biological or medical functions, such as Gene Ontology, Disease Ontology, KEGG, etc.) to reveal the potential relationship between them by means of statistical methods, and produces user-friendly network visualizations to interpret the results. In addition, the package can combine gene expression profiles and category analysis by outputting concept-gene cross tables; keyword queries on NCBI Entrez Gene and the application of human-based Disease Ontology analysis to genes from other species can help users understand or discover potential connections between genes and functions. Sponsors: This project was supported in part by Award Number UL1RR025741 from the National Center for Research Resources.
Proper citation: GeneAnswers (RRID:SCR_006498)
http://inparanoid.sbc.su.se/cgi-bin/index.cgi
Collection of pairwise comparisons between 100 whole genomes generated by a fully automatic method for finding orthologs and in-paralogs between two species. Ortholog clusters in InParanoid are seeded with a two-way best pairwise match, after which an algorithm for adding in-paralogs is applied. The method bypasses multiple alignments and phylogenetic trees, which can be slow and error-prone steps in classical ortholog detection. Still, it robustly detects complex orthologous relationships and assigns confidence values for in-paralogs. The original data sets can be downloaded.
Proper citation: InParanoid: Eukaryotic Ortholog Groups (RRID:SCR_006801)
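The seeding step described above, a two-way best pairwise match between two species, can be illustrated with a short sketch. It covers only the reciprocal-best-hit idea; it is not InParanoid's implementation, it does not add in-paralogs or confidence values, and the function names and the score dictionaries (keyed by query and target identifiers) are hypothetical.

```python
def best_hits(scores):
    """Map each query to its highest-scoring target.

    scores: dict mapping (query, target) pairs to a similarity score
    (assumed input format for this sketch).
    """
    best = {}
    for (query, target), score in scores.items():
        if query not in best or score > best[query][1]:
            best[query] = (target, score)
    return {query: target for query, (target, _) in best.items()}


def reciprocal_best_hits(a_vs_b, b_vs_a):
    """Return sorted (a, b) pairs that are each other's best hit."""
    forward = best_hits(a_vs_b)   # species A proteins searched against B
    reverse = best_hits(b_vs_a)   # species B proteins searched against A
    return sorted((a, b) for a, b in forward.items() if reverse.get(b) == a)


a_vs_b = {("a1", "b1"): 90, ("a1", "b2"): 50, ("a2", "b2"): 80, ("a3", "b1"): 70}
b_vs_a = {("b1", "a1"): 88, ("b2", "a2"): 79, ("b2", "a1"): 40}
print(reciprocal_best_hits(a_vs_b, b_vs_a))
```

Here `a3` finds `b1` as its best match, but `b1` prefers `a1`, so only the pairs `(a1, b1)` and `(a2, b2)` would seed clusters.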
A web-based application designed from a genetic epidemiology point of view to analyze association studies using single nucleotide polymorphisms (SNPs). For each selected SNP, you will receive: * Allele and genotype frequencies * Test for Hardy-Weinberg equilibrium * Analysis of association with a response variable based on linear or logistic regression * Multiple inheritance models: co-dominant, dominant, recessive, over-dominant and additive * Analysis of interactions (gene-gene or gene-environment) If multiple SNPs are selected: * Linkage disequilibrium statistics * Haplotype frequency estimation * Analysis of association of haplotypes with the response * Analysis of interactions (haplotypes-covariate)
Proper citation: SNPSTATS (RRID:SCR_002142)
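As an illustration of one of the per-SNP outputs listed above, the following sketch computes allele frequencies and a chi-square goodness-of-fit statistic for Hardy-Weinberg equilibrium from genotype counts. It is a textbook calculation under stated assumptions, not necessarily the test SNPStats itself runs; the function name `hwe_chi_square` and its arguments are hypothetical.

```python
def hwe_chi_square(n_aa, n_ab, n_bb):
    """Chi-square statistic for Hardy-Weinberg equilibrium at a biallelic SNP.

    n_aa, n_ab, n_bb: observed counts of the AA, AB and BB genotypes.
    Returns (p, q, chi2), where p and q are the A and B allele frequencies.
    The statistic has 1 degree of freedom; values above about 3.84
    correspond to p < 0.05.
    """
    n = n_aa + n_ab + n_bb
    if n == 0:
        raise ValueError("at least one genotype must be observed")
    p = (2 * n_aa + n_ab) / (2 * n)   # frequency of allele A
    q = 1.0 - p                       # frequency of allele B
    expected = (n * p * p, 2 * n * p * q, n * q * q)
    observed = (n_aa, n_ab, n_bb)
    chi2 = sum((o - e) ** 2 / e for o, e in zip(observed, expected) if e > 0)
    return p, q, chi2


print(hwe_chi_square(25, 50, 25))   # counts that match HWE expectations
print(hwe_chi_square(50, 0, 50))    # no heterozygotes: strong deviation
```

For small samples or rare alleles an exact test is generally preferred over the chi-square approximation.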
Database of genetic and molecular biological information about the filamentous fungi of the genus Aspergillus including information about genes and proteins of Aspergillus nidulans and Aspergillus fumigatus; descriptions and classifications of their biological roles, molecular functions, and subcellular localizations; gene, protein, and chromosome sequence information; tools for analysis and comparison of sequences; and links to literature information; as well as a multispecies comparative genomics browser tool (Sybil) for exploration of orthology and synteny across multiple sequenced Aspergillus species. Also available are Gene Ontology (GO) and community resources. Based on the Candida Genome Database, the Aspergillus Genome Database is a resource for genomic sequence data and gene and protein information for Aspergilli. Among its many species, the genus contains an excellent model organism (A. nidulans, or its teleomorph Emericella nidulans), an important pathogen of the immunocompromised (A. fumigatus), an agriculturally important toxin producer (A. flavus), and two species used in industrial processes (A. niger and A. oryzae). Search options allow you to: *Search the AspGD database using keywords. *Find chromosomal features that match specific properties or annotations. *Find AspGD web pages using keywords located on the page. *Find information on one gene from many databases. *Search for keywords related to a phenotype (e.g., conidiation), an allele (such as veA1), or an experimental condition (e.g., light). Analysis and Tools allow you to: *Find similarities between a sequence of interest and Aspergillus DNA or protein sequences. *Display and analyze an Aspergillus sequence (or other sequence) in many ways. *Navigate the chromosome set. *View nucleotide and protein sequences. *Find short DNA/protein sequence matches in Aspergillus. *Design sequencing and PCR primers for Aspergillus or other input sequences. *Display the restriction map for an Aspergillus or other input sequence. *Find similarities between a sequence of interest and fungal nucleotide or protein sequences. AspGD welcomes data submissions.
Proper citation: ASPGD (RRID:SCR_002047)
http://www.ncbi.nlm.nih.gov/HTGS/
Database of high-throughput genome sequences from large-scale genome sequencing centers, including unfinished and finished sequences. It was created to accommodate a growing need to make unfinished genomic sequence data rapidly available to the scientific community in a coordinated effort among the International Nucleotide Sequence databases: DDBJ, EMBL, and GenBank. Sequences are prepared for submission by using NCBI's software tools Sequin or tbl2asn. Each center has an FTP directory into which new or updated sequence files are placed. Sequence data in this division are available for BLAST homology searches against either the htgs database or the month database, which includes all new submissions for the prior month. Unfinished HTG sequences containing contigs greater than 2 kb are assigned an accession number and deposited in the HTG division. A typical HTG record might consist of all the first-pass sequence data generated from a single cosmid, BAC, YAC, or P1 clone, which together make up more than 2 kb and contain one or more gaps. A single accession number is assigned to this collection of sequences, and each record includes a clear indication of the status (phase 1 or 2) plus a prominent warning that the sequence data are unfinished and may contain errors. The accession number does not change as sequence records are updated; only the most recent version of an HTG record remains in GenBank.
Proper citation: High Throughput Genomic Sequences Division (RRID:SCR_002150)
The UCLA-DOE Institute for Genomics and Proteomics carries out research in bioenergy, structural biology, genomics and proteomics, consistent with the research mission of the United States Department of Energy. Major interests of the 12 Principal Investigators and 9 Associate Members include systems approaches to organisms, structural biology, bioinformatics, and bioenergetic systems. The Institute sponsors 5 Core Technology Centers, for X-ray and NMR structural determination, bioinformatics and computation, protein expression and purification, and biochemical instrumentation. Services offered by this Institute: - Databases: * DIP (The Database of Interacting Proteins): The DIP database catalogs experimentally determined interactions between proteins. It combines information from a variety of sources to create a single, consistent set of protein-protein interactions. * ProLinks Database of Functional Linkages: The Prolinks database is a collection of inference methods used to predict functional linkages between proteins. These methods include the Phylogenetic Profile method, which uses the presence and absence of proteins across multiple genomes to detect functional linkages; the Gene Cluster method, which uses genome proximity to predict functional linkage; Rosetta Stone, which uses a gene fusion event in a second organism to infer functional relatedness; and the Gene Neighbor method, which uses both gene proximity and phylogenetic distribution to infer linkage. - Data-to-Structure Servers: * SAVEs Structure Verification Server * Merohedral Twinning Test Server * SER Surface Entropy Reduction Server * VERIFY3D Structure Verification Server * ERRAT Structure Verification Server - Structure-to-Function Servers: * ProKnow Protein Functionator * Hot Patch Functional Site Locator
Proper citation: University of California at Los Angeles - Department of Energy Institute for Genomics and Proteomics (RRID:SCR_001921)
The original SAMTOOLS package has been split into three separate repositories: Samtools, BCFtools and HTSlib. Samtools manipulates next generation sequencing data and is used for reading, writing, editing, indexing, and viewing nucleotide alignments in SAM, BAM, and CRAM formats. BCFtools is used for reading and writing BCF2, VCF, and gVCF files and for calling, filtering, and summarising SNP and short indel sequence variants. HTSlib is used for reading and writing high-throughput sequencing data.
Proper citation: SAMTOOLS (RRID:SCR_002105)
Bioinformatics platform for storing, organizing, processing, and sharing genomic and other biomedical big data. Designed to make it easier for bioinformaticians to develop analyses, developers to create genomic web applications, and IT administrators to manage large-scale compute and storage genomic resources. Designed to run on top of cloud operating systems such as Amazon Web Services and OpenStack. Currently, there are implementations that work on AWS and Xen+Debian/Ubuntu. Functionally, Arvados has two major sets of capabilities: (a) data management and (b) compute management.
Proper citation: Arvados (RRID:SCR_002223)
Portal for studies of genome structure and genetic variation, gene expression and gene function. Provides services including DNA sequencing of model and non-model genomes using both Next Generation and Sanger sequencing; gene expression analysis using both microarrays and Next Generation Sequencing; high-throughput genotyping of SNP and copy number variants; data collection and analysis supported by in-house high performance computing facilities and expertise; extensive EST clone collections for a number of animal species; all commercially available microarray tools from Affymetrix, Illumina, Agilent and Nimblegen; and parentage testing using microsatellites and smaller SNP panels. ARK-Genomics has developed a network of researchers whom they support through each stage of their genomics research, from grant application, experimental design and technology selection, through performing wet laboratory protocols, to analysis of data, often in conjunction with commercial partners.
Proper citation: ARK-Genomics: Centre for Functional Genomics (RRID:SCR_002214)
http://sourceforge.net/projects/gmato/files/?source=navbar
A software tool used for simple sequence repeats (SSR) or microsatellite characterization. It also facilitates SSR marker design on a genomic scale, microsatellite mining at any length, and comprehensive statistical analysis for DNA sequences in any genome at any size. Analysis parameters are customizable.
Proper citation: GMATo (RRID:SCR_000165)
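To make the idea of microsatellite mining with customizable parameters concrete, here is a minimal sketch that scans a DNA string for simple sequence repeats using a regular expression. It is not GMATo's algorithm or interface; the function name `find_ssrs`, its parameters and their defaults (motif length 1 to 6, at least 5 repeats) are assumptions chosen for illustration.

```python
import re


def find_ssrs(seq, min_motif=1, max_motif=6, min_repeats=5):
    """Find perfect tandem repeats (SSRs) in a DNA sequence.

    Returns a sorted list of (start, motif, repeat_count) tuples,
    where start is the 0-based position of the repeat tract.
    """
    seq = seq.upper()
    hits = []
    for k in range(min_motif, max_motif + 1):
        # A k-base motif followed by at least (min_repeats - 1) copies of itself.
        pattern = re.compile(r"([ACGT]{%d})\1{%d,}" % (k, min_repeats - 1))
        for match in pattern.finditer(seq):
            motif = match.group(1)
            # Skip motifs that are just a shorter motif repeated (e.g. ATAT).
            if any(motif == motif[:d] * (k // d) for d in range(1, k) if k % d == 0):
                continue
            hits.append((match.start(), motif, len(match.group(0)) // k))
    return sorted(hits)


sequence = "TTG" + "AT" * 6 + "CCGT" + "CAG" * 5 + "T"
for start, motif, count in find_ssrs(sequence):
    print(start, motif, count)
```

Lowering `min_repeats` or widening the motif range changes which tracts are reported, which is the kind of parameter customization the description refers to.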
http://www.europeanlung.org/en/projects-and-research/projects/airprom/
Consortium focused on developing computer and physical models of the airway system for patients with asthma and chronic obstructive pulmonary disease (COPD). Developing accurate models will better predict how asthma and COPD develop, since current methods can only assess the severity of disease. They aim to bridge the gaps in clinical management of airways-based disease by providing reliable models that predict disease progression and the response to treatment for each person with asthma or COPD. A data management platform provides a secure and sustainable infrastructure that semantically integrates the clinical, physiological, genetic, and experimental data produced with existing biomedical knowledge from allied consortia and public databases. This resource will be available for analysis and modeling, and will facilitate sharing, collaboration and publication within AirPROM and with the broader community. Currently the AirPROM knowledge portal is only accessible by AirPROM partners.
Proper citation: AirPROM (RRID:SCR_003827)