Are you sure you want to leave this community? Leaving the community will revoke any permissions you have been granted in this community.
SciCrunch Registry is a curated repository of scientific resources, with a focus on biomedical resources, including tools, databases, and core facilities - visit SciCrunch to register your resource.
The Horizontal Gene Transfer DataBase (HGT-DB) is a genomic database that includes statistical parameters such as G+C content, codon and amino-acid usage, as well as information about which genes deviate in these parameters for prokaryotic complete genomes. Under the hypothesis that genes from distantly related species have different nucleotide compositions, these deviated genes may have been acquired by horizontal gene transfer.
Proper citation: Horizontal Gene Transfer-DataBase (RRID:SCR_007706) Copy
GELBANK is a government project that provides an interactive interface for the comparison of 2DE patterns in the context of proteome sequence queries. Only proteomes of species with completed genomes (bacterial genomes, some eukaryotic genomes, human proteome) are presented in the database. The image database also contains not only scanned images, but also modeled gel patterns representing a collection of images (e.g. a master pattern for a sample). 2DE gel patterns are grouped by: tissue type, sample type, staining method used, separation technique used in the first dimension (by charge), the pH-range of the media used in first dimension, technique used in the second dimension (by size). Tools pertinent to the querying of two-dimensional gel-electrophoresis are implemented and integrated into database. When searching for sequences, tools that allow allow the discovery of sequences and alignment of multiple sequences are presented. Individual 2DE gel-patterns can be displayed or a collection of patterns can be animated.
Proper citation: GELBANK (RRID:SCR_007668) Copy
http://spock.genes.nig.ac.jp/~genome/gtop.html
GTOP is a database consists of data analyses of proteins identified by various genome projects. This database mainly uses sequence homology analyses and features extensive utilization of information on three-dimensional structures. GTOP is built by the Laboratory of Gene-Product Informatics at the National Institute of Genetics. This research is supported by the Japan Science and Technology Corporation and Grants-in-Aid for Scientific Research (Genomes in category C) from the Ministry of Education, Science, Sports and Culture of Japan. We use the following methods: Prediction of 3D structure Sequence homology search of PDB, using REVERSE PSI-BLAST. Functional predictions (family classifications) Sequence homology search of Swiss-Prot, a well-annotated sequence database, with the use of BLAST. Other analytical methods We are also carrying out the following analyses: Motif Analysis(PROSITE) Family classification(Pfam) Prediction of transmembrane helix domains(SOSUI) Prediction of coiled-coil regions(Multicoil) Repetitive sequence analysis(RepAlign)
Proper citation: GTOP - Genomes To Protein structures (RRID:SCR_007698) Copy
https://omictools.com/ecgene-tool
Database of functional annotation for alternatively spliced genes. It uses a gene-modeling algorithm that combines the genome-based expressed sequence tag (EST) clustering and graph-theoretic transcript assembly procedures. It contains genome, mRNA, and EST sequence data, as well as a genome browser application. Organisms included in the database are human, dog, chicken, fruit fly, mouse, rhesus, rat, worm, and zebrafish. Annotation is provided for the whole transcriptome, not just the alternatively spliced genes. Several viewers and applications are provided that are useful for the analysis of the transcript structure and gene expression. The summary viewer shows the gene summary and the essence of other annotation programs. The genome browser and the transcript viewer are available for comparing the gene structure of splice variants. Changes in the functional domains by alternative splicing can be seen at a glance in the transcript viewer. Two unique ways of analyzing gene expression is also provided. The SAGE tags deduced from the assembled transcripts are used to delineate quantitative expression patterns from SAGE libraries available publicly. The cDNA libraries of EST sequences in each cluster are used to infer qualitative expression patterns.
Proper citation: ECgene: Gene Modeling with Alternative Splicing (RRID:SCR_007634) Copy
A database of human, chimpanzee, mouse, and rat proteases and protease inhibitors, as well as as the growing number of hereditary diseases caused by mutations in protease genes. Analysis of the human and mouse genomes has allowed us to annotate 581 human, 580 chimpanzee, 667 mouse, and 655 rat protease genes. Proteases are classified in five different classes according to their mechanism of catalysis. Proteases are a diverse and important group of enzymes representing >2% of the human, chimpanzee, mouse and rat genomes. This group of enzymes is implicated in numerous physiological processes. The importance of proteases is illustrated by the existence of 99 different hereditary diseases due to mutations in protease genes. Furthermore, proteases have been implicated in multiple human pathologies, including vascular diseases, rheumatoid arthritis, neurodegenerative processes, and cancer. During the last ten years, our laboratory has identified and characterized more than 60 human protease genes. Due to the importance of proteolytic enzymes in human physiology and pathology, we have recently introduced the concept of Degradome, as the complete repertoire of proteases expressed by a tissue or organism. Thanks to the recent completion of the human, chimpanzee, mouse, and rat genome sequencing projects, we were able to analyze and compare for the first time the complete protease repertoire in those mammalian organisms, as well as the complement of protease inhibitor genes. This webpage also contains the Supplementary Material of Human and mouse proteases: a comparative genomic approach Nat Rev Genet (2003) 4: 544-558, Genome sequence of the brown Norway rat yields insights into mammalian evolution Nature (2004) 428: 493-521, A genomic analysis of rat proteases and protease inhibitors Genome Res. (2004) 14: 609-622, and Comparative genomic analysis of human and chimpanzee proteases Genomics (2005) 86: 638-647.
Proper citation: Mammalian Degradome Database (RRID:SCR_007624) Copy
Datasets and tools for comparative analysis and annotation of all publicly available genomes from three domains of life in a uniquely integrated context. Plasmids that are not part of a specific microbial genome sequencing project and phage genomes are also included in order to increase its genomic context for comparative analysis. The user interface (see User Interface Map) allows navigating the microbial genome data space along its three key dimensions (genes, genomes, and functions), and groups together the main comparative analysis tools. Microbial genome data analysis in IMG usually starts with the definition of an analysis context in terms of selected genomes, functional annotations, and/or genes, followed by the individual or comparative analysis of genomes, functional annotations, or genes.
Proper citation: IMG (RRID:SCR_007733) Copy
https://leger2.helmholtz-hzi.de/cgi-bin/expLeger.pl
Knowledge database and visualization tool for comparative genomics of pathogenic and non-pathogenic Listeria species.Provides information on gene functions (as annotated or supposed by literature from homologous organisms) , protein expression levels under defined experimental conditions ,subcellular localization of proteins (expected and/or experimentally validated) , biological meaning of genes and proteins based on KEGG, InterPro and Gene Ontology.
Proper citation: LEGER: the post-genome Database for Listeria Research (RRID:SCR_007760) Copy
http://phylomedb.bioinfo.cipf.es
Database for phylomes, that is, complete collections of phylogenetic trees for all proteins encoded in a given genome. It aims at providing a repository of high-quality phylogenies and alignments for proteins encoded in model species. To derive a phylome, each protein encoded in a given genome is used as a seed to retrieve its homologs in other complete genomes. These sequences are aligned and processed to derive reliable phylogenies using several phylogenetic methods. Besides providing the evolutionary history of the gene families, phylomeDB includes phylogeny based predictions of orthology and paralogy relationships., THIS RESOURCE IS NO LONGER IN SERVICE. Documented on September 16,2025.
Proper citation: PhylomeDB (RRID:SCR_007850) Copy
http://www.bioinformatics.ucla.edu/ASAP2
THIS RESOURCE IS NO LONGER IN SERVICE, documented on 8/12/13. An expanded version of the Alternative Splicing Annotation Project (ASAP) database with a new interface and integration of comparative features using UCSC BLASTZ multiple alignments. It supports 9 vertebrate species, 4 insects, and nematodes, and provides with extensive alternative splicing analysis and their splicing variants. As for human alternative splicing data, newly added EST libraries were classified and included into previous tissue and cancer classification, and lists of tissue and cancer (normal) specific alternatively spliced genes are re-calculated and updated. They have created a novel orthologous exon and intron databases and their splice variants based on multiple alignment among several species. These orthologous exon and intron database can give more comprehensive homologous gene information than protein similarity based method. Furthermore, splice junction and exon identity among species can be valuable resources to elucidate species-specific genes. ASAP II database can be easily integrated with pygr (unpublished, the Python Graph Database Framework for Bioinformatics) and its powerful features such as graph query, multi-genome alignment query and etc. ASAP II can be searched by several different criteria such as gene symbol, gene name and ID (UniGene, GenBank etc.). The web interface provides 7 different kinds of views: (I) user query, UniGene annotation, orthologous genes and genome browsers; (II) genome alignment; (III) exons and orthologous exons; (IV) introns and orthologous introns; (V) alternative splicing; (IV) isoform and protein sequences; (VII) tissue and cancer vs. normal specificity. ASAP II shows genome alignments of isoforms, exons, and introns in UCSC-like genome browser. All alternative splicing relationships with supporting evidence information, types of alternative splicing patterns, and inclusion rate for skipped exons are listed in separate tables. Users can also search human data for tissue- and cancer-specific splice forms at the bottom of the gene summary page. The p-values for tissue-specificity as log-odds (LOD) scores, and highlight the results for LOD >= 3 and at least 3 EST sequences are all also reported.
Proper citation: Alternative Splicing Annotation Project II Database (RRID:SCR_000322) Copy
Database that provides free online tools to users to allow the retrieval of information related to the Drosophila genome and allows access to genome-wide and related cell-based screening of Drosophila at Harvard Medical School (for a fee) . Tools available include SnapDragon, and RNAi designer, a heat map tool for viewing screen data, and gene and amplicon search and download tools. The DRSC mainly exists to provide Drosophila genome screening services, including help with assay development and optimization, data and image analysis, and planning of follow-up assays.
Proper citation: Drosophila RNAi Screening Center (RRID:SCR_000733) Copy
http://genome.jgi.doe.gov/programs/bacteria-archaea/index.jsf
Mission: Dynamically evolve sequencing, finishing, annotation and analysis processes, exploit new technologies, and develop expertise to deliver high quality and high throughput sequence-based microbial science by listening to and responding to DOE Users and scientific community needs. GOALS 1. Expand product catalog and increase sample throughput while maintaining highest quality The MGP has been expanding its product catalog beyond a finished microbial genome and has projected to significantly up ramp throughput for the majority of its current products namely Draft Genomes, Single Cell Genomes, Quick Draft Genomes, Resequencing projects and RNAseq Project. This projected increase in microbial genomes is going hand-in-hand with and has been stimulated by new high throughput technologies and capabilities (de novo microbial Illumina assemblies, single cell genomics, Genologic sample tracking). The increased throughput will support the user community as well as JGI scientists by enabling DOE-relevant science at a grander scale. As the Program aims to generate hundreds of microbial genomes per year, our goal is to scale our production efficiency and maintain our trademark quality to best support our science mission. 2. Expand sequence space One of the ongoing missions of the MGP is to expand the coverage of the phylogenomic sequence space by generating reference genome datasets from highly diverse braches in bacterial and archaeal tree of life. The value of such effort includes the generation of phylogenetic anchors for metagenomic datasets, the improvement of annotation, an increased insight into phylogenetic distribution of functions, the discovery of novel genes, protein families, pathways and a better understanding on evolutionary diversication. 3. Make Single Cell Genomes a robust User product As the vast majority of microbes are uncultured to date, single cell genomics will be a crucial component of the MGP over the next several years to drive not only JGI science but also User community proposed single cell research. Going hand-in-hand are R&D efforts in selective single cell isolations, testing the effects of fixation of single cell sequencing, as well as single cell transcriptomics. 4. Sequence Pangenomes Combining similar genomes together creating pangenomes will allow more compact genome sequence storage and visualization and expedite analysis and annotation. Moreover, the pangenome as a representation of the whole group of organisms may be more representative of a given species within the environment. The MGP thus thrives to enable the sequencing and analysis of pangenomes. Current technology allows the sequencing of one organism strain at a time. Assuming that for most cases, several dozen strains may need to be sequenced in order to generate a more accurate pangenome for every microbial species, it becomes evident that the cost for doing so may be prohibitively high. Our goal here will be to explore new approaches and technologies for generating these pangenomes at a very low cost and analogous to what is the cost today for a single strain. 5. Expand and improve microbial annotation using transcriptomic data To improve annotation of gene structure, establish accurate transcription level and timing, provide information on gene regulation and generate information for expanding understanding of systems biology, the MGP thieves to generate transcriptomics data for larger sets of Bacteria and/or Archaea. This will enable the identification of novel regulator RNAs, as well as facilitate the understanding of uncharacterized protein families. 6. Maintain and evolve a top quality data management system To enable state of the art and world class comparative analysis of internal and external scientific projects, the JGI data integration and visualization management system for comparative analysis of microbial genomes, namely IMG, needs to be maintained and continuously evolved. The system needs to be able to support and integrate all data generated by JGI (WGS, reseq, RNAseq, -other omics data), as well as by the user community, enabling annotation and manual curation of the annotation, comparative analysis, gene-centric and pathway centric analyzes. The system should also facilitate the interation of associated metadata, enable data sharing and distribution, as well as automated data GenBank submissions. Lastly, the system needs to have the ability to scale enabling the annotation of thousands of genomes per year. 7. Drive Flagship projects To stay at the forefront of microbial genomic research, be recognized as such and enable the development new methods and tools, the MGP aims to drive DOE mission relevant flagship projects. Novel tools and methods developed will ultimately serve the user community if proven useful and implemented as part of a larger pipeline. MGP flagship projects are the GEBA and GEBA uncultured projects, as well as the GEBA-RNB, the proposed Microbial Earth and the Microbial Dark Matter Projects.
Proper citation: Microbial Genetics Resource at JGI (RRID:SCR_000570) Copy
http://genome.igib.res.in/tbvar/
Database of the variome of Mycobacterium tuberculosis (Mtb) comprising of over 29,000 single nucleotide variations created from re-analyzed data sets corresponding to over 400 isolates of Mtb. Using a systematic computational pipeline, potential functional variants and drug-resistance associated variants have been annotated. The database has an option to annotate variants from clinical re-sequencing of Mtb.
Proper citation: tbvar (RRID:SCR_001178) Copy
http://metazoa.ensembl.org/index.html
Ensembl Genomes project produces genome databases for important species from across taxonomic range, using Ensembl software system. Five sites are now available, one of which is Ensembl Metazoa, which houses metazoan species.
Proper citation: Ensembl Metazoa (RRID:SCR_000800) Copy
http://www.jbldesign.com/jmogil/enter.html
Database of genes regulated by pain derived from published manuscripts describing results of pain-relevant knockout studies. The database has two levels of exploration: across-gene and within-gene. The across-gene level, the PainGenesdbSelector, is encountered first. All genes in the database can be accessed and sorted by their gene name, protein name, common names and acronyms, or genomic position (by navigating a graphic representation of the mouse genome). The gene and protein names can be selected from an alphabetical list, or by typing a text string into a search box.
Proper citation: Pain Genes database (RRID:SCR_004771) Copy
http://www.ncbi.nlm.nih.gov/bioproject
Database of biological data related to a single initiative, originating from a single organization or from a consortium. A BioProject record provides users a single place to find links to the diverse data types generated for that project. It is a searchable collection of complete and incomplete (in-progress) large-scale sequencing, assembly, annotation, and mapping projects for cellular organisms. Submissions are supported by a web-based Submission Portal. The database facilitates organization and classification of project data submitted to NCBI, EBI and DDBJ databases that captures descriptive information about research projects that result in high volume submissions to archival databases, ties together related data across multiple archives and serves as a central portal by which to inform users of data availability. BioProject records link to corresponding data stored in archival repositories. The BioProject resource is a redesigned, expanded, replacement of the NCBI Genome Project resource. The redesign adds tracking of several data elements including more precise information about a project''''s scope, material, and objectives. Genome Project identifiers are retained in the BioProject as the ID value for a record, and an Accession number has been added. Database content is exchanged with other members of the International Nucleotide Sequence Database Collaboration (INSDC). BioProject is accessible via FTP.
Proper citation: NCBI BioProject (RRID:SCR_004801) Copy
http://deepbase.sysu.edu.cn/chipbase/
A database for decoding transcription factor binding maps, expression profiles and transcriptional regulation of long non-coding RNAs (lncRNAs, lincRNAs), microRNAs, other ncRNAs (snoRNAs, tRNAs, snRNAs, etc.) and protein-coding genes from ChIP-Seq data. ChIPBase currently includes millions of transcription factor binding sites (TFBSs) among 6 species. ChIPBase provides several web-based tools and browsers to explore TF-lncRNA, TF-miRNA, TF-mRNA, TF-ncRNA and TF-miRNA-mRNA regulatory networks., THIS RESOURCE IS NO LONGER IN SERVICE. Documented on September 16,2025.
Proper citation: ChIPBase (RRID:SCR_005404) Copy
http://genome.jgi.doe.gov/programs/plants/index.jsf
The goal of the DOE JGI Plant Genome Program is to shed light on the fundamental biology of photosynthesis and transduction of solar to chemical energy. Other areas of interest include characterizing: * Ecosystems and the role of terrestrial plants and oceanic phytoplankton-in carbon sequestration. * The role of plants in coping with toxic pollutants in soils by hyper-accumulation and detoxification. * Feedstocks for biofuels, e.g., biodiesel from soybean; cellulosic ethanol from perennial grasses. * The ability to respond to environmental change (e.g., loss of diversity from monoculture produces vulnerabilities; nitrogen fixing nodules in legumes reduce fertilizer need). * The generation of useful secondary metabolites (produced largely for disease resistance)- for positive/negative control in agriculture, with attendant influence on global carbon cycle. The Plant Genome Program accomplishes the above through the following activities: # Sequence. Produce genome sequences of key plant (and algal) species to accelerate biofuel development and understand response to climate change. # Function. Develop datasets (and synthetic biology tools) to elucidate functional elements in plant genomes, with special focus on handful of flagship genomes. # Variation. Characterize natural genomic variation in plants (and their associated microbiomes), and relate to biofuel sustainability and adaptation to climate change. # Integration. Provide a centralized hub for the retrieval and deep integrated analysis of plant genome datasets.
Proper citation: Plant Genome Resource at JGI (RRID:SCR_005315) Copy
Database of computationally predicted Transcription Factors and binding sites in gamma-proteobacterial genomes. The user may browse a map containing all known E. coli transcription factors and regulatory interactions that connect them, and retrieve information on the conservation of each regulatory interaction across the 30 organisms included in the database. Downloading the information is straightforward, and navigation tabs added to dynamic pages ease navigation between the five interfaces of the database. The original prediction approach, based on the representation of binding sites through statistical models was complemented by a new approach that uses known E. coli regulatory sites as the basis for a pattern matching search of regulatory sites. The use of both approaches together resulted in a more intensive exploration of the sequence space of each regulator's binding site. These data should aid researchers in the design of microarray experiments and the interpretation of their results. They should also facilitate studies of Comparative Genomics of the regulatory networks of this group of organisms.
Proper citation: Tractor db (RRID:SCR_005610) Copy
http://www.youtube.com/ncbinlm
Videos from the National Center for Biotechnology Information including presentations and tutorials about NCBI biomolecular and biomedical literature databases and tools.
Proper citation: NCBI YouTube Channel (RRID:SCR_006084) Copy
Collects mammalian cis- and trans-regulatory elements together with experimental evidence. Regulatory elements were mapped on to assembled genomes. Resource for gene regulation and function studies. Users can retrieve primers, search TF target genes, retrieve TF motifs, search Gene Regulatory Networks and orthologs, and make use of sequence analysis tools. Uses databases such as Genbank, EPD and DBTSS, and employ promoter finding program FirstEF combined with mRNA/EST information and cross-species comparisons. Manually curated.
Proper citation: Transcriptional Regulatory Element Database (RRID:SCR_005661) Copy
Can't find your Tool?
We recommend that you click next to the search bar to check some helpful tips on searches and refine your search firstly. Alternatively, please register your tool with the SciCrunch Registry by adding a little information to a web form, logging in will enable users to create a provisional RRID, but it not required to submit.
Welcome to the NIF Resources search. From here you can search through a compilation of resources used by NIF and see how data is organized within our community.
You are currently on the Community Resources tab looking through categories and sources that NIF has compiled. You can navigate through those categories from here or change to a different tab to execute your search through. Each tab gives a different perspective on data.
If you have an account on NIF then you can log in from here to get additional features in NIF such as Collections, Saved Searches, and managing Resources.
Here is the search term that is being executed, you can type in anything you want to search for. Some tips to help searching:
You can save any searches you perform for quick access to later from here.
We recognized your search term and included synonyms and inferred terms along side your term to help get the data you are looking for.
If you are logged into NIF you can add data records to your collections to create custom spreadsheets across multiple sources of data.
Here are the sources that were queried against in your search that you can investigate further.
Here are the categories present within NIF that you can filter your data on
Here are the subcategories present within this category that you can filter your data on
If you have any further questions please check out our FAQs Page to ask questions and see our tutorials. Click this button to view this tutorial again.