Are you sure you want to leave this community? Leaving the community will revoke any permissions you have been granted in this community.
SciCrunch Registry is a curated repository of scientific resources, with a focus on biomedical resources, including tools, databases, and core facilities - visit SciCrunch to register your resource.
The goals of Antibiotic Resistance Genes Database (ARGB) are to provide a centralized compendium of information on antibiotic resistance, to facilitate the consistent annotation of resistance information in newly sequenced organisms, and also to facilitate the identification and characterization of new genes. ARGB contains six types of database groups: - Resistance Type: This database contains information, such as resistance profile, mechanism, requirement, epidemiology for each type. - Resistance Gene: This database contains information, such as resistance profile, resistance type, requirement, protein and DNA sequence for each gene.This database only includes NON-REDUNDANT, NON-VECTOR, COMPLETE genes. - Antibiotic: This database contains information, such as producer, action mechanism, resistance type, for each gene. - Resistance Gene(NonRD): This database contains the same information as Resistance Gene. It does NOT include NON-REDUNDANT, NON-VECTOR genes, but includes INCOMPLETE genes. - Resistance Gene(ALL): This database contains the same information as Resistance Gene. It includes all REDUNDANT, VECTOR AND INCOMPLETE genes. - Resistance Species: This database contains resistance profile and corresponding resistance genes for each species. Furthermore, ARDB also contians three types BLAST database: - Resistance Genes Complete: Contains only NON-REDUNDANT, NON-VECTOR, COMPLETE genes sequences. - Resistance Genes Non-redundant: Contains NON-REDUNDANT, NON-VECTOR, COMPLETE, INCOMPLETE genes sequences. - Resistance Genes All: Contains all REDUNDANT, VECTOR, COMPLETE, INCOMPLETE genes sequences. Lastly, ARDB provides four types of Analytical tools: - Normal BLAST: This function allows an user to input a DNA or protein sequence, and find similar DNA (Nucleotide BLAST) or protein (Protein BLAST) sequences using blastn, blastp, blastx, tblastn, tblastx - RPS BLAST: A web RPSBLAST (RPS BLAST) interface is provided to align a query sequence against the Position Specific Scoring Matrix (PSSM) for each type. Normally, this will give the same annotation information as using regular BLAST mentioned above. - Multiple Sequences BLAST (Genome Annotation): This function allows an user to annotate multiple (less than 5000) query sequences in FASTA format. - Mutation Resistance Identification: This function allows an user to identify mutations that will cause potential antibiotic resistance, for 12 genes (16S rRNA, 23S rRNA, gyrA, gyrB, parC, parE, rpoB, katG, pncA, embB, folP, dfr). ������ :Sponsors: ARDB is funded by Uniformed Services University of the Health Sciences, administered by the Henry Jackson Foundation. :
Proper citation: Antibiotic Resistance Genes Database (RRID:SCR_007040) Copy
http://www.genoscope.cns.fr/externe/tetraodon/
The initial objective of Genoscope was to compare the genomic sequences of this fish to that of humans to help in the annotation of human genes and to estimate their number. This strategy is based on the common genetic heritage of the vertebrates: from one species of vertebrate to another, even for those as far apart as a fish and a mammal, the same genes are present for the most part. In the case of the compact genome of Tetraodon, this common complement of genes is contained in a genome eight times smaller than that of humans. Although the length of the exons is similar in these two species, the size of the introns and the intergenic sequences is greatly reduced in this fish. Furthermore, these regions, in contrast to the exons, have diverged completely since the separation of the lineages leading to humans and Tetraodon. The Exofish method, developed at Genoscope, exploits this contrast such that the conserved regions which can be identified by comparing genomic sequences of the two species, correspond only to coding regions. Using preliminary sequencing results of the genome of Tetraodon in the year 2000, Genoscope evaluated the number of human genes at about 30,000, whereas much higher estimations were current. The progress of the annotation of the human genome has since supported the Genoscope hypothesis, with values as low as 22,000 genes and a consensus of around 25,000 genes. The sequencing of the Tetraodon genome at a depth of about 8X, carried out as a collaboration between Genoscope and the Whitehead Institute Center for Genome Research (now the Broad Institute), was finished in 2002, with the production of an assembly covering 90 of the euchromatic region of the genome of the fish. This has permitted the application of Exofish at a larger scale in comparisons with the genome of humans, but also with those of the two other vertebrates sequenced at the time (Takifugu, a fish closely related to Tetraodon, and the mouse). The conserved regions detected in this way have been integrated into the annotation procedure, along with other resources (cDNA sequences from Tetraodon and ab initio predictions). Of the 28,000 genes annotated, some families were examined in detail: selenoproteins, and Type 1 cytokines and their receptors. The comparison of the proteome of Tetraodon with those of mammals has revealed some interesting differences, such as a major diversification of some hormone systems and of the collagen molecules in the fish. A search for transposable elements in the genomic sequences of Tetraodon has also revealed a high diversity (75 types), which contrasts with their scarcity; the small size of the Tetraodon genome is due to the low abundance of these elements, of which some appear to still be active. Another factor in the compactness of the Tetraodon genome, which has been confirmed by annotation, is the reduction in intron size, which approaches a lower limit of 50-60 bp, and which preferentially affects certain genes. The availability of the sequences from the genomes of humans and mice on one hand, and Takifugu and Tetraodon on the other, provide new opportunities for the study of vertebrate evolution. We have shown that the level of neutral evolution is higher in fish than in mammals. The protein sequences of fish also diverge more quickly than those of mammals. A key mechanism in evolution is gene duplication, which we have studied by taking advantage of the anchoring of the majority of the sequences from the assembly on the chromosomes. The result of this study speaks strongly in favor of a whole genome duplication event, very early in the line of ray-finned fish (Actinopterygians). An even stronger evidence came from synteny studies between the genomes of humans and Tetraodon. Using a high-resolution synteny map, we have reconstituted the genome of the vertebrate which predates this duplication - that is, the last common ancestor to all bony vertebrates (most of the vertebrates apart from cartilaginous fish and agnaths like lamprey). This ancestral karyotype contains 12 chromosomes, and the 21 Tetraodon chromosomes derive from it by the whole genome duplication and a surprisingly small number of interchromosomal rearrangements. On the contrary, exchanges between chromosomes have been much more frequent in the lineage that leads to humans. Sponsors: The project was supported by the Consortium National de Recherche en Genomique and the National Human Genome Research Institute.
Proper citation: Tetraodon Genome Browser (RRID:SCR_007079) Copy
https://www.mc.vanderbilt.edu/victr/dcc/projects/acc/index.php/Main_Page
A national consortium formed to develop, disseminate, and apply approaches to research that combine DNA biorepositories with electronic medical record (EMR) systems for large-scale, high-throughput genetic research. The consortium is composed of seven member sites exploring the ability and feasibility of using EMR systems to investigate gene-disease relationships. Themes of bioinformatics, genomic medicine, privacy and community engagement are of particular relevance to eMERGE. The consortium uses data from the EMR clinical systems that represent actual health care events and focuses on ethical issues such as privacy, confidentiality, and interactions with the broader community.
Proper citation: eMERGE Network: electronic Medical Records and Genomics (RRID:SCR_007428) Copy
http://genome.wustl.edu/projects/detail/human-gut-microbiome/
THIS RESOURCE IS NO LONGER IN SERVICE. Documented on August 19,2022. Human Gut Microbiome Initiative (HGMI) seeks to provide simply annotated, deep draft genome sequences for 100 cultured representatives of the phylogenetic diversity documented by 16S rRNA surveys of the human gut microbiota. Humans are supra-organisms, composed of 10 times more microbial cells than human cells. Therefore, it seems appropriate to consider ourselves as a composite of many species - human, bacterial, and archaeal - and our genome as an amalgamation of human genes and the genes in ''our'' microbial genomes (''microbiome''). In the same sense, our metabolome can be considered to be a synthesis of co-evolved human and microbial traits. The total number of genes present in the human microbiome likely exceeds the number of our H. sapiens genes by orders of magnitude. Thus, without an understanding of our microbiota and microbiome, it not possible to obtain a complete picture of our genetic diversity and of our normal physiology. Our intestine is home to our largest collections of microbes: bacterial densities in the colon (up to 1 trillion cells/ml of luminal contents) are the highest recorded for any known ecosystem. The vast majority of phylogenetic types in the distal gut microbiota belong to just two divisions (phyla) of the domain Bacteria - the Bacteroidetes and the Firmicutes. Members of eight other divisions have also been identified using culture-independent 16S rRNA gene-based surveys. Metagenomic studies of complex microbial communities residing in our various body habitats are limited by the availability of suitable reference genomes for confident assignment of short sequence reads generated by highly parallel DNA sequencers, and by knowledge of the professions (niches) of community members. Therefore, HGMI, which represents a collaboration between Washington University''s Genome Center and its Center for Genome Sciences, seeks to provide simply annotated, deep draft genome sequences for 100 cultured representatives of the phylogenetic diversity documented by 16S rRNA surveys of the human gut microbiota.
Proper citation: Human Gut Microbiome Initiative (RRID:SCR_008137) Copy
http://www.gene-regulation.com/pub/databases.html
In an effort to strongly support the collaborative nature of scientific research, BIOBASE offers academic and non-profit organizations free access to reduced functionality versions of their products. TRANSFAC Professional provides gene regulation analysis solutions, offering the most comprehensive collection of eukaryotic gene regulation data. The professional paid subscription gives customers access to up-to-date data and tools not available in the free version. The public databases currently available for academic and non-profit organizations are: * TRANSFAC: contains data on transcription factors, their experimentally-proven binding sites, and regulated genes. Its broad compilation of binding sites allows the derivation of positional weight matrices. * TRANSPATH: provides data about molecules participating in signal transduction pathways and the reactions they are involved in, resulting in a complex network of interconnected signaling components.TRANSPATH focuses on signaling cascades that change the activities of transcription factors and thus alter the gene expression profile of a given cell. * PathoDB: is a database on pathologically relevant mutated forms of transcription factors and their binding sites. It comprises numerous cases of defective transcription factors or mutated transcription factor binding sites, which are known to cause pathological defects. * S/MARt DB: presents data on scaffold or matrix attached regions (S/MARs) of eukaryotic genomes, as well as about the proteins that bind to them. S/MARs organize the chromatin in the form of functionally independent loop domains gained increasing support. Scaffold or Matrix Attached Regions (S/MARs) are genomic DNA sequences through which the chromatin is tightly attached to the proteinaceous scaffold of the nucleus. * TRANSCompel: is a database on composite regulatory elements affecting gene transcription in eukaryotes. Composite regulatory elements consist of two closely situated binding sites for distinct transcription factors, and provide cross-coupling of different signaling pathways. * PathoSign Public: is a database which collects information about defective cell signaling molecules causing human diseases. While constituting a useful data repository in itself, PathoSign is also aimed at being a foundational part of a platform for modeling human disease processes.
Proper citation: Gene Regulation Databases (RRID:SCR_008033) Copy
http://ncv.unl.edu/Angelettilab/HPV/Database.html
THIS RESOURCE IS NO LONGER IN SERVICE, documented May 10, 2017. A pilot effort that has developed a centralized, web-based biospecimen locator that presents biospecimens collected and stored at participating Arizona hospitals and biospecimen banks, which are available for acquisition and use by researchers. Researchers may use this site to browse, search and request biospecimens to use in qualified studies. The development of the ABL was guided by the Arizona Biospecimen Consortium (ABC), a consortium of hospitals and medical centers in the Phoenix area, and is now being piloted by this Consortium under the direction of ABRC. You may browse by type (cells, fluid, molecular, tissue) or disease. Common data elements decided by the ABC Standards Committee, based on data elements on the National Cancer Institute''s (NCI''s) Common Biorepository Model (CBM), are displayed. These describe the minimum set of data elements that the NCI determined were most important for a researcher to see about a biospecimen. The ABL currently does not display information on whether or not clinical data is available to accompany the biospecimens. However, a requester has the ability to solicit clinical data in the request. Once a request is approved, the biospecimen provider will contact the requester to discuss the request (and the requester''s questions) before finalizing the invoice and shipment. The ABL is available to the public to browse. In order to request biospecimens from the ABL, the researcher will be required to submit the requested required information. Upon submission of the information, shipment of the requested biospecimen(s) will be dependent on the scientific and institutional review approval. Account required. Registration is open to everyone., documented August 23, 2016. The Human Papillomaviruses Database collects, curates, analyzes, and publishes genetic sequences of papillomaviruses and related cellular proteins. It includes molecular biologists, sequence analysts, computer technicians, post-docs and graduate research assistants. This Web site has two main branches. The first contains our four annual data books of papillomavirus information, called Human Papillomaviruses: A Compilation and Analysis of Nucleic Acid and Amino Acid Sequences. and the second contains papillomavirus genetic sequence data. There is also a New Items location where we store the latest changes to the database or any other current news of interest. Besides the compendium, we also provide genetic sequence information for papilloma viruses and related cellular proteins. Each year they publish a compendium of papillomavirus information called Human Papillomaviruses: A Compilation and Analysis of Nucleic Acid and Amino Acid Sequences. which can now be downloaded from this Web site.
Proper citation: HPV Sequence Database (RRID:SCR_008154) Copy
http://www.animalgenome.org/pigs/nagrp.html
Database and resources on the pig genome.
Proper citation: U.S. Pig Genome Project (RRID:SCR_008151) Copy
DPVweb provides a central source of information about viruses, viroids and satellites of plants, fungi and protozoa. Comprehensive taxonomic information, including brief descriptions of each family and genus, and classified lists of virus sequences are provided. The database also holds detailed, curated, information for all sequences of viruses, viroids and satellites of plants, fungi and protozoa that are complete or that contain at least one complete gene. For comparative purposes, it also contains a single representative sequence of all other fully sequenced virus species with an RNA or single-stranded DNA genome. The start and end positions of each feature (gene, non-translated region and the like) have been recorded and checked for accuracy. As far as possible, nomenclature for genes and proteins are standardized within genera and families. Sequences of features (either as DNA or amino acid sequences) can be directly downloaded from the website in FASTA format. The sequence information can also be accessed via client software for PC computers (freely downloadable from the website) that enable users to make an easy selection of sequences and features of a chosen virus for further analyses. The public sequence databases contain vast amounts of data on virus genomes but accessing and comparing the data, except for relatively small sets of related viruses can be very time consuming. The procedure is made difficult because some of the sequences on these databases are incorrectly named, poorly annotated or redundant. The NCBI Reference Sequence project (1) provides a comprehensive, integrated, non-redundant set of sequences, including genomic DNA, transcript (RNA) and protein products, for major research organisms. This now includes curated information for a single sequence of each fully sequenced virus species. While this is a welcome development, it can only deal with complete sequences. An important feature of DPV is the opportunity to access genes (and other features) of multiple sequences quickly and accurately. Thus, for example, it is easy to obtain the nucleotide or amino acid sequences of all the available accessions of the coat protein gene of a given virus species or for a group of viruses. To increase its usefulness further, DPVweb also contains a single representative sequence of all other fully sequenced virus species with an RNA or single-stranded DNA (ssDNA) genome. Sponsors: This site is supported by the Association of Applied Biologists and the Zhejiang Academy of Agricultural Sciences, Hangzhou, People''s Republic of China.
Proper citation: Descriptions of Plant Viruses (RRID:SCR_006656) Copy
http://bejerano.stanford.edu/phenotree/
Web server to search for genes involved in given phenotypic difference between mammalian species. The mouse-referenced multiple alignment data files used to perform the forward genomics screen is also available. The webserver implements one strategy of a Forward Genomics approach aiming at matching phenotype to genotype. Forward genomics matches a given pattern of phenotypic differences between species to genomic differences using a genome-wide screen. In the implementation, the divergence of the coding region of genes in mammals is measured. Given an ancestral phenotypic trait that is lost in independent mammalian lineages, it is shown that searching for genes that are more diverged in all trait-loss species can discover genes that are involved in the given phenotype.
Proper citation: Phenotree (RRID:SCR_003591) Copy
Catalog of published genome-wide association studies. Genome-wide set of genetic variants in different individuals to see if any variant is associated with trait and disease. Database of genome-wide association study (GWAS) publications including only those attempting to assay single nucleotide polymorphisms (SNPs). Publications are organized from most to least recent date of publication. Studies are identified through weekly PubMed literature searches, daily NIH-distributed compilations of news and media reports, and occasional comparisons with an existing database of GWAS literature (HuGE Navigator). Works with HANCESTRO ancestry representation.
Proper citation: GWAS: Catalog of Published Genome-Wide Association Studies (RRID:SCR_012745) Copy
The project began as a pilot study to identify inherited genetic susceptibility to prostate and breast cancer. CGEMS has developed into a robust research program involving genome-wide association studies (GWASs) for a number of cancers to identify common genetic variants that affect a person''s risk of developing cancer. In collaboration with extramural scientists, NCI''s Division of Cancer Epidemiology and Genetics (DCEG) has carried out genome-wide scans for breast, prostate, pancreatic, and lung cancers, while a GWAS of bladder cancer is currently underway. By making the data available to both intramural and extramural research scientists, as well as those in the private sector through rapid posting, NIH can leverage its resources to ensure that the dramatic advances in genomics are incorporated into rigorous population-based studies. Ultimately, findings from these studies may yield new preventive, diagnostic, and therapeutic interventions for cancer. Sponsors: This resource is supported by the U.S. National Institues Of Health.
Proper citation: CGEMS (RRID:SCR_008445) Copy
https://chordate.bpni.bio.keio.ac.jp/chordate/faba/1.4/top.html
Image resource including ascidian's three-dimensional (3D) and cross-sectional images through the developmental time course. These images were reconstructed from more than 3,000 high-resolution real images collected by confocal laser scanning microscopy (CLSM) at newly defined 26 distinct developmental stages (stages 1-26) from fertilized egg to hatching larva, which were grouped into six periods named the zygote, cleavage, gastrula, neurula, tailbud, and larva periods. The data set will be helpful in standardizing developmental stages for morphology comparison as well as for providing guidelines for several functional studies of a body plan in chordate.
Proper citation: Four-dimensional Ascidian Body Atlas (RRID:SCR_001691) Copy
http://www.worm.mpi-cbg.de/phenobank/cgi-bin/ProjectInfoPage.py
A database that provides primary data from two high-content screens that profile the set of ~900 essential C. elegans genes (~5% of the genome) required for embryo production and/or events during the first two embryonic divisions. Phenobank houses the movies, scored defects, and phenotypic classification data for the embryo-filming and gonad morphology screens.
Proper citation: PhenoBank (RRID:SCR_000930) Copy
Web application that helps design, evaluate and clone guide sequences for the CRISPR/Cas9 system. This sgRNA design tool assists with guide selection in a variety of genomes and pre-calculated results for all human coding exons as a UCSC Genome Browser track.
Proper citation: CRISPOR (RRID:SCR_015935) Copy
http://www.projects.roslin.ac.uk/sheepmap/front.html
THIS RESOURCE IS NO LONGER IN SERVICE, documented August 23, 2016. The project aims to apply genome mapping research to sheep, utilizing previous research in sheep (in other countries) and in other species (in the UK and abroad) to the benefit of the UK sheep industry. The project itself uses existing breeding structures, knowledge of the sheep genome and experimental resources. It has three main aims: i) To use the Suffolk, Texel and Charollais Sire Referencing Schemes to detect and verify quantitative trait loci (QTLs) for growth and carcass composition traits ii) To investigate candidate genes and/or chromosomal regions for associations with production traits. iii) To investigate approaches for optimizing future genotyping strategies within the sire referencing schemes for practical and cost effective application of marker-assisted selection By using commercial breeding populations for the research, immediate application of beneficial results is possible. Potential benefits include increased genetic progress through marker assisted selection which utilizes the genotype information, correction of possible parentage errors (ultimately leading to additional genetic progress) and opportunities for using marker information for product certification. The project will benefit the UK sheep industry by the use of Marker Assisted Selection (MAS) utilizing QTL or gene variants identified in the project. Additional benefits may arise from parentage verification and correction of errors e.g. misallocation of lamb to ewe. In the longer term, opportunities may exist to use markers for quality control, tracing products to their source. The major advantage of the design of this project is that the results are immediately applicable to the breeding schemes within which the QTLs and/or genes are detected. The time lag in the application of the results that is often seen with experimental populations is minimized. The project requires close involvement with the Sire Reference Schemes, in return for their assistance the results have immediate benefit to animals within these groups.
Proper citation: UK Sheep Genome Mapping Project (RRID:SCR_002272) Copy
http://www-sequence.stanford.edu/group/candida/
The Stanford Genome Technology Center began a whole genome shotgun sequencing of strain SC5314 of Candida albicans. After reaching its original goal of 1.5X mean coverage of the haploid genome (16Mb) in summer, 1998, Stanford was awarded a supplemental grant to continue sequencing up to a coverage of 10X, performing as much assembly of the sequence as possible, using recognizable genes as nucleation points. Candida albicans is one of the most commonly encountered human pathogens, causing a wide variety of infections ranging from mucosal infections in generally healthy persons to life-threatening systemic infections in individuals with impaired immunity. Oral and esophogeal Candida infections are frequently seen in AIDS patients. Few classes of drugs are effective against these fungal infections, and all of them have limitations with regard to efficacy and side-effects.
Proper citation: Sequencing of Candida Albicans (RRID:SCR_013437) Copy
Collaborative project to bring together biochemical pathway databases and research communities focused on plant metabolism. Used to build broad network of plant metabolic pathway databases. Central feature of PMN is PlantCyc, comprehensive plant biochemical pathway database, containing curated information from literature and computational analyses about genes, enzymes, compounds, reactions, and pathways involved in primary and secondary metabolism.
Proper citation: Plant Metabolic Network (RRID:SCR_002888) Copy
http://img.jgi.doe.gov/cgi-bin/m/main.cgi
Resource for analysis and annotation of genome and metagenome datasets in comprehensive comparative context. IMG provides users with tools for analyzing publicly available genome datasets and metagenome datasets.
Proper citation: IMG System (RRID:SCR_002965) Copy
http://lab.rockefeller.edu/casanova/HGC
Data set containing a gene-specific connectome file for each human gene and computer programs for ranking lists of genes within a gene-specific connectome, clustering and plotting the genes by the functional genomic alignment (FGA) approach, and generating gene-specific connectomes. The programs were developed and tested on Mac and Linux systems. The external software required for running these programs is open-source and free of charge. The HGC is the set of all biologically plausible routes, distances, and degrees of separation between all pairs of human genes. A gene-specific connectome contains the set of all available human genes sorted on the basis of their predicted biological proximity to the specific gene of interest. The HGC is a powerful approach for human genotype-phenotype high-throughput studies, for which it can be used to rank any list of genes within a gene-specific connectome for an experimentally validated core gene. Functional genomic alignment (FGA) is equivalent to traditional multiple sequence alignment (MSA), except that it clusters genes in trees on the basis of the functional biological distance between them predicted by HGC, rather than on the basis of molecular evolutionary genetic distance. This method is therefore more suitable for disease and phenotypic studies.
Proper citation: Human Gene Connectome (RRID:SCR_002628) Copy
http://www.dnastar.com/t-genvision.aspx
A genomic visualization application to support easy generation of publication quality graphics and maps. It produces high quality images of annotated genomes but it can also be customized to accentuate specific areas of interest, such as comparing gene functionality, illustrating gene expression levels, and visualizing the coverage in an assembled contig.
Proper citation: GenVision (RRID:SCR_001166) Copy
Can't find your Tool?
We recommend that you click next to the search bar to check some helpful tips on searches and refine your search firstly. Alternatively, please register your tool with the SciCrunch Registry by adding a little information to a web form, logging in will enable users to create a provisional RRID, but it not required to submit.
Welcome to the NIF Resources search. From here you can search through a compilation of resources used by NIF and see how data is organized within our community.
You are currently on the Community Resources tab looking through categories and sources that NIF has compiled. You can navigate through those categories from here or change to a different tab to execute your search through. Each tab gives a different perspective on data.
If you have an account on NIF then you can log in from here to get additional features in NIF such as Collections, Saved Searches, and managing Resources.
Here is the search term that is being executed, you can type in anything you want to search for. Some tips to help searching:
You can save any searches you perform for quick access to later from here.
We recognized your search term and included synonyms and inferred terms along side your term to help get the data you are looking for.
If you are logged into NIF you can add data records to your collections to create custom spreadsheets across multiple sources of data.
Here are the sources that were queried against in your search that you can investigate further.
Here are the categories present within NIF that you can filter your data on
Here are the subcategories present within this category that you can filter your data on
If you have any further questions please check out our FAQs Page to ask questions and see our tutorials. Click this button to view this tutorial again.