The Genomes On Line Database (GOLD) is a web resource for
September 30, 2017
The Genomes On Line Database (GOLD) is a web resource for comprehensive access to information regarding complete and ongoing genome sequencing projects worldwide.

[...] even Moore’s law for the increase of microprocessor computational power. We anticipate that, as new sequencing technologies such as pyrosequencing (3) are introduced and the cost of existing technologies continues to decline, the number of genome sequences will continue to grow exponentially, and the number of research groups able to contribute genome sequences will also increase dramatically. Therefore, the need for a searchable database that comprehensively tracks genome projects, helps guide the selection of new projects and provides up-to-date overview statistics will only increase.

CURRENT STATUS OF THE DATABASE

Published complete genomes

From 350 projects at the time of its previous report (2), GOLD has grown to provide information for 1575 genome projects worldwide today. Almost 300 of those projects are currently reported as completed, with their sequences submitted to public databases. These are reported in GOLD as Published Complete Genomes. A genome publication is not always available in the literature for these projects, as submitters quite often choose to release their sequence data to the community prior to publication. Of the 297 complete and published genome projects, 235 are bacterial, 23 are archaeal and 39 are eukaryotic.

Ongoing genome projects

In addition to the completed projects, there are currently 1263 ongoing sequencing projects. Of those, 697 are bacterial, 38 are archaeal and 526 are eukaryotic. The latter include 208 EST and 10 RST projects, in addition to 308 genome projects. These can be retrieved with GOLD’s search engine by selecting EST, RST or Genome in the relevant field. Of the 1263 ongoing projects, 114 are also considered complete at this point, i.e. the sequencing phase has been completed but the data have not yet been submitted to the public sequence repositories. These can be retrieved with the search engine by selecting Complete Unpublished in the relevant field.

GOLD is not limited to providing information on sequencing projects whose results will become publicly available at some point in the future. Rather, it seeks out and displays all publicly reported projects, whether the actual data will eventually become public or remain proprietary. It is our hope that this will better serve researchers, agencies and sequencing centres in selecting new projects or identifying sources of existing ones. These projects can be retrieved by selecting Proprietary in the relevant field of the Search page. GOLD currently has information on the sequencing of 56 proprietary genome projects running at various private companies. Usually, only the information about the sequencing project itself has been made available in these cases. A total of 25 such projects are also considered completed.

Sequencing is currently performed at a wide variety of sequencing centres and supported by a variety of funding sources, and the analysis is presented in many different databases. As displayed in the indexing link of the database, GOLD reports sequencing projects from 566 sequencing centres, funded by 186 agencies, and links to 427 distinct databases that provide sequence data analysis and information for the above genome projects.

NEW DEVELOPMENTS

Since the last report (2), a number of additional data fields have been added to the database. These include new data fields available in the project tables, as well as in the search engine. The project tables now have the following additional fields:

(i) [...] field. Searching or browsing can also be performed within the Goldstamp IDs through the related search field.

(ii) [...] information, which provides the name and contact information of the PI responsible for a given project.
All these fields are available for either search or browsing through the search engine. The most important new development in the database is the addition of new data types relevant to the.
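
The retrievals described in this section (by project type, by completion status, or by availability) amount to filtering project records on one or more fields. The following is a minimal sketch of that idea only, not GOLD's actual search engine: the `Project` record, its field names, the `select` helper, the sample records, and every status value other than "Complete Unpublished" are assumptions made for illustration.

```python
from dataclasses import dataclass


# Hypothetical record layout: these field names are assumptions,
# not GOLD's actual schema.
@dataclass(frozen=True)
class Project:
    name: str
    project_type: str   # "Genome", "EST" or "RST"
    status: str         # e.g. "Complete Unpublished"; other values are assumed
    availability: str   # "Public" or "Proprietary" (labels assumed)


def select(projects, **criteria):
    """Return the projects whose attributes equal every given criterion."""
    return [
        p for p in projects
        if all(getattr(p, field) == value for field, value in criteria.items())
    ]


# Made-up example records, for illustration only.
projects = [
    Project("organism A", "Genome", "Complete Published", "Public"),
    Project("organism B", "EST", "Incomplete", "Public"),
    Project("organism C", "Genome", "Complete Unpublished", "Public"),
    Project("organism D", "Genome", "Incomplete", "Proprietary"),
]

est_projects = select(projects, project_type="EST")
unpublished = select(projects, status="Complete Unpublished")
proprietary = select(projects, availability="Proprietary")
```

Criteria can be combined in a single call, for example `select(projects, project_type="Genome", availability="Public")`, which mirrors choosing values in several search fields at once.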